Welcome to Scribd!

Computing Average Log Likelihood

Uploaded by

0% found this document useful (0 votes)

4 views3 pages

The document discusses log-likelihood, which is a measure of how well a probabilistic model predicts correct outcomes. Log-likelihood is computed by taking the average of the log of the predicted probability for the correct category over many examples. While the value for individual examples varies, averaging over many examples provides a useful summary statistic for comparing models.

Original Description:

Original Title

computing-average-log-likelihood

Copyright

Available Formats

PDF, TXT or read online from Scribd

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Report this Document

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Flag for inappropriate content

0% found this document useful (0 votes)

4 views3 pages

Computing Average Log Likelihood

Uploaded by

kitisakp99

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Flag for inappropriate content

Jump to Page

You are on page 1of 3

Search inside document

1

Mahout in Action
Sean Owen, Robin Anil, Ted Dunning, and Ellen Friedman

In this article, Mahout in Action author explain log-likelihood, a probability score

measure that can be used with multiple target categories.

To save 35% on your next purchase use Promotional Code owen1535 when you check
out at www.manning.com.

Computing Average Log-Likelihood

When we have a model that produces probability score outputs, the log of the score for the correct answer is a
useful measure of how good the model is. When this value is averaged over a large number of held-out examples,
you get the measure that is known as the average log-likelihood. This term is often abbreviated as log-likelihood,
at the risk of some confusion.
Log-likelihood is a useful contrast with AUC (area under the receiver operating characteristic curve). Log-
likelihood can be used with multiple target categories, while AUC is limited to binary outputs. AUC, on the other
hand, can accept any kind of score, while log-likelihood requires scores that reflect probability estimates. AUC has
an absolute scale from 0 to 1 that can be used to compare different models while the scale for log-likelihood is
open ended.
The value of log-likelihood for a single training example is not quite useful because it varies so much from
example to example, but the average over a number of examples is useful. OnlineSummarizer can average the
log-likelihood estimates from a classifier and summarize the estimates using median, mean, and upper and lower
quartiles like this:
OnlineSummarizer summarizeLogLikelihood = new OnlineSummarizer();
for (int i = 0; i < 1000; i++) {
Example x = Example.readExample(); #1
double ll = classifier.logLikelihood(x.target, x.features); #2
summarizeLogLikelihood.add(ll); #3
}

System.out.printf(
"Average log-likelihood = %.2f (%.2f, %.2f) (25%%-ile,75%%-ile)\n",
summarizeLogLikelihood.getMean(),
summarizeLogLikelihood.getQuartile(1), #4
summarizeLogLikelihood.getQuartile(2));
#1 Gets example
#2 Computes log-likelihood
#3 Adds to summary
#4 Computes and prints statistics
In this code, examples are read (#1) and a classifier computes the log likelihood (#2) which is then passed to an
OnlineSummarizer (#3). After the input has been read, this summarize can be queried (#4) to get descriptive
statistics for the distribution of log-likelihood.
Log-likelihood has a maximum value of zero and no bound on how negative it can go. For highly accurate
classifiers, the value of average log-likelihood should be close to the average percent correct for the classifier
multiplied by the number of target categories. For less accurate classifiers, especially those with many target
categories, the average percent correct tends to go to zero, making it difficult to compare classifiers. The log-

For source code, sample chapters, the Online Author Forum, and other resources, go to
http://www.manning.com/owen/
2

likelihood, on the other hand, can still distinguish better from worse classifiers even when the classifiers being
compared are rarely or even never exactly correct, which makes the average percent correct zero.
For internal developer audiences, log-likelihood is probably a better measure for comparing models but, when
presenting the results to a less technically sophisticated audience, percentage correct or even a confusion matrix is
probably going to work better to convey your results. Metrics like AUC and log-likelihood give us a picture of which
models perform better or worse overall.

Summary
We discussed log-likelihood, which is computed by taking the average of the log of the score for the target
category.

For source code, sample chapters, the Online Author Forum, and other resources, go to
http://www.manning.com/owen/
3

Here are some other Manning titles you might be interested in:

Hadoop in Action
Chuck Lam

The Cloud at Your Service

Jothy Rosenberg and Arthur Mateos

Real-World Functional Programming

Tomas Petricek with Jon Skeet

Last updated: May 20, 2011

For source code, sample chapters, the Online Author Forum, and other resources, go to
http://www.manning.com/owen/

Lesson Plan ABRACADABRA
Document9 pages
Lesson Plan ABRACADABRA
Adelina Hasas
No ratings yet
Process Point Analysis
Document8 pages
Process Point Analysis
kitisakp99
0% (1)
Probability Distributions Summary - Exam P
Document1 page
Probability Distributions Summary - Exam P
roy_getty
No ratings yet
McMillan 2007. Fish - Histology PDF
Document603 pages
McMillan 2007. Fish - Histology PDF
Marcela Mesa
100% (1)
Uswah
Document4 pages
Uswah
Sawera Bibi
No ratings yet
Social Media Mining
Document10 pages
Social Media Mining
Hari Atharsh
No ratings yet
Book Based Question
Document2 pages
Book Based Question
Angel Leo
No ratings yet
11 Most Common Machine Learning Algorithms Explained in A Nutshell by Soner Yıldırım Towards Data Science
Document16 pages
11 Most Common Machine Learning Algorithms Explained in A Nutshell by Soner Yıldırım Towards Data Science
Dheeraj Sonkhla
No ratings yet
Definition of Terms
Document3 pages
Definition of Terms
Jane Rob
No ratings yet
How To Use ROC Curves and Precision-Recall Curves For Classification in Python
Document47 pages
How To Use ROC Curves and Precision-Recall Curves For Classification in Python
Adrian Stan
No ratings yet
FAI - ch5 - Remaining Topics
Document3 pages
FAI - ch5 - Remaining Topics
patelhemv1143
No ratings yet
Performance Metrics Deep Dive
Document7 pages
Performance Metrics Deep Dive
kamel Mahdy
No ratings yet
Calibration of Software Quality: Fuzzy Neural and Rough Neural Computing Approaches
Document4 pages
Calibration of Software Quality: Fuzzy Neural and Rough Neural Computing Approaches
jatin1001
No ratings yet
(Miranda Et Al. 2019) Sizing User Stories Using Paired Comparisons
Document26 pages
(Miranda Et Al. 2019) Sizing User Stories Using Paired Comparisons
Edouard_Carvalho
No ratings yet
Malignant Comments Classifier Project
Document30 pages
Malignant Comments Classifier Project
Saranya M
No ratings yet
Regression in R & Python Paper
Document38 pages
Regression in R & Python Paper
desaiha
No ratings yet
DL Unit-2
Document24 pages
DL Unit-2
Kalpana M
No ratings yet
Data Science Interview Guide
Document23 pages
Data Science Interview Guide
Mary Koko
No ratings yet
(IJCST-V9I3P23) :aditi Linge, Bhavya Malviya, Digvijay Raut, Payal Ekre
Document3 pages
(IJCST-V9I3P23) :aditi Linge, Bhavya Malviya, Digvijay Raut, Payal Ekre
EighthSenseGroup
No ratings yet
Project Lit Final1
Document15 pages
Project Lit Final1
poornima
No ratings yet
Statlect: Log-Likelihood
Document6 pages
Statlect: Log-Likelihood
Petros Piano
No ratings yet
SVM Classification Thesis
Document7 pages
SVM Classification Thesis
debrapereaalbuquerque
100% (1)
Important Tems
Document61 pages
Important Tems
komala
No ratings yet
NTAL Report PDF
Document17 pages
NTAL Report PDF
Tushar Chotlani
No ratings yet
Report
Document11 pages
Report
J Manuel Bueno
No ratings yet
Cocomo-William Roetzheim PDF
Document6 pages
Cocomo-William Roetzheim PDF
Alex
No ratings yet
All Slide
Document522 pages
All Slide
waqar jutt
No ratings yet
Influential Vocabulary Detection
Document15 pages
Influential Vocabulary Detection
api-263491997
No ratings yet
A Guide To Model Selection For Survival Analysis
Document26 pages
A Guide To Model Selection For Survival Analysis
Debela Nanesso
No ratings yet
Final Assessment MBAS901-Sabina.K
Document9 pages
Final Assessment MBAS901-Sabina.K
Sabina
No ratings yet
An Awesome Tutorial To Learn Outlier Detection in Python Using The Pyod Library
Document23 pages
An Awesome Tutorial To Learn Outlier Detection in Python Using The Pyod Library
Twitter Content You Cannot Miss on YouTube
No ratings yet
Python Tytorial PDF
Document23 pages
Python Tytorial PDF
ankit
No ratings yet
Learn Outlier Detection in Python PyOD Library 1566237490
Document23 pages
Learn Outlier Detection in Python PyOD Library 1566237490
Douglas Souza
No ratings yet
Report Sentiment Analysis Marcos Matheus
Document12 pages
Report Sentiment Analysis Marcos Matheus
jfernandesj13
No ratings yet
Effort Estimation in Agile Software Development A Systematic Literature Review
Document6 pages
Effort Estimation in Agile Software Development A Systematic Literature Review
afdtjozlb
No ratings yet
Superagent: A Customer Service Chatbot For E-Commerce Websites
Document4 pages
Superagent: A Customer Service Chatbot For E-Commerce Websites
Abdullah Mapari
No ratings yet
Transcript: Summary
Document2 pages
Transcript: Summary
AYUSH ALTERNATE
No ratings yet
Credit Card Fraud Analysis Ashutosh
Document3 pages
Credit Card Fraud Analysis Ashutosh
Hemang Khandelwal
No ratings yet
(Aboque) Finalproject Mathelec01
Document11 pages
(Aboque) Finalproject Mathelec01
Pj Vale
No ratings yet
RD1814A28
Document15 pages
RD1814A28
Vivek
No ratings yet
Modelling and Error Analysis
Document8 pages
Modelling and Error Analysis
Atmuri Ganesh
No ratings yet
Data Mining by Evolutionary Learning (DMEL) Using HBase
Document46 pages
Data Mining by Evolutionary Learning (DMEL) Using HBase
Abhijan Carter Biswas
No ratings yet
ML Terminologies PDF
Document44 pages
ML Terminologies PDF
Kapildev Kumar
No ratings yet
Merge +1
Document107 pages
Merge +1
SHIKHA SHARMA
No ratings yet
4227 GUI Ebook Data Science Interview Guide
Document25 pages
4227 GUI Ebook Data Science Interview Guide
olabisiesther83
No ratings yet
Project Paper
Document5 pages
Project Paper
api-521345502
No ratings yet
A Soft Introduction To NLP - Semantic Similarity Calculations Using Python - Medium
Document13 pages
A Soft Introduction To NLP - Semantic Similarity Calculations Using Python - Medium
Tridiv
No ratings yet
Data Science Intervieew Questions
Document16 pages
Data Science Intervieew Questions
Satyam Anand
100% (1)
Basic Interview Q's On ML PDF
Document243 pages
Basic Interview Q's On ML PDF
sourajit roy chowdhury
100% (2)
Maintainability Index Revisited
Document3 pages
Maintainability Index Revisited
MartijnvanderMaas
No ratings yet
Acquisition Analytics Assignment
Document15 pages
Acquisition Analytics Assignment
Harshad Ambekar
No ratings yet
PA Simple BN Knowledge Engineering
Document4 pages
PA Simple BN Knowledge Engineering
Hans Wu
No ratings yet
Ac Reily 3
Document11 pages
Ac Reily 3
Rafsan Al- Amin
No ratings yet
Receiver Operator Characteristic
Document5 pages
Receiver Operator Characteristic
EngSafwanQadous
No ratings yet
AWS Certified Machine Learning Specialty Exam Guide
Document7 pages
AWS Certified Machine Learning Specialty Exam Guide
Deepak Gupta
No ratings yet
NN - G Quantitative UX Glossary
Document5 pages
NN - G Quantitative UX Glossary
Jorge A G Torres
No ratings yet
AWS Certified Machine Learning Specialty Exam Guide
Document7 pages
AWS Certified Machine Learning Specialty Exam Guide
Deepak Gupta
No ratings yet
3 - Loss Functions
Document14 pages
3 - Loss Functions
debasis nag
No ratings yet
Credit Risk Analysis
Document6 pages
Credit Risk Analysis
Subangkit Ramadiputra
No ratings yet
Project 2 Factor Hair Revised Case Study
Document25 pages
Project 2 Factor Hair Revised Case Study
rishit
No ratings yet
Machine Learning Questions
Document13 pages
Machine Learning Questions
Anwarul Bashir Shuaib
No ratings yet
IEA-02 Operations Research
Document52 pages
IEA-02 Operations Research
Sagar Shinde
No ratings yet
Computer Basics Document
Document27 pages
Computer Basics Document
Aruneswarvh Arun
No ratings yet
TouchCode Class 8
From Everand
TouchCode Class 8
Team Orange
No ratings yet
Ebook MS Thailand
Document16 pages
Ebook MS Thailand
kitisakp99
No ratings yet
ชุมนุมนิทานสอนใจ
Document57 pages
ชุมนุมนิทานสอนใจ
kitisakp99
No ratings yet
เขาวงกต
Document8 pages
เขาวงกต
kitisakp99
No ratings yet
Abraham Maslow
Document12 pages
Abraham Maslow
kitisakp99
No ratings yet
Unauthenticated Download Date - 1/31/17 4:04 AM
Document5 pages
Unauthenticated Download Date - 1/31/17 4:04 AM
Fahmi Razi
No ratings yet
E38 Closed Circuit Current Test
Document4 pages
E38 Closed Circuit Current Test
CezaryCezas
No ratings yet
Example 10 58
Document104 pages
Example 10 58
christian dawit
No ratings yet
Issue 14
Document20 pages
Issue 14
The Muse is rad
No ratings yet
B.tech 3 Cse Syllabus
Document65 pages
B.tech 3 Cse Syllabus
Vijayendra Goud
No ratings yet
0332713
Document200 pages
0332713
JS DUFFEY
No ratings yet
NWPC Annual Report 2014
Document56 pages
NWPC Annual Report 2014
lupethe3
No ratings yet
SK 2015 Q PDF
Document4 pages
SK 2015 Q PDF
Atlas Ernhardt
No ratings yet
Word 2016, Using Mail Merge
Document6 pages
Word 2016, Using Mail Merge
Pelah Wowen Daniel
No ratings yet
100 Patient Names
Document1 page
100 Patient Names
Ted anadilo
No ratings yet
Higher Education Loans Board: Loan Disbursement Report
Document2 pages
Higher Education Loans Board: Loan Disbursement Report
Edward Kalvis
No ratings yet
Minerals
Document40 pages
Minerals
Ghillian Mae Guiang
No ratings yet
Herniated Nucleus Pulposus
Document12 pages
Herniated Nucleus Pulposus
shezar
No ratings yet
7th Circular Motion Solutions
Document60 pages
7th Circular Motion Solutions
Ajay D Kumar
No ratings yet
Project Charter and Stakeholder Template
Document15 pages
Project Charter and Stakeholder Template
Atul Patil
No ratings yet
Seam 325 Prelim Exam Reviewer
Document6 pages
Seam 325 Prelim Exam Reviewer
Kyle Steven Cayme
No ratings yet
Algorithm Exam Help
Document9 pages
Algorithm Exam Help
Programming Exam Help
No ratings yet
Aileen Flores Lesson Plan 2021
Document8 pages
Aileen Flores Lesson Plan 2021
Sissay Semblante
No ratings yet
LP Grade 8 Molave
Document19 pages
LP Grade 8 Molave
Blessing Shielo Vicente
No ratings yet
Three-Quater Face Schematics PDF
Document20 pages
Three-Quater Face Schematics PDF
Schiteanu Claudiu
No ratings yet
Csec It Mock Exam
Document10 pages
Csec It Mock Exam
vidur_talreja
100% (1)
Development Letter
Document3 pages
Development Letter
tan balan
No ratings yet
IA Economics
Document3 pages
IA Economics
Elisa Elisa
No ratings yet
Gaap Quiz
Document3 pages
Gaap Quiz
Shadab Khan
No ratings yet
Independent Contractor Agreement For Accountant & Bookkeeper
Document7 pages
Independent Contractor Agreement For Accountant & Bookkeeper
Wen' George Bey
No ratings yet
Cantor Set Function
Document15 pages
Cantor Set Function
Renato Galois
No ratings yet
Experiment No.1. (Monograph)
Document3 pages
Experiment No.1. (Monograph)
ayeza.sarwar2021
No ratings yet