
Instructions: Create any confusion matrix by inputting values for cells labelled a, c, and e. The spreadsheet outputs entropy (information) measures for all relevant distributions.

Confusion Matrix

                                        Test Classification Y
                                        [optical scanner on assembly line]
                                        "Positive"        "Negative"
Condition X                              0.3  (c)          0.7  (d)
[defective computer chip]
  "+"   0.2  (a)                         0.1  (e)          0.1  (f)
  "-"   0.8  (b)                         0.2  (g)          0.6  (h)
Individual Probabilities                 Name

p("+")            a    Incidence of Condition "+"
p("-")            b    Incidence of Condition "-"
p(Test POS)       c    Classification Incidence "POS"
p(Test NEG)       d    Classification Incidence "NEG"
p(Test POS, "+")  e    True Positives
p(Test NEG, "+")  f    False Negatives
p(Test POS, "-")  g    False Positives
p(Test NEG, "-")  h    True Negatives

Probability Distributions                        Name
P(X)        p(a, b)              Probability of the Condition
P(Y)        p(c, d)              Probability of the Classification
P(X,Y)      p(e, f, g, h)        Joint Distribution of X and Y
P(X)P(Y)    p(ac, ad, bc, bd)    Product Distribution of X and Y

Definition of Independence:  P(X,Y) = P(X)P(Y)


X, Y Independent or Dependent?
Dependent: the joint distribution p(e, f, g, h) = (0.1, 0.1, 0.2, 0.6) does not equal the product distribution (0.06, 0.14, 0.24, 0.56).
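A small Python check of the independence definition above, assuming the joint and product values from the worksheet:

```python
# Sketch: compare the joint distribution p(X,Y) with the product distribution
# p(X)p(Y); the two differ, so X and Y are dependent.
a, b = 0.2, 0.8                          # P(X) = p(a, b)
c, d = 0.3, 0.7                          # P(Y) = p(c, d)
joint   = [0.1, 0.1, 0.2, 0.6]           # p(e, f, g, h)
product = [a*c, a*d, b*c, b*d]           # p(ac, ad, bc, bd)

print([round(x, 2) for x in product])    # [0.06, 0.14, 0.24, 0.56]
independent = all(abs(p - q) < 1e-12 for p, q in zip(joint, product))
print("Independent" if independent else "Dependent")   # Dependent
```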

Conditional Probabilities                  Name
p(Test POS | "+")    e/a    0.50    True Positive Rate
p(Test NEG | "+")    f/a    0.50    False Negative Rate
p(Test POS | "-")    g/b    0.25    False Positive Rate
p(Test NEG | "-")    h/b    0.75    True Negative Rate

p("+" | Test POS)    e/c    0.33    Positive Predictive Value (PPV)
p("-" | Test POS)    g/c    0.67    1 - PPV
p("+" | Test NEG)    f/d    0.14    1 - NPV
p("-" | Test NEG)    h/d    0.86    Negative Predictive Value (NPV)

"Relative Entropy" of p and q, written D(p||q)


is the summation of all p(i)*log(p(i)/qIi)
It is also called "Kullback-Leibler Divergence" (or "KL Divergence" for sh
The Relative Entropy of the Joint distribution p [row 28] and the produ
[Note that this definition is not required for Course - advanced topic]
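As an informal illustration (not part of the worksheet), a generic KL-divergence helper in Python applied to the joint and product distributions reproduces the mutual information reported below:

```python
# Sketch: relative entropy (KL divergence) in bits, applied to the joint and
# product distributions from the worksheet; the result is I(X;Y).
from math import log2

def kl_divergence(p, q):
    """D(p||q) = sum_i p(i) * log2(p(i) / q(i)), in bits."""
    return sum(pi * log2(pi / qi) for pi, qi in zip(p, q) if pi > 0)

joint   = [0.10, 0.10, 0.20, 0.60]   # p(e, f, g, h)
product = [0.06, 0.14, 0.24, 0.56]   # p(ac, ad, bc, bd)
print(round(kl_divergence(joint, product), 4))   # 0.0323
```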

Entropy (Information) Measures

H(X)      0.7219
H(Y)      0.8813
H(X,Y)    1.5710
H(Y|X)    0.8490
H(X|Y)    0.6897
I(X;Y)    0.0323    Mutual Information = Relative Entropy of Joint and Product Distributions, D(p(X,Y) || p(X)p(Y))
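For reference, the standard textbook definitions these cells implement, written out in LaTeX (base-2 logarithms, values in bits); the cell-level formulas appear further below:

```latex
\begin{align*}
H(X)        &= \sum_x p(x)\,\log_2\frac{1}{p(x)}
             = a\log_2\tfrac{1}{a} + b\log_2\tfrac{1}{b} \\
H(X,Y)      &= \sum_{x,y} p(x,y)\,\log_2\frac{1}{p(x,y)} \\
H(Y \mid X) &= \sum_x p(x)\, H(Y \mid X = x) \\
I(X;Y)      &= D\bigl(p(X,Y)\,\|\,p(X)p(Y)\bigr)
             = H(X) - H(X \mid Y) = H(Y) - H(Y \mid X)
\end{align*}
```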

(or "KL Divergence" for short)


p [row 28] and the product distribution q [row 29] is the mutual information [cell L35]
Course - advanced topic]
The spreadsheet outputs entropy (information) measures for all relevant distributions

Percentage Information Gain (P.I.G.)

A correlation measure defined as the mutual information between X and Y
divided by the entropy of the Condition X.

I(X;Y)      0.0323 bits
divided by
H(X)        0.7219 bits
equals
4.47%

Average reduction in uncertainty of one outcome in X upon learning one outcome in Y.
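A one-line sanity check of the P.I.G. arithmetic in Python (values copied from the cells above):

```python
# Sketch: Percentage Information Gain as defined above, I(X;Y) / H(X).
I_xy, H_x = 0.0323, 0.7219            # values from the worksheet
print(f"P.I.G. = {I_xy / H_x:.2%}")   # 4.47%
```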

Entropy calculations (all logs base 2, values in bits)

H(X)   = a*log(1/a) + b*log(1/b)
       = 0.4644 + 0.2575                       = 0.7219

H(Y)   = c*log(1/c) + d*log(1/d)
       = 0.5211 + 0.3602                       = 0.8813

H(X,Y) = e*log(1/e) + f*log(1/f) + g*log(1/g) + h*log(1/h)
       = 0.3322 + 0.3322 + 0.4644 + 0.4422     = 1.5710

H(Y|X) = a*H(e/a, f/a) + b*H(g/b, h/b)
       = 0.2000*1.0000 + 0.8000*0.8113         = 0.8490

H(X|Y) = c*H(e/c, g/c) + d*H(f/d, h/d)
       = 0.3000*0.9183 + 0.7000*0.5917         = 0.6897

Mutual Information identities
I(X;Y) = H(X) - H(X|Y)         = 0.7219 - 0.6897            = 0.0323
I(X;Y) = H(Y) - H(Y|X)         = 0.8813 - 0.8490            = 0.0323
I(X;Y) = H(X) + H(Y) - H(X,Y)  = 0.7219 + 0.8813 - 1.5710   = 0.0323

Mutual Information I(X;Y) = Relative Entropy of Joint and Product Distributions = D(p(X,Y) || p(X)p(Y))

Joint p(X,Y):       e  = 0.10    f  = 0.10    g  = 0.20    h  = 0.60
Product p(X)p(Y):   ac = 0.06    ad = 0.14    bc = 0.24    bd = 0.56

I(X;Y) = e*log(e/ac) + f*log(f/ad) + g*log(g/bc) + h*log(h/bd)
       = 0.0737 - 0.0485 - 0.0526 + 0.0597     = 0.0323
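A self-contained Python sketch (the helper name H is an arbitrary choice) that recomputes every measure above from the cell values and confirms that the three mutual-information identities agree:

```python
# Sketch: recompute the entropy measures from the worksheet cells and verify
# the three mutual-information identities (all values in bits).
from math import log2

def H(*probs):
    """Entropy of a distribution given as positive probabilities."""
    return sum(p * log2(1 / p) for p in probs if p > 0)

a, b, c, d = 0.2, 0.8, 0.3, 0.7
e, f, g, h = 0.1, 0.1, 0.2, 0.6

H_X  = H(a, b)                                     # 0.7219
H_Y  = H(c, d)                                     # 0.8813
H_XY = H(e, f, g, h)                               # 1.5710
H_Y_given_X = a * H(e/a, f/a) + b * H(g/b, h/b)    # 0.8490
H_X_given_Y = c * H(e/c, g/c) + d * H(f/d, h/d)    # 0.6897

I1 = H_X - H_X_given_Y                             # 0.0323
I2 = H_Y - H_Y_given_X                             # 0.0323
I3 = H_X + H_Y - H_XY                              # 0.0323
print([round(v, 4) for v in (H_X, H_Y, H_XY, H_Y_given_X, H_X_given_Y)])
print([round(v, 4) for v in (I1, I2, I3)])         # [0.0323, 0.0323, 0.0323]
```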

Copyright Daniel Egger / Attribution 4.0 International (CC BY 4.0)
Venn diagram courtesy of Konrad Voelkel - Wikipedia: https://en.wikipedia.org/wiki/Information_diagram
