
EE769 Intro to ML

Density Estimation and Sampling


Amit Sethi
Faculty member, IIT Bombay
Learning objectives

• List applications of density estimation

• Estimate sufficient statistics of some distributions

• Test goodness of fit of distributions

• Write EM algorithm for mixture of Gaussians

• Fit a kernel density estimator on given samples

• List methods to generate samples from a distribution

Why estimate densities

• Perform basic statistical tests

• Identify outliers

• Use Bayesian methods

• Generate new samples

Parametric Density Estimation

• Visualize data

• Shortlist candidate distributions

• Estimate distribution parameters

• Check fit or likelihood

Likelihood maximization and sufficient statistics
• A statistic is a function of the data; a sufficient statistic summarizes everything the sample tells us about a distribution's parameters, so the MLE can be computed from it alone

Fitting a Gaussian distribution
• p(x) = (2π σ²)^(−1/2) exp(−(x−μ)² / 2σ²)
• E[X] = μ
• E[(X−μ)²] = σ²
• Multivariate: p(x) = ((2π)^d |C|)^(−1/2) exp(−(x−μ)^T C⁻¹ (x−μ) / 2)
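For the Gaussian, the sample mean and the (biased) sample covariance are the maximum-likelihood estimates. A minimal sketch on synthetic data; the true parameters and the seed below are arbitrary choices for the example:

```python
import numpy as np

rng = np.random.default_rng(0)
x = rng.normal(loc=2.0, scale=1.5, size=10_000)  # synthetic 1-D data

# Univariate MLE: sample mean and (biased) sample variance
mu_hat = x.mean()
sigma2_hat = ((x - mu_hat) ** 2).mean()

# Multivariate MLE: mean vector and covariance matrix C
X = rng.multivariate_normal(mean=[0.0, 1.0],
                            cov=[[2.0, 0.3], [0.3, 1.0]], size=10_000)
mu_vec = X.mean(axis=0)
C_hat = (X - mu_vec).T @ (X - mu_vec) / len(X)
```

The estimates land close to the generating parameters (μ = 2, σ² = 2.25 in 1-D).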

Fitting an exponential distribution
• p(x) = λ e^(−λx), if x ≥ 0
• E[X] = 1/λ, so the MLE is λ̂ = 1 / (sample mean)
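A one-line fit via the sufficient statistic (the sample mean); the true rate λ = 0.5 and the seed are illustrative choices:

```python
import numpy as np

rng = np.random.default_rng(1)
x = rng.exponential(scale=2.0, size=10_000)  # scale = 1/λ, so true λ = 0.5

# MLE: λ̂ = 1 / sample mean, since E[X] = 1/λ
lam_hat = 1.0 / x.mean()
```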

Fitting a uniform distribution
• p(x) = 1/(b−a), if a ≤ x ≤ b
• MLE: â = min xi, b̂ = max xi

Comparing two distributions using a Q-Q plot
• Plot actual data quantiles against theoretical quantiles (of any candidate distribution)
• If the distributions match, the points fall near the line y = x

[Figure: example Q-Q plot. Image: Skbkekas, CC BY-SA 3.0 <https://creativecommons.org/licenses/by-sa/3.0>, via Wikimedia Commons]
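The quantile comparison behind a Q-Q plot can be computed without plotting: sort the data and pair it with theoretical quantiles at the plotting positions (i − 0.5)/N. The standard-normal target and sample size are assumptions for this sketch:

```python
import numpy as np
from statistics import NormalDist

rng = np.random.default_rng(2)
x = np.sort(rng.normal(size=1000))  # sorted sample = empirical quantiles

# Theoretical standard-normal quantiles at plotting positions (i - 0.5)/N
probs = (np.arange(1, len(x) + 1) - 0.5) / len(x)
theo_q = np.array([NormalDist().inv_cdf(p) for p in probs])

# For well-matched distributions the paired quantiles are nearly collinear
corr = np.corrcoef(theo_q, x)[0, 1]
```

Plotting `x` against `theo_q` gives the usual Q-Q picture; a correlation near 1 indicates a good fit.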


Mixture of Gaussians for multimodal density
• Assume a mixture of K Gaussians
• Parameters Θ: { αk, μk, Ck }, k = 1…K, with mixing weights αk ≥ 0 and ∑k αk = 1

EM Algorithm for MoG
• Iterate until convergence:
• Expectation: keeping Θ fixed, estimate memberships
• wik = αk pk(xi) / ∑j αj pj(xi)
• Maximization: keeping wik fixed, find the MLE of Θ, i.e.,
• αk = Nk / N, where Nk = ∑i wik
• μk = (1/Nk) ∑i wik xi
• Ck = (1/Nk) ∑i wik (xi − μk)(xi − μk)^T
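The E and M updates above can be sketched in one dimension; the two synthetic clusters, initial guesses, and fixed iteration count are all illustrative choices:

```python
import numpy as np

rng = np.random.default_rng(3)
# Two 1-D Gaussian clusters at -2 and 3
x = np.concatenate([rng.normal(-2, 0.5, 500), rng.normal(3, 1.0, 500)])

K, N = 2, len(x)
alpha = np.full(K, 1.0 / K)    # mixing weights α_k
mu = np.array([-1.0, 1.0])     # initial means
var = np.ones(K)               # initial variances

for _ in range(100):
    # E-step: responsibilities w_ik ∝ α_k N(x_i; μ_k, σ_k²)
    p = alpha * np.exp(-(x[:, None] - mu) ** 2 / (2 * var)) / np.sqrt(2 * np.pi * var)
    w = p / p.sum(axis=1, keepdims=True)
    # M-step: weighted MLE updates
    Nk = w.sum(axis=0)
    alpha = Nk / N
    mu = (w * x[:, None]).sum(axis=0) / Nk
    var = (w * (x[:, None] - mu) ** 2).sum(axis=0) / Nk
```

After convergence the means recover the cluster centers and the weights sum to one.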

Non-parametric density estimation
• Kernel density estimation, a.k.a. Parzen window
• p̂(x) = (1/Nh) ∑i K( (x − xi) / h ), where K is a kernel (e.g., Gaussian) and h is the window width

Finding optimal window size
• Minimize the mean integrated squared error (MISE) of the estimated density

• Rules of thumb (Silverman):
• h = 1.06 σ N^(−1/5)
• h = 0.9 min(σ, IQR/1.34) N^(−1/5)
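The second rule of thumb and a Gaussian-kernel estimator can be sketched as follows; the standard-normal data and evaluation point are assumptions for the example:

```python
import numpy as np

rng = np.random.default_rng(4)
x = rng.normal(size=1000)  # synthetic standard-normal sample

# Bandwidth rule of thumb: h = 0.9 min(σ, IQR/1.34) N^(-1/5)
iqr = np.subtract(*np.percentile(x, [75, 25]))
h = 0.9 * min(x.std(), iqr / 1.34) * len(x) ** (-1 / 5)

def kde(t, data, h):
    """Gaussian-kernel density estimate evaluated at points t."""
    u = (t[:, None] - data) / h
    return np.exp(-0.5 * u ** 2).sum(axis=1) / (len(data) * h * np.sqrt(2 * np.pi))

dens = kde(np.array([0.0]), x, h)  # estimate near the true peak 1/√(2π) ≈ 0.399
```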

Generating a sample from a distribution
• Why generate samples?
• Approximate expectations (Monte Carlo)
• Augment data
• If the CDF is “simple” (invertible in closed form)
• Generate a pseudo-random uniform number
• Transform it through the inverse CDF
• X = F^(−1)(U), where F is the desired CDF and U is a standard uniform RV

• Factorize multivariate distributions and sample each factor in turn (ancestral sampling)


• Sample from a proposal distribution and filter
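The inverse-CDF recipe above can be sketched for an exponential target, whose CDF F(x) = 1 − e^(−λx) inverts in closed form; λ = 2 and the seed are illustrative choices:

```python
import numpy as np

rng = np.random.default_rng(5)
u = rng.uniform(size=10_000)  # standard uniform samples U

# Exponential(λ): F(x) = 1 - e^(-λx), so F⁻¹(u) = -ln(1 - u) / λ
lam = 2.0
x = -np.log(1.0 - u) / lam    # X = F⁻¹(U) has the desired distribution
```

The sample mean comes out near E[X] = 1/λ = 0.5, as expected.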

Rejection sampling
• Sample z from a proposal q(z)
• Accept with probability p(z) / (k q(z)), where k is chosen so that k q(z) ≥ p(z) for all z
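A minimal sketch with a Beta(2,2) target, p(z) = 6z(1−z), and a uniform proposal; since max p(z) = 1.5, k = 1.5 makes k·q(z) envelope p(z):

```python
import numpy as np

rng = np.random.default_rng(6)

def p(z):
    return 6 * z * (1 - z)  # Beta(2,2) density on [0, 1]

k = 1.5                     # k q(z) = 1.5 ≥ p(z) everywhere, with q = Uniform(0,1)
z = rng.uniform(size=50_000)
u = rng.uniform(size=50_000)
samples = z[u < p(z) / k]   # accept with probability p(z) / (k q(z))
```

The accepted fraction is 1/k = 2/3, and the accepted samples have the Beta(2,2) mean of 0.5.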

Importance sampling
• Sample z from a proposal q(z)
• Assign weight w = p(z) / q(z), and approximate expectations under p by weighted averages
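A sketch estimating E_p[Z] for p = N(1, 1) using samples from a wider proposal q = N(0, 2²); both distributions are arbitrary choices for the example, and self-normalization lets us drop the densities' normalizing constants:

```python
import numpy as np

rng = np.random.default_rng(7)

# Draw from the proposal q = N(0, 2)
z = rng.normal(0.0, 2.0, size=100_000)

# log p(z) - log q(z), up to additive constants that cancel below
log_w = (-(z - 1.0) ** 2 / 2) - (-(z ** 2) / 8)
w = np.exp(log_w)

# Self-normalized importance-sampling estimate of E_p[Z]
est = (w * z).sum() / w.sum()
```

The estimate recovers the target mean E_p[Z] = 1 despite never sampling from p.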

Markov-chain Monte Carlo (Metropolis-Hastings)
• Construct a proposal distribution q(z′|z)
• Iterate over t
• Generate a candidate z* ~ q(z | z(t))
• Accept with probability A = min( 1, p(z*) q(z(t)|z*) / [ p(z(t)) q(z*|z(t)) ] )
• If accepted, z(t+1) = z*
• Else, z(t+1) = z(t)
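A sketch with a symmetric random-walk proposal, so the q terms in the acceptance ratio cancel (the Metropolis special case); the standard-normal target, step size, chain length, and burn-in are all illustrative choices:

```python
import numpy as np

rng = np.random.default_rng(8)

def log_p(z):
    return -0.5 * z ** 2           # unnormalized log target: standard normal

z = 0.0
chain = []
for _ in range(20_000):
    z_new = z + rng.normal(0.0, 1.0)            # symmetric proposal, q cancels
    A = min(1.0, np.exp(log_p(z_new) - log_p(z)))
    if rng.uniform() < A:                        # accept with probability A
        z = z_new
    chain.append(z)                              # else keep the current state
chain = np.array(chain[2_000:])                  # discard burn-in
```

The chain's mean and standard deviation approach those of N(0, 1).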

Gibbs sampling

• Require: conditional distributions p(zi | z\i), where z\i denotes all components of z except zi

• Iterate over t
• Iterate over i
• Sample zi(t+1) from p(zi | z1(t+1), …, zi−1(t+1), zi+1(t), …, zd(t)), using the latest values of the other components
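A sketch for a bivariate normal target with unit variances and correlation ρ (an assumed target chosen because its conditionals are normal in closed form: z1 | z2 ~ N(ρ z2, 1 − ρ²)):

```python
import numpy as np

rng = np.random.default_rng(9)
rho = 0.8                  # target correlation (illustrative choice)
z1, z2 = 0.0, 0.0
samples = []
for _ in range(20_000):
    # Sample each coordinate from its conditional, using the latest values
    z1 = rng.normal(rho * z2, np.sqrt(1 - rho ** 2))
    z2 = rng.normal(rho * z1, np.sqrt(1 - rho ** 2))
    samples.append((z1, z2))
samples = np.array(samples[2_000:])   # discard burn-in
```

The empirical correlation of the retained samples converges to ρ.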

