Lab 4 Density estimation and Box-Cox transformation
Lian Heng

Histogram and Kernel density estimation
A histogram can be regarded as a crude density estimator (hist(x,20)). Mathematically, the histogram can be defined as the function

$$\hat{f}(y) = \sum_{i=1}^{n} I\{Y_i \in \mathrm{Bin}(y)\},$$

where Bin(y) denotes the bin containing y. To use it as a density estimator, we should use the scaled version (hist(x,20,freq=F))

$$\hat{f}(y) = \frac{1}{nb} \sum_{i=1}^{n} I\{Y_i \in \mathrm{Bin}(y)\},$$

where b is the bin width (it can be shown that, for the scaled version, $\int \hat{f}(y)\,dy = 1$).
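The scaling by 1/(nb) can be checked numerically. The following is a minimal Python sketch, not the lab's R code: the function name `histogram_density`, the equal-width bins spanning the sample range, and the synthetic normal sample are all assumptions made for illustration (R's hist() chooses its own breakpoints).

```python
import random

def histogram_density(data, num_bins):
    """Return (edges, heights) for a histogram scaled to integrate to 1."""
    lo, hi = min(data), max(data)
    b = (hi - lo) / num_bins                  # common bin width b
    counts = [0] * num_bins
    for y in data:
        # Index of the bin containing y; the sample maximum goes in the last bin.
        k = min(int((y - lo) / b), num_bins - 1)
        counts[k] += 1
    n = len(data)
    heights = [c / (n * b) for c in counts]   # count divided by n * b
    edges = [lo + k * b for k in range(num_bins + 1)]
    return edges, heights

random.seed(0)
sample = [random.gauss(0.0, 1.0) for _ in range(500)]   # synthetic data
edges, heights = histogram_density(sample, 20)

# Area under the step function: sum of height * width over all bins.
bin_width = edges[1] - edges[0]
total_area = sum(h * bin_width for h in heights)
print(total_area)
```

The printed area should equal 1 up to floating-point rounding, because the bin counts sum to n and each bar has width b.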

Figure 1: Illustration of histogram.

A much better estimator is the kernel density estimator (KDE). The estimator takes its name from the so-called kernel function, denoted here by K, which is a probability density function that is symmetric about 0. The standard normal density function is a common choice for K and will be used here. The kernel density estimator based on $Y_1, \ldots, Y_n$ is

$$\hat{f}(y) = \frac{1}{nb} \sum_{i=1}^{n} K\!\left(\frac{y - Y_i}{b}\right),$$

where b, which is called the bandwidth, determines the resolution of the estimator: a small b gives a wiggly estimate that follows individual observations, while a large b gives a smoother estimate that may blur real features.
My code illustrates the similarity of the histogram and the KDE, and the effect of b. My code also shows how to compare the KDE (nonparametric in nature) with some parametric density estimators (for example, fitting the data using a normal or t density).
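The KDE formula with the standard normal kernel can be written out directly. This is a minimal Python sketch rather than the lab's R code: the names `gaussian_kernel` and `kde`, the synthetic sample, the evaluation grid, and the three bandwidths are assumptions chosen only to show the effect of b.

```python
import math
import random

def gaussian_kernel(u):
    """Standard normal density, the kernel K used in the text."""
    return math.exp(-0.5 * u * u) / math.sqrt(2.0 * math.pi)

def kde(y, data, b):
    """Kernel density estimate at the point y with bandwidth b."""
    n = len(data)
    return sum(gaussian_kernel((y - yi) / b) for yi in data) / (n * b)

random.seed(1)
sample = [random.gauss(0.0, 1.0) for _ in range(300)]   # synthetic data

# Evaluation grid, wide enough to cover essentially all of the mass.
step = 0.02
grid = [-8.0 + step * k for k in range(801)]

areas = {}
peaks = {}
for b in (0.1, 0.4, 1.5):        # illustrative bandwidths, not recommendations
    values = [kde(y, sample, b) for y in grid]
    areas[b] = sum(values) * step    # Riemann-sum approximation of the integral
    peaks[b] = max(values)           # height of the tallest point of the estimate
    print(b, areas[b], peaks[b])
```

Each estimate should integrate to approximately 1 whatever the bandwidth, while the peak height drops as b grows, which is the oversmoothing effect of a large bandwidth.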

Box-Cox transformation
The logarithm transformation is probably the most widely used transformation in data analysis, followed by the square-root transformation. They are special cases of the Box-Cox transformation

$$y^{(\alpha)} = \begin{cases} \dfrac{y^{\alpha} - 1}{\alpha}, & \alpha \neq 0, \\ \log(y), & \alpha = 0. \end{cases}$$

Figure 2: Illustration of kernel density estimator.

The Box-Cox transformation is often used to transform right-skewed data (using α < 1) or left-skewed data (using α > 1) to a roughly symmetric distribution. For α < 1 the transformation is concave, so it compresses large values more than small ones and pulls in a long right tail. An explanation of why α < 1 (in particular the log transformation) can deal with right-skewed data is given in the following picture:

Figure 3: Illustration of logarithm transformation.

My code illustrates the effect of transformation.
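The effect of α on skewness can also be seen numerically. The sketch below is in Python rather than the lab's R, and everything in it is an assumption for illustration: the helper names `box_cox` and `skewness`, the log-normal synthetic sample, and the three values of α.

```python
import math
import random

def box_cox(y, alpha):
    """Box-Cox transform of a positive value y."""
    if alpha == 0:
        return math.log(y)
    return (y ** alpha - 1.0) / alpha

def skewness(values):
    """Sample skewness: third central moment over the cubed standard deviation."""
    n = len(values)
    mean = sum(values) / n
    m2 = sum((v - mean) ** 2 for v in values) / n
    m3 = sum((v - mean) ** 3 for v in values) / n
    return m3 / m2 ** 1.5

random.seed(2)
# Log-normal draws: positive and strongly right-skewed (synthetic data).
sample = [random.lognormvariate(0.0, 0.8) for _ in range(2000)]

# alpha = 1 leaves the shape unchanged, 0.5 is the square root, 0 is the log.
skew_by_alpha = {
    alpha: skewness([box_cox(y, alpha) for y in sample])
    for alpha in (1.0, 0.5, 0.0)
}
for alpha, s in skew_by_alpha.items():
    print(alpha, s)
```

The skewness should shrink as α decreases from 1 to 0, and for this particular sample it should be close to 0 at α = 0, since the logarithm of a log-normal variable is normal.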

Task
Using the CPSch3 data (average hourly earnings data from the Current Population Survey) in the Ecdat package, we look at earnings for males. The QQ-plot, box plot, and KDE all suggest that a square-root transformation is reasonably good at transforming the data to a normal distribution. Use the boxcox() function (in the MASS package) to find the best value of α, which is used to transform the data (simply use $y^{\alpha}$ instead of $(y^{\alpha} - 1)/\alpha$). After transformation, plot the KDE, the fitted normal density, and the fitted t distribution with df = 5. Visually, does the normal or the t provide a better fit to the transformed data? Fill in the missing part of the code and upload the plot showing the three densities as a .jpg file on Canvas (do not submit the picture for the result of boxcox()). In the comment box on Canvas, paste the two-line code (one line for using boxcox() and another line for doing the transformation y=...) and state whether you think the normal distribution or the t distribution is a better fit to the transformed data.
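For intuition about how a "best" α can be chosen from data, one standard approach is to maximise the Box-Cox profile log-likelihood $\ell(\alpha) = -\tfrac{n}{2}\log\hat{\sigma}^2(\alpha) + (\alpha - 1)\sum_i \log y_i$ over a grid of α values, where $\hat{\sigma}^2(\alpha)$ is the variance of the transformed data. The Python sketch below illustrates that idea on synthetic data under an intercept-only normal model. It is not the R solution to the task and not a reimplementation of boxcox(); the function name `profile_loglik`, the grid, and the sample are assumptions.

```python
import math
import random

def profile_loglik(data, alpha):
    """Box-Cox profile log-likelihood for an intercept-only normal model."""
    n = len(data)
    if alpha == 0:
        z = [math.log(y) for y in data]
    else:
        z = [(y ** alpha - 1.0) / alpha for y in data]
    mean = sum(z) / n
    sigma2 = sum((v - mean) ** 2 for v in z) / n     # ML estimate of the variance
    # Jacobian term of the transformation, needed to compare different alphas.
    log_jacobian = (alpha - 1.0) * sum(math.log(y) for y in data)
    return -0.5 * n * math.log(sigma2) + log_jacobian

random.seed(3)
# Synthetic positive data whose square root is normal by construction.
sample = [random.gauss(5.0, 1.0) ** 2 for _ in range(2000)]

alphas = [k / 10 for k in range(-20, 21)]            # grid from -2 to 2
best_alpha = max(alphas, key=lambda a: profile_loglik(sample, a))
print(best_alpha)
```

Because the sample is built as the square of a normal variable, the maximiser should land near α = 0.5, up to sampling noise and the 0.1 spacing of the grid.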
