Probability Density Functions PDF

Probability density Function. this guide provided understanding in understanding how to obtain probability density function (pdf) and normalized function as well

Uploaded by

Sampson Valiant Onis

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

238 views3 pages

Probability Density Functions PDF

Probability density Function. this guide provided understanding in understanding how to obtain probability density function (pdf) and normalized function as well

Uploaded by

Sampson Valiant Onis

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

Probability Density Functions, Page 1

Probability Density Functions

Author: John M. Cimbala, Penn State University Latest revision: 20 January 2010

Probability Density Functions Probability density function In simple terms, a probability density function (PDF) is constructed by drawing a smooth curve fit through the 0.03 vertically normalized histogram as f(x) sketched. You can think of a PDF as the 0.02 smooth limit of a vertically normalized histogram if there were millions of 0.01 measurements and a huge number of bins. o The main difference between a 0 x histogram and a PDF is that a x1 x2 x3 ... histogram involves discrete data (individual bins or classes), whereas a PDF involves continuous data (a smooth curve). dx dx P xi < x xi + dx dx 2 2 < x xi + o Mathematically, f(x) is defined as f ( xi ) = , where P xi dx 2 2 represents the probability that variable x lies in the given range, and f(x) is the probability density function (PDF). In other words, for the given infinitesimal range of width dx dx dx P xi < x xi + between xi dx/2 and xi + dx/2, the 0.03 2 2 integral under the PDF curve is the f(x) probability that a measurement lies 0.02 within that range, as sketched. dx o As shown in the sketch, this probability 0.01 is equal to the area (shaded blue region) under the f(x) curve i.e., the integral 0 x xi dx/2 xi xi + dx/2 under the PDF over the specified infinitesimal range of width dx. o The usefulness of the PDF is as follows: Suppose we choose a range of variable x, say between a and b. The probability that a measurement lies 0.03 between a and b is simply the integral f(x) under the PDF curve between a and b, 0.02 as sketched, where we define the probability as 0.01
P (a < x b) =
x =b x=a

f ( x ) dx

P(a < x b) b

0
x =

If a and b +, the probability must equal 1 (100%), i.e., P ( < x < ) =

x =

f ( x ) dx = 1 .

In other words, the probability that x lies between and + is 100% (a fact that should be obvious, since there are no other possibilities for real number x). Once we have defined the probability density function f(x), we leave the system of discrete random variables and enter the system of continuous random variables, on which we make some more formal definitions: Expected value is defined in terms of the probability density function as the mean of all possible x
values in the continuous system. Namely, expected value = = E ( x ) = xf ( x ) dx . In an ideal

situation in which f(x) exactly represents the population, is the mean of the entire population of x values, and that is why it is called the expected value. It is therefore also called the population mean. In general, x , but x when n is large, i.e., the sample mean approaches the

expected value when n is large. x and are often used interchangeably, but this should be done only if n is large. Standard deviation is defined in terms of the PDF as

Probability Density Functions, Page 2

standard deviation = =

( x ) f ( x ) dx . In an ideal situation in which f(x) exactly represents

the population, is the standard deviation of the entire population. It is therefore also called the population standard deviation. If n is large, S . Often, S and are used interchangeably, but this should be done only if n is large.

Normalized probability density function a normalized probability density function is constructed by transforming both the abscissa (horizontal axis) and ordinate (vertical axis) of the PDF plot as follows: x z= and f ( z ) = f ( x ) .

o o

The above transformations accomplish two things: The first transformation normalizes the abscissa such that the PDF is centered around z = 0. The second transformation normalizes the ordinate such that the PDF is spread out in similar fashion regardless of the value of standard deviation. When normalized in this way, the normalized PDF can be directly compared to standard PDFs, which we discuss in a later learning module. To summarize, here are several steps used in Excel to generate a normalized PDF of experimental data: 1. Generate the histogram with Excel as discussed in the histogram learning module. Excel generates a table called a frequency table. The table contains two columns, bin and frequency. Bin is the maximum value of the range of each bin, and frequency is the number of data points in that bin range. (For example, suppose there are 200 data points total, the mean value of x is 10.0, and the standard deviation of the data set is 3.0. Also suppose that 8 of those data points lie in the bin with x between 4 and 6 (4 < x 6). Thus, for this bin, Bin = 6 and Frequency = 8.) 2. Create a new column called probability in which you divide each frequency by the total number of data points. This gives the probability that a data point lies in that bin, i.e. probability = frequency / n . (In the example here, probability = 8/200 = 0.040 or 4.0%.) 3. Create a new column called xmid in which you list the mid value of each bin: xmid = ( xmin + xmax ) / 2 . (In the example here, the mid value of the sample bin is (4 + 6)/2 = 5.0.) 4. Create a new column called f(x) in which you divide each probability by the appropriate bin width, i.e., f ( x ) = probability / x . (In the example here, the bin width of the sample bin is x = 6 4 = 2, and f(x) = 0.04/2 = 0.02 at x = xmid = 5.0.) A smoothed plot of f(x) versus x is the PDF. 5. Create a new column called z in which you normalize the x values into nondimensional z values. This is accomplished by converting each mid value of x into z: z = ( x ) / . (In the example here, z for the sample bin is z = (5.0 10.0)/3.0 = 1.667.) 6. Create a new column called f(z) in which you normalize the PDF into the f(z) values. This is accomplished by converting each f(x) into f(z): f ( z ) = f ( x ) . (In the example here, f(z) of the sample bin is f(z) = 0.02*3.0 = 0.060 at z = 1.667.) 7. Finally, a plot of f(z) vs. z can be generated. A smooth curve through these data represents the normalized PDF.

Example: Given: The same 1000 temperature measurements used in a previous example for generating a histogram. The data are provided in an Excel spreadsheet (Temperature_data_analysis.xls) on the website. To do: Generate a PDF of these data. Normalize the PDF. Solution: o In a previous example (see the Histogram learning module), we generated a histogram of the temperature data. We begin with the bin and frequency data generated in Excel.

Probability Density Functions, Page 3

To generate the PDF, we follow the step-by-step instructions provided above. This will be shown in class in Excel. The vertically normalized PDF is shown below (left side).

Transform

Finally, we transform to normalized variables the fully normalized PDF is shown above (right side). Notice that the shape is the same, but the variable transformation to f(z) is nondimensional, making it more useful for comparison with other probability density distributions. The final PDF should be continuous, not discrete. Because of scatter, it is difficult to get Excel to draw a smooth curve through these data. For lack of a better method at this point, we sketch the smooth curve by eye below:

Discussion: o The peak in the vertically normalized PDF occurs at x 31, which is very close to the sample mean. This peak transforms to z 0 in the fully normalized PDF; this is a useful feature of the normalization. o We can estimate the area under the f(x) curve by eye by counting squares the area is indeed approximately 1.0 or 100%, as it must be. o We can also estimate the area under the f(z) curve by eye it is approximately 1.0 or 100%, as it also must be. There are several standard PDFs discussed in statistics literature. Of these, the normal PDF, is the most common, and will be discussed next. We will also compare the above results with the normal PDF.

Understanding Normal Distribution Basics
No ratings yet
Understanding Normal Distribution Basics
5 pages
M131-Lecture Notes No. 4
No ratings yet
M131-Lecture Notes No. 4
58 pages
Understanding Probability Densities in Statistics
No ratings yet
Understanding Probability Densities in Statistics
17 pages
Raghunath Chatterjee - Normal Distribution - Lecture
No ratings yet
Raghunath Chatterjee - Normal Distribution - Lecture
39 pages
Lesson 7.1 Introduction To The Normal Distribution
No ratings yet
Lesson 7.1 Introduction To The Normal Distribution
9 pages
Probability Distributions Explained
No ratings yet
Probability Distributions Explained
46 pages
Histogram and PDF Analysis in ME 288
No ratings yet
Histogram and PDF Analysis in ME 288
3 pages
Random Variables I I Filled in 2
No ratings yet
Random Variables I I Filled in 2
8 pages
Continuous Probability Distributions Overview
No ratings yet
Continuous Probability Distributions Overview
70 pages
Continuous Random Variables in Statistics
No ratings yet
Continuous Random Variables in Statistics
35 pages
Continuous Probability Distributions Explained
No ratings yet
Continuous Probability Distributions Explained
78 pages
Probability Measuring Uncertainty
No ratings yet
Probability Measuring Uncertainty
60 pages
Lecture 8
No ratings yet
Lecture 8
15 pages
Probability: Expectation, Variance, CLT
No ratings yet
Probability: Expectation, Variance, CLT
24 pages
Continuous Probability Distributions in Engineering
No ratings yet
Continuous Probability Distributions in Engineering
45 pages
Normal Distribution Applications
0% (1)
Normal Distribution Applications
5 pages
Unit - 3
No ratings yet
Unit - 3
19 pages
Descriptive Statistics and Probability Distributions: Session 1
No ratings yet
Descriptive Statistics and Probability Distributions: Session 1
34 pages
Continuous Probability Distributions
100% (1)
Continuous Probability Distributions
48 pages
Continuous Probability Distributions Overview
No ratings yet
Continuous Probability Distributions Overview
53 pages
Introduction to Continuous Probability Distributions
No ratings yet
Introduction to Continuous Probability Distributions
48 pages
Understanding Normal Distribution Basics
No ratings yet
Understanding Normal Distribution Basics
69 pages
Understanding Normal Probability Law
No ratings yet
Understanding Normal Probability Law
16 pages
Managerial Statistics Overview for OPAN 6500
No ratings yet
Managerial Statistics Overview for OPAN 6500
41 pages
Gamma Distribution and Transistor Lifetimes
No ratings yet
Gamma Distribution and Transistor Lifetimes
54 pages
Lecture - 07 - TP
No ratings yet
Lecture - 07 - TP
63 pages
Transforming Continuous Random Variables
No ratings yet
Transforming Continuous Random Variables
3 pages
Normal Distribution and Probability Concepts
No ratings yet
Normal Distribution and Probability Concepts
10 pages
Continuous Random Variables & Distributions
No ratings yet
Continuous Random Variables & Distributions
7 pages
Continuous Probability Distributions Explained
No ratings yet
Continuous Probability Distributions Explained
17 pages
Importance of Normal Distributions in Statistics
No ratings yet
Importance of Normal Distributions in Statistics
148 pages
Basic Probability Distributions Overview
No ratings yet
Basic Probability Distributions Overview
33 pages
Regression and Distribution Functions Guide
No ratings yet
Regression and Distribution Functions Guide
1 page
Discrete Probability Distributions Explained
No ratings yet
Discrete Probability Distributions Explained
53 pages
Continuous Probability Distribution PDF
No ratings yet
Continuous Probability Distribution PDF
47 pages
Lesson 4
No ratings yet
Lesson 4
51 pages
STA 211 Lecture 1
No ratings yet
STA 211 Lecture 1
18 pages
2 Other PDFs
No ratings yet
2 Other PDFs
51 pages
Continuous Probability Distribution-1
No ratings yet
Continuous Probability Distribution-1
19 pages
Normal, Binomial, Poisson, and Exponential Distributions
No ratings yet
Normal, Binomial, Poisson, and Exponential Distributions
39 pages
Department of Mathematics: Faculty of Basic Sciences Probability and Statistics Exercises
No ratings yet
Department of Mathematics: Faculty of Basic Sciences Probability and Statistics Exercises
8 pages
Understanding Normal Distribution Basics
No ratings yet
Understanding Normal Distribution Basics
32 pages
Lecture
No ratings yet
Lecture
6 pages
Lecture 6
No ratings yet
Lecture 6
57 pages
Normal Distribution
No ratings yet
Normal Distribution
1 page
5 ContinuousDiscributions
No ratings yet
5 ContinuousDiscributions
34 pages
Lecture 7
No ratings yet
Lecture 7
41 pages
Understanding Continuous Random Variables
No ratings yet
Understanding Continuous Random Variables
44 pages
Ch.3 Normal Distribution
No ratings yet
Ch.3 Normal Distribution
1 page
Overview of Continuous Distributions
No ratings yet
Overview of Continuous Distributions
17 pages
Normal Distribution Explained: Key Concepts
No ratings yet
Normal Distribution Explained: Key Concepts
73 pages
R Session: Probability Distributions
No ratings yet
R Session: Probability Distributions
6 pages
Understanding Normal Distribution
No ratings yet
Understanding Normal Distribution
17 pages
Probability Distributions
No ratings yet
Probability Distributions
18 pages
Normal Distribution
No ratings yet
Normal Distribution
51 pages
Algebra Problems for Economics and Management
90% (747)
Algebra Problems for Economics and Management
719 pages
Chapter 15
50% (2)
Chapter 15
70 pages
Biomass Power: Sugarcane Bagasse
100% (3)
Biomass Power: Sugarcane Bagasse
217 pages
A Process Model To Estimate Biodiesel Production Costs
100% (1)
A Process Model To Estimate Biodiesel Production Costs
8 pages

Probability Density Functions PDF

Uploaded by

Probability Density Functions PDF

Uploaded by

Probability Density Functions, Page 1

Probability Density Functions

If a and b +, the probability must equal 1 (100%), i.e., P ( < x < ) =

Probability Density Functions, Page 2

( x ) f ( x ) dx . In an ideal situation in which f(x) exactly represents

Probability Density Functions, Page 3

You might also like