Professional Documents
Culture Documents
Session 1 PDF
Session 1 PDF
• Categorical Data
• Numerical Data
• Probability Distribution
• Expected Value
• Standard Deviation
Data
• Hair color
• State of birth
• College major
• Political affiliation
2. Ordinal Data...
Outliers
A data value may be quite unlike the other data
values. In this case, it is permitted to remove it
(and a few others) from the data set if it helps to
show more clearly the pattern in the remaining
data.
ACCT
MKTG 22%
28%
FINC
20%
MGMT
15%
INFO
15%
Session 1 7
17
Session 1 9
1.5 IQR
Random Variable
A Random Variable is a variable that takes on
numerical values, each associated with a probability.
EXAMPLES:
1. Discrete
-- Possible to list all values.
2. Continuous
-- All values in a range are possible.
Session 1 11
Example:
Throw a die once.
x p(x)
1 1/6
2 1/6
3 1/6
4 1/6
5 1/6
6 1/6
0.18
0.16
0.14
0.12
Probability
0.10
0.08
0.06
0.04
0.02
0.00
1 2 3 4 5 6
Result
Session 1 12
Example:
From the coin-operated machine in front of the House
of Pancakes you tabulate the % of days that various
numbers of newspapers were sold:
x p(x)
Newspapers Sold Probability
0 0.1
1 0.1
2 0.1
3 0.2
4 0.2
5 0.1
6 0.1
7 0.1
0.20
Probability
0.15
0.10
0.05
0.00
0 1 2 3 4 5 6 7
Newspapers Sold
Session 1 13
Computing and
For Discrete Random Variables
= E (X )
= x p(x )
Standard Deviation:
2 = (x − ) p(x )
2
= x 2 p(x ) − 2
= x 2 p(x ) − 2
Session 1 14
Example:
a b
EXAMPLE:
If a person's height, weight, or IQ is measured to an
arbitrary number of decimal places then we get a
continuous random variable. Its density function
would be bell-shaped:
| | |
Session 1 17
Normal Distribution
= Mean
| | |
–1 0 1
Session 1 18
0.8
0.7
0.6
0.5
f(x)
0.4
0.3
0.2
0.1
0
0.00
0.30
0.60
0.90
1.20
1.50
1.80
2.10
2.40
2.70
3.00
-3.00
-2.70
-2.40
-2.10
-1.80
-1.50
-1.20
-0.90
-0.60
-0.30
NORMDIST(X,,,TRUE)
Returns cumulative probability under the normal
curve up to point X.
NORMINV(Probability,,)
Returns X for a given probability under normal curve.
NORMSDIST(Z)
NORMSINV (Probability)
Returns Z for a given cumulative probability
under the normal curve.
Session 1 21
| | |
| | |
| | |
| | |
Session 1 22
| | |
P(−1 Z 1) =
| | |
P(−2 Z 2) =
| | |
P(−3 Z 3) =
Session 1 23
| | |
z=
| | |
z=
| | |
z=
Session 1 24
Normal Transformation
A normal random variable with mean and standard
deviation can be transformed to a standard normal
random variable using:
Number - Mean
Standardized number =
Standard Deviation
or, symbolically:
X −
z=
Also:
X = + z
Session 1 25
Example:
I.Q.'s are normally distributed with = 100 and = 15 .
| | |
| | |
Session 1 26
Skewed
Let X1 and X 2 be any two independent random variables with means μ1 and μ2 and variances σ12 and σ22 .
Suppose Y = aX1 + bX 2 . Then,
• Suppose Y = aX1+bX2
2 2 2 2
– Var[Y] = a σ1 + b σ2
2 2
• Suppose X1~N(μ1, σ1 ) and X2~N(μ2, σ2 )
Summary of Session 1
Software Notes