Professional Documents
Culture Documents
P(A|B)= P(A) or
P(B|A) = P(B)
P ( A B ) P( A) P ( B ) P ( A B )
If a distribution is skewed then it has extreme values
Put your answer in context
Median/ Interquartile range better than mean/
standard deviation because they are not affected by Positive correlation (not enough)
outliers. However the mean takes into account all
values! Positive linear correlation (better)
Normal distribution not suitable for modeling skewed Strong positive linear correlation; as
distributions as it is symmetrical. temperature increased so did number of ice
creams sold (best)
Mean =
When talking about skew in a box plot look only at
the box!! x or xf
n f
Standard deviation is
A statistical experiment is a
x 2
mean 2 or
x f
2
mean 2
n f process used for collecting
data to provide evidence for or
LEARN THIS
against a hypothesis
Mean of squares square of mean
F(0.6) = P(X<0.6)
A MEASURE OF SKEWNESS IS
x axis years employed by company The response variable is the variable which
y axis salary occurs as a result of the behaviour of the
explanatory variable.
Then gradient would be salary increase per year
y intercept would be starting salary
x -1 0 1 2 3
p 0.3 0.2 0.2 0.2 0.1
Only use a regression line is the PMCC shows
strong correlation , as this shows the data can be Mean=E(X)=
modeled by a straight line. xp (1 0.3) (0 0.2) (1 0.2) (2 0.2) (3 0
= -0.3+0.2+0.4+0.3=0.6
> +0.7 Variance = E(X2) (E(X))2= x 2
p mean 2
< -0.7 =
( 1) 2 0.3 (0 2 0.2) (12 0.2) (2 2 0.2) (3 2 0.1)
= 0.3+0.2+0.8+0.9-0.36=1.84
Some features of a box plot are
To work out E(X2) just do the first bit of the
variance formula 1. Allows comparisons
2. Shows outliers
3. Indicates spread/range/IQR
x 2
p
4. Shows max/min/median/quartiles
5. Shows skewness
Dont round until the end; never use a rounded When using formulae, show your substitutions
value in calculations carefully
This will lose you marks THIS WILL GET YOU MARKS FOR WORKING