Professional Documents
Culture Documents
Look at the difference between the control and the experimental group
Large Difference – the IV manipulation has an effect
Small Difference – the difference is likely dt variability
Probability – how unlikely the difference has to be to conclude that the IV
had an effect on the DV. Ranges from 0.00 (impossible) to 1.00 (certain)
that the observed result is due to expected variability (p0.05)
Small probability = unlikely it occurred by chance
Measurement – The orderly assignment of a numerical value to a
characteristic
Properties of Scales
Rank order – ranking in increasing magnitude (rank them 1, 2, 3)
Categorization/Rating – putting people in categories (fun, boring),
or in rating groups (5 star scale)
Magnitude – you can assign data judging values as less than or
greater than or equal.
Equal Intervals – a unit of measurement on the scale is the same
regardless of where on the scale the unit falls
Abs Zero – a value where “nothing at all” of the attribute being
measured exists.
o Height or Kelvin
o Assigned 1,2,3 ranks doesn’t have AZ (no rank of zero used)
o
Types of Scales
Ratio Scale – Mag, EI, AZ
o Ratio statements can be made (a 70” person is dbl the height
of a 35” person)
Interval Scale – Mag, EI (NO AZ)
o Farenheit or Celcius – because at zero degrees, there is still
temperature. 0 deg is not abs zero. 25o is warmer than 12.5o,
but it isn’t twice as hot. But a 12 deg difference on the scale
is the same magnitude regardless of where on the scale youre
at.
Ordinal Scale – Magnitude (No EI or AZ) – rank in magnitude,
shortest to tallest, then can assign them numbers too, but the
difference bw the rankings (diff between 1 and 2, and 2 and 3) are
not equal.
Nominal Scale – Mutually exclusive groups (No Mag, EI, AZ)
o Classification of cars based on their brand name
Continuous Variable – infinite number of values bw any two points on it.
Discrete Variables – non infinite values (ex: basketball game points)
Real Limits – Upper and lower values that can be rounded off to the value
collected as data.
Example) You finish a race in 33s, and the timer measures to the
tenth position but you round to the nearest second..
o Lower real limit 32.5s (below this is 32s)
o Upper real limit 33.5s (above this is 34s)
Real Limits of a number are the points falling ½ measurement unit above
and below the number.
Depends on the units measured by a device, can it measure 1s
intervals(+/- 0.5s limits), or 0.1s intervals (+/- 0.05s limits)
Notation
Capital letters represent a variable X, Y, V
Subscripts distinguish one score from another
o Participant 1 = X1
o Participant 2 = X2
N – total # of scores. Subscripts^ run form 1 to N
Xi – any particular score in a distribution
Σ – summing of scores. Ex) Σ X = all Xi values summed
Cum f (cumulative freq distribution) – one in which the entry for any score
value or class interval is the sum of the frequencies for that value or that
interval plus the frequencies of all lower scores.
Cum Rel f (cumulative relative freq distribution) - one in which the entry for
any score value or class interval expresses that value’s or that interval’s
cumulative frequency as a proportion to the total number of cases.
f - how many scores in that class interval?
Rel f – what proportion of scores were in that class? (f/N)
Cum f – sum of scores in that class interval and the class intervals below?
Cum Rel f – what proportion of the Cum f scores are in that CI and the ones
below? (Cum f/N)
Don’t add Rel f to get Cum Rel f, because the dividing and rounding to get
the Rel f will make those values less accurate than adding up the individual
frequency values and dividing by N.
If 45% of scores are in a class interval and below, then the upper limit of
that class interval defines the 45th percentile.
Usually the lowest class interval starts with the lowest score… see below.
The 1st score values (lowest value) in the lowest class interval should be
evenly divisible by the size of the interval
Ex) If the lowest score is 30 and the intervals are 5 units large, 30/5 = 6
If the lowest score is 49, and the interval size is 4, drop down to the lowest
interval starting at 48-51, because 48 is evenly divisible by 4.
Stated Limits – In a class interval, the highest and lowest values in the
interval.
The size of a class interval is obtained by subtracting the lower real limit
from the upper real limit.
Ex) The Class interval size of 30-34 is 5 because 34.5-29.5 = 5
Frequency Histogram
Abscissa – horizontal axis
Ordinate – vertical axis (3/4 as long as abscissa)
Each axis should have a zero origin, the break in the abscissa scale
indicated that part of the scale was omitted.
Each bar width = 1 class interval and saddles the interval midpoint
FREQUENCY POLYGON
a point above each interval midpoint corresponds to the frequency
within that interval
Must have empty intervals with 0 frequences to the left and right
(dots on the abscissa)
Exploratory Data Analysis – an approach and a set of tools that are used,
often in an unplanned and exploratory manner, to describe and understand
the meaning of a set of data.
Stem & Leaf Display – combine histogram and frequency distribution into
one display, while preserving more of the information in the original data
than does a frequency distribution
A score value is broken down into a stem and a leaf
o Stem – the remaining larger digits
o Leaf – the smallest digit
o Ex) Score 23 Stem (2), Leaf (3)
Here, stems are tens digits and leafs are ones digits.
Batch – ordered listed of the stems and leafs of the scores (similar
to a distribution)
Extreme Scores – the lowest & highest scores in the batch, excluding
outliers
ESL is immediately above FL – 1.5(4th spread)
ESH is immediately below FU + 1.5(4th spread)
the 4th spread and the the fourths +/- 1.5(4th spread) are resistance
indicators of variability
Boxplot
Ordinate has no scale
Box stretches from FL to FU
Horizontal lines going to the extremes
X to label outliers
Median reflects central tendency
The box and lines reflect the variability
Asymmetry implies skewness
Short box is more peaked – leptokurtic
Broad box is flatter – platykurtic
1/10/2018 10:24:00 AM
Indices of Variability
Range
Variance
Standard deviation
The Sum of the Squared Deviations of Scores about their mean is not zero
Least Squares Sense – Taking deviations from the mean yields the
smallest number than if deviations from the scores were taken and squared
from any other number.
Median
Odd # Cases: Md score corresponds to case (N+1)/2
Even# Cases: Md score corresponds to the middle/avg of two cases
o (N/2 + N/2 + 1)/2
Mode – the most frequently occurring score value, NOT the frequency of the
most commonly occurring score value. Always the value at max peak height.
Bimodal – 2 modes
Multimodal – more than two modes
o
Properties of s2 and s
Logical to base variability on deviation from the mean
Always positive
s2 is very sensitive to extremes since it’s the squared deviation
Variance is proportional to the average squared deviation of each
score from every other score.
As variability increases, variance increases.
No variability gives s2 and s of zero
Under certain conditions, variance can be partitioned and its
portions attributed to different sources.
Population Formulas
μ = population average
σ2 = pop variance
σ = pop SD
Mean/Variance of the sample are used as estimators of the mean and
variance of the population.