Professional Documents
Culture Documents
A. Bar chart – composed of discrete bars o Defined as the midpoint of the list (In
that represent different categories of distribution of scores: smallest to largest)
data. It is used to compare values o It divides the scores, thus 50% of scores
across categories. in the distribution have values that are
B. Pie chart – circular chart used to equal to or less than the median.
compare parts of the whole. o Requires scores that can be placed in
C. Line chart – displays the relationship rank order (small - large) and are
between two types of information. measured on an ordinal, interval or ratio
D. Histogram – has connected bars that scale.
displays the frequency or proportion of o Median can be identified by a simple
cases that fall within defined interval counting procedure.
columns. o It is relatively unaffected by extreme
E. Cumulative frequency – allows us to find scores; hence it tends to stay in the
the median, lower and upper quartile ‘center’ of the distribution.
range. o It serves as an alternative to the mean.
F. Box Plots – displays data for comparison.
Five needed information:
1. Minimum value 3. Mode
2. Lower quartile (Q1)
3. Median (Q2) o Most frequently occurring category or
4. Upper quartile (Q3) score in the distribution.
5. Maximum value o In frequency distribution graph, mode is
the score corresponding to the peak or
high point of the distribution.
o It can be determined for data
Measures of Central Tendency (CT)
measured on any scale of
o A statistical measure that determines a measurement: nominal, ordinal, interval,
single value that accurately describes or ration.
the center of the distribution and
Primary value
represents the entire distribution of
scores. (I) The only measure of CT that can be
o Aims to identify the single value that is used for data measured on a
the best representative for the entire set nominal scale.
of data. (II) Used as a supplemental measure of
o Allows researchers to summarize or central tendency that is reported
condense a large set of data in a very along with mean or median.
simplified, concise form though
identifying the ‘average score’ (CT).
o A descriptive statistic Central Tendency & Shape of Distribution
n = sample size.
Testing ‘r’
171 173
Step 2. Calculate the correlation coefficient
195 193
- r = 0.932
- p = 0.0001
182 185
190 186
175 178
Spearman’s Test 3 840 17.2
Like most non-parametric tests, the data are
first ranked from smallest to largest 4 505 6.7
11 815 20.2
Calculating Spearman’s correlation coefficient
12 460 3.0
13 697 10.3
Solution:
Step 2. Calculate the correlation coefficient
Observation Mass Territory Step 3. Test the significance of the relationship
number size
ρ = 0.835, p = 0.001
1 510 6.9
2 773 20.6
Linear Regression 3. For any value of x, any particular value
of y is:
o Here we are testing a causal
relationship between the two variables. i. yi = α + βx + e
o We are hypothesizing a functional
ii. Where e, the residual, is the
relationship between the two variables
amount by which any observed
that allows us to predict a value of the
value of y differs from the mean
dependent variable, y, corresponding
value of y (analogous to
to a given value of the independent
“random error”)
variable, x.
iii. Residuals will follow a standard
normal distribution
Regression
4. The variances of the y variable for all
o Unlike correlation, regression does imply values of x are equal
causality 5. Observations are independent – each
individual is measured only once.
o An independent and a dependent
variable can be identified in this
situation. Estimating the Regression Function and Line
o μy = α + βx (or y = mx + b)
o β = population slope.
μy = α + βx
Estimating the Regression Function and Line
Sums of Squares