Professional Documents
Culture Documents
Proper&es of Mean
• The mean reflects the magnitude of every observaJon, since
every observaJon contributes to the value of the mean.
• The mean can be easily affected by the presence of an extreme
value, hence not a good measure of MCT when extreme value
do occur.
Review of Two Basic Descriptive Statistics
Proper&es of Median
• Median is a posiJonal value and hence is not affected by the
presence of an extreme value unlike the mean.
• The median is not amenable for further computaJon and hence
medians of subgroups cannot be combined in the same manner
as the mean.
Review of Two Basic Descriptive Statistics
Properties of Mode
• Since mode is the most frequently occurring value, it may not be the
center of the data.
• Mode does not make use of all observations.
• Mode is difficult to manipulate algebraically.
• Mode is ideal for qualitative type of data.
Review of Two Basic Descriptive Statistics
Illustration of MCT
Koko recorded his duration of stay in library for 10 school days. His data are as
follows:
Day Duration (in minutes)
1 44
2 20
3 35
4 33
5 40
6 33
7 33
8 15
9 42
10 34
Review of Two Basic Descriptive Statistics
Mean:
∑ 𝑥 44 + 20 + 35 + 33 + 40 + 33 + 33 + 15 + 42 + 34 329
𝑥̅ = = = = 𝟑𝟐. 𝟗 𝒎𝒊𝒏𝒔
𝑛 10 10
Median:
Arrange first the data from lowest to highest.
15 20 33 33 33 34 35 40 42 44
Since we have even number of data, two middle scores occur. Add the two middle score and divide the sum by 2.
33 + 34 67
𝑥7 = = = 𝟑𝟑. 𝟓 𝒎𝒊𝒏𝒔
2 2
Mode:
In the data set, 33 appear thrice. Thus, 33 is the mode and the data is unimodal.
; = 𝟑𝟑 𝒎𝒊𝒏𝒔
𝒙
Review of Two Basic Descriptive Statistics
2. Measures of Dispersion
a. Range – the difference between the greatest data value and the
lowest data value. It is the simplest measure of dispersion but the
least reliable. It does not reflect variations in the data set that lie in
between the highest and lowest data value.
Example:
In Koko’s data in his duration of stay in the library, the highest data value
is 44 and the lowest data value is 15. Thus,
𝑅 = 44 − 15
𝑹 = 𝟐𝟗
Review of Two Basic Descriptive Statistics
Formula:
Population Variance Sample Variance
∑(?@A)! ∑(?@?)̅ ! D ∑ ?! @ ∑ ? !
𝜎= = 𝑠= = or 𝑠= =
C D@E D D@E
Review of Two Basic Descriptive Statistics
Formula:
Population SD Sample SD
∑(?@A)! ∑(?@?)̅ !
𝜎= 𝑠=
C D@E
Review of Two Basic Descriptive Statistics
Example:
Let us consider Koko’s data in his duration of stay in the library (treated as
sample). The Statistical Package for Social Sciences (SPSS) was used to
compute the variance and standard deviation since this course
recommends computer application. The results are:
𝒔𝒕𝒂𝒏𝒅𝒂𝒓𝒅 𝒅𝒆𝒗𝒊𝒂𝒕𝒊𝒐𝒏 𝒔 = 𝟗. 𝟏𝟐
FREQUENCY COUNTS AND PERCENTAGES IN SPSS
In most cases, aside from analyzing the data to answer the main
objective of the study (i.e. testing the hypothesis that there is no
significant difference on the test anxiety of male and female students),
we usually start in determining how many of the respondents (from a
large data) belongs to a category in a study variable. For instance, of the
1000 respondents, “how many are females ?, without actually counting
it manually. Or maybe we are interested on determining, “what percent
of the students answered strongly agree on one of the test anxiety
items in your questionnaire, again without actually having to count it.
FREQUENCY COUNTS AND PERCENTAGES IN SPSS
Meanwhile, English test, Math test and Science test are the
actual raw scores obtained in the test. Using the data, respondent 1 is a
male (coded as 1), enrolled in a private school (coded as 2), and
answered neutral in terms of how much he like schooling (coded as 3).
He got the score of 65, 88 and 76 in the English test, Math test, and
Science test, respectively.
FREQUENCY COUNTS AND PERCENTAGES IN SPSS
The output shows that the Valid N for all the three variables (Gender,
Type of School and How much do you like schooling in general) is 90, and 0
Missing data. In other words, the data set is complete. The frequency table
shows that out of 90 respondents, 42 were males (coded as 1), and that is
46.7%. There are 48 females (coded as 2) comprising 53.3%. Similarly, 38 or
42.2% are enrolled in public schools (coded as 1), while the remaining 52 or
57.8% are enrolled in private schools (coded as 2). In terms of how much they
like schooling, 6 (6.7%) responded “very much” (coded as 4), while 38 (42.2%)
were neutral.
With SPSS, you can practically count all categorical variables (nominal
and ordinal) simultaneously and easily even for a very large data set (i.e. n=2,
000).
CROSS TABULATION IN SPSS
Step 1. With the excel file open, Click Analyze, Descrip^ve Sta^s^cs,
Crosstabs
Step 2. Put one of the variables in the Row(s) box, and the other
variable in the Column(s) box.
Step 3. Click Cells, and Row and Column Percentage (to express
frequency counts as percentages.)
Step 1. With the excel file open, Click Analyze, Descriptive Statistics,
Frequencies.
Step 2. Click Statistics. Check all statistics that you want to compute.
(Aside from measures of central tendency and dispersion,
measures of distribution like skewness can also be calculated)
1. Valid N for English test, Math test and Science test is 90, the data is
complete.
2. The Mean for English test = 76.22, Math test = 79.16, and Science
test = 74.60.
3. The Median for English test = 76, Math test = 78, and Science test =
76.
4. The Mode for English test = 76, Math test = 83, and Science test = 76.
The English test is polymodal (multiple modes).
MEASURES OF CENTRAL TENDENCY AND VARIABILITY IN SPSS
5. The Highest score for English test = 90, the Lowest score = 56, Range = 34 .
The Highest score for English test = 88, the Lowest score = 66, Range = 22.
The Highest score for English test = 89, the Lowest score = 56, Range = 33.
6. The Standard deviation (s) English test = 7.857, variance (s2) = 61.725.
The Standard deviation (s) Math test = 6.135, variance (s2) = 37.638.
The Standard deviation (s) Science test = 8.544, variance (s2) = 73.007.
MEASURES OF CENTRAL TENDENCY AND VARIABILITY IN SPSS
7. Skewness
The normal distribution, represented by the normal curve
represents symmetry and that the measures of central tendency
(mean, median and mode) are the same. However, if these three—
mean, median and mode are affected by lack of symmetry,
skewness in the data occurs.
MEASURES OF CENTRAL TENDENCY AND VARIABILITY IN SPSS
Step 1. With the excel file open, Click Analyze, Compare Means, Means.
Step 2. Put English test in the Dependent List and Gender in the Independent
List.
Step 3. Click Options. The Default Cell Statistics are Mean, Number of Cases
and Standard Deviation. You can include other statistics such as Median,
Minimum, Maximum, Variance, Kurtosis and Skewness using the arrow
key.
Perform the same procedure. Put Math test in the Dependent List
and Type of School in the Independent List. The result is:
DESCRIPTIVE STATISTICS OF GROUPS WITHIN A VARIABLE
DESCRIPTIVE STATISTICS OF GROUPS WITHIN A VARIABLE
The table shows that the two groups have the same minimum (66)
and maximum (88) scores. The overall mean score of the 90 students in
79.16, the same value we obtained in the earlier example. The output
shows that the mean score in the Math test of the 38 students enrolled in
public school (Type of School 1) is 78.47 with a standard deviation of 6.26
while the mean score of the 52 students enrolled private school (Type of
School 2) is 79.65 with a standard deviation of 6.06. With these means
scores and standard deviations in Math test, can we say that students
enrolled in private school performed better than those enrolled in public
school? This question will be discussed thoroughly in the succeeding
module (i.e. mean comparisons).
DESCRIPTIVE STATISTICS BASED ON CROSS TABULATIONS
Step 1. With the excel file open, Click Analyze, Compare Means, Means.
Step 2. Put English test in the Dependent List and Gender in the
Independent List. Then Click Next. Put Type of School in the
Independent List. Note: You can continue adding another
independent variable to determine the cross tabulations and means
( i.e. What is the mean score in English of male students enrolled in
public school (next) who responded “neutral”?)
The output shows that the mean score of all male students
(n= 42) is 76.42 and SD is 7.77, the same values we obtained
earlier. The mean score of 20 males (Gender 1) who are enrolled
in public school (Type of School 1) is 77.55 and SD is 7.65.
Similarly, the mean score of the 30 females (Gender 2) who are
enrolled in private school (Type of School 2) is 75.47 and SD is
8.92.
DESCRIPTIVE STATISTICS BASED ON CROSS TABULATIONS