Professional Documents
Culture Documents
study that examines the relationship between students' marks in mathematics and various factors
such as gender, writing hand, caste, occupation of parents, weight, distance of home from school,
Population
The population for this study could be all students in a particular grade level, such as
Sampling:
A sample of students would be selected from the population for the study. One possible
sampling method could be simple random sampling, where each student in the population has an
equal chance of being selected. For example, a random sample of 100 students could be chosen.
every member of the population has an equal chance of being selected. This is the most basic
type of sampling method, and it is often used in mathematics education research. For example, a
researcher might randomly select a sample of students from a school district to participate in a
Stratified Sampling
Stratified sampling is a method of sampling that divides the population into strata, or
groups, and then randomly selects a sample from each stratum. This method is often used in
mathematics education research when the researcher wants to ensure that the sample is
socioeconomic status. For example, a researcher might stratify a sample of students by gender
and then randomly select a sample of students from each gender group.
Cluster Sampling
Cluster sampling is a method of sampling that divides the population into clusters, or
groups, and then randomly selects a cluster to participate in the study. This method is often used
in mathematics education research when the researcher wants to save time and money. For
example, a researcher might cluster schools together and then randomly select a sample of
Systematic Sampling
Systematic sampling is a method of sampling that selects every kth member of the
population, where k is a random number. This method is often used in mathematics education
research when the researcher wants to ensure that the sample is representative of the population
in terms of certain characteristics, such as grade level or class size. For example, a researcher
might randomly select a number between 1 and 100 and then select every 100th student in a
Variables
Binary variable:
Categorical Variable:
Continuous Variable:
Gender
Dependent Variable:
Mathematics achievement
Mathematics anxiety
Mathematics self-efficacy
Independent Variable:
Teacher experience
Dichotomous Variable:
Discrete Variable:
Dummy variables:
Endogenous Variable:
Mathematics achievement
Mathematics anxiety
Mathematics self-efficacy
Exogenous Variable:
SES
Gender
Independent Variable:
Teacher experience
Interval Variable:
Intervening Variable:
Self-regulation
Motivation
Latent Variable:
Mathematical ability
Mathematical aptitude
Mathematical talent
Mediating variable:
Manifest Variable:
Moderating Variable:
SES
Gender
Nominal Variable:
Ordinal Variable:
Outcome Variable:
Mathematics achievement
Mathematics anxiety
Mathematics self-efficacy
Polychotomous Variables:
Grade level (elementary, middle, high school)
Predictor Variable:
SAT score
Gender
Treatment Variable:
Measurement Scales:
- Weight and Distance of Home from School are ratio scale variables.
Correlation
Correlation could be used to determine the relationship between variables. For example,
the study might analyze the correlation between students' marks in mathematics and their marks
in science.
Regression
other variables. For instance, the study might examine how well gender, writing hand, caste,
occupation of parents, weight, and distance from school predict a student's marks in
mathematics.
Normal Distribution
The study could examine whether students' marks in mathematics or other variables
follow a normal distribution, which would have implications for statistical analyses.
t-Test
A t-test could be used to compare the mean marks in mathematics between two groups, such as
One-Sample t-Test
A teacher wants to know if her students who use a new math software program are performing
better than the average student in the country. She could use a one-sample t-test to compare the
average scores of her students to the national average on a standardized math test.
A researcher wants to know if there is a difference in the average math scores of students who
are taught using a traditional method and students who are taught using a constructivist method.
She could use an independent samples t-test to compare the average scores of the two groups on
Paired t-Test
A teacher wants to know if her students' math scores improve after they receive tutoring. She
could use a paired t-test to compare the average scores of her students on a standardized math
ANOVA
ANOVA could be used to compare the mean marks in mathematics among three or more
The Mann Whitney U test could be employed to compare the median marks in
mathematics between two groups that do not meet the assumptions of a t-test, such as students
with left-handed and right-handed writing. By employing these statistical techniques and
analyzing the data collected from the sample, I think I can gain insights into the relationships
between variables and make inferences about the larger population of students in mathematics
education.
Task – II
Choose a specific area of research related to Mathematics education and learning. Create at least
seven variables (must be nominal, ordinal, and scale (interval & ratio)). Enter at least 200 data
(hypothetical) on SPSS and use SPSS to do quantitative analysis on the following topics (must
Descriptive Statistics
Table: 1
Marks in Mathematics
N Valid 200
Missing 0
Mean 54.51
Median 54.00
Mode 52
Variance 36.693
Skewness .160
Kurtosis .106
Range 32
Minimum 40
Maximum 72
Sum 10902
Percentiles 25 50.00
50 54.00
75 58.75
The above table shows the statistical measures of marks in mathematics, out of 200 valid
data of the respondents, the average marks in science is 64.96, median value is 65, mode 66,
standard error of skewness is 0.172, kurtosis is -0.054, standard error of kurtosis is 0.342, the
difference between highest and lowest marks of the participants i.e., range is 29, minimum marks
is 53, maximum marks is 82 and their sum is 12991. This informative table shows the first,
Figure:1
respondents are Government job holder, 35 percent are private job holder, and 20 percent are
physical worker. The highest percentage is government job holder whereas least is physical
worker
Figure:2
respondents are Bramin, 35 percent are Vasya, and 20 percent are other. The highest percentage
The above bar-graph shows the count of different occupation of a parent. This shows that
90 parent are government job holder, 70 parents are private job holder and 40 parents are
Physical worker. Goverement job holder parents are more than double of Physical worker.
Figure:5
The figure alongside shows the whisker plot of the weight of student. The median of the
The figure alongside shows the whisker plot of the marks in mathematics. The median of
the dataset is 65. The variables have outlier 7, which is significantly lower than rest of the data. It
is necessary to exclude the outlier from the original dataset before analyzing the data.
Figure:7
The figure above represents the cluster boxplot of Weight of student by caste of the
student. Data shows that median weight of Bhamin is 39, Median Weight of Vaisya is 41 and
median weight of Other caste student is 40. It means there are similar weight of student
The histogram displayed above represents the distance of home from school distribution
of the participants. It indicates that the majority of the data falls within a normal curve, with only
a few data points outside this range. It is suggested to exclude the outlier data for a more accurate
The histogram displayed alongside represents the marks in mathematics distribution of the
participants. It indicates that the majority of the data falls within a normal curve, with only a few
data points outside this range. It is suggested to exclude the outlier data for a more accurate
Table:2
The above table shows that mean and median of Distance of home from the school are quite
similar i.e., 5.02 and 5 respectively. With this we can say that the data are normally distributed.
On the other hands the value of kurtosis is -0.155 which lies between +2 and -2. It also proves
that the Distance of home from school are normally distributed. Let’s have a quick look on value
of skewness which is 0.016, very near to the zero (0). This also shows above data is normally
Figure: 10
and standard deviation are 5.02 and 1.816 respectively. Our data seems that approximately 68%
of the data are contained in ±1 standard deviation from the mean i.e., Approximately 68% of the
data belongs to 16.13±1.14⇒14.99-17.27. In the same 95% of the data are necessary to be
included between (μ±2σ) and 99% of the data necessary to be included between (μ±3σ). Another
Table:3
marks in Mathematics
N Valid 200
Missing 0
Mean 69.30
Std. Error of Mean .418
Median 69.50
Mode 71
Std. Deviation 5.911
Variance 34.943
Skewness .022
Std. Error of Skewness .172
Kurtosis -.272
Std. Error of Kurtosis .342
Range 31
Minimum 55
Maximum 86
Sum 13859
Percentiles 10 62.00
20 64.00
25 65.00
30 66.00
40 68.00
50 69.50
60 71.00
70 72.00
75 74.00
80 75.00
90 77.00
The above table shows that mean and median of marks of mathematics are quite similar i.e.,
69.30 and 69.5 respectively. With this we can say that the data are normally distributed. On the
other hands the value of kurtosis is 0.72 which lies between +2 and -2. It also proves that the data
in marks in mathematics are normally distributed. Let’s draw a normal curve of marks in
mathematics of respondents.
Figure: 11
standard deviation are 69.3 and 5.99respectively. Our data seems that approximately 68% of the
data are contained in ±1 standard deviation from the mean i.e., Approximately 68% of the data
belongs to 69.±5.99 In the same 95% of the data are necessary to be included between (μ±2σ)
and 99% of the data necessary to be included between (μ±3σ). Another indication of being a
Table:4
The above table shows the custom of writing hand and caste of students. The highest
number of right-handed students are Brahmins, with 60 students. The lowest number of right-
handed students are from the "other" caste, with 20 students. The highest number of left-handed
students are from the "other" caste, with 50 students. The lowest number of left-handed students
are Brahmins, with 0 students. Brahmins: 60 students are right-handed and 0 students are left-
handed. Vaisyas: 20 students are right-handed and 0 students are left-handed. Other: 20 students
are right-handed and 50 students are left-handed. The table shows that there is a clear difference
in the distribution of writing hand by caste. Brahmins are more likely to be right-handed, while
students from the "other" caste are more likely to be left-handed. This difference may be due to a
number of right-handed students are Brahmin, with 60 students. The lowest number of right-
handed students are Vaisya, with 20 students. The highest number of left-handed students are
other caste, with 50 students. The lowest number of left-handed students are Brahmin, with 20
students
Figure: 11
The above scatter plot presents the relationship between the Weight of student and Marks in
mathematics. Looking at this scatter plot the data are spreading everywhere. It shows that there is
no linear relationship between the variables. The points are scattered across the graph without a
clear trend or pattern. However, there are some clusters in the lower regions of the middle part of
the figure, it represents a group with varying scores in both variable. One important attribute of
this above figure is there is no relationship between the marks in Math and weight of
participants.
Figure: 12
The above scatter plot presents the relationship between the marks in science by weight
of student by occupation of parents. Looking at this scatter plot the data are spreading
everywhere. It shows that there is no linear relationship between the variables. The points are
scattered across the graph without a clear trend or pattern. However, there are some clusters in
the lower regions of the middle part of the figure, it represents a group with varying scores in
both subjects. One important attribute of this above figure is there is no relationship between the
Marks in Marks in
Science Mathematics
Marks in Science Pearson Correlation 1 -0.054
Sig. (2-tailed) .444
N 200 200
Marks in Mathematics Pearson Correlation -0.054 1
Sig. (2-tailed) .444
N 200 200
The table shows that the degree correlations between the marks in science and mathematics.
The Pearson r-value is -0.54. It lies in between 0 to - 0.25. which means there is very weak
correlations between the variables that means marks in science does not effects marks in
Age of Weight of
Participants Participant
Distance of home from Pearson Correlation 1 -0.08
school Sig. (2-tailed) .912
N 200 200
Weight of Participant Pearson Correlation .-0.08 1
Sig. (2-tailed) .348
N 200 200
The table shows that the degree correlations between distance of home from school and
weight of participants. The Pearson r-value is -0.08. It lies in between 0 to- 0.25. Which means
there is very weak correlations between the variables that means Distance of home from school
Coefficients Coefficients
Mathematics
science. The model found that (F, (1,198) = 0.589, p (0.00)<0.05, with R2 =0.03 and coefficient=
-0.046. the R- square value explains that the 3% variability of marks in science can be described
Y= a+ b X
This regression model shows that there is positive impact of independent variables on
dependent variables. That is, if we increase the score of mathematics by 1, there is -0.046
changes in score in science, or if we increase the score of mathematics by 100%, there is 4.6%
Coefficients Coefficients
Participant
home from school. The model found that (F, (1,198) = 0.012, p (0.00)>0.05, with R2 =-0.005 and
coefficient= - 0.02. the R- square value explains that the 0.5% variability of Distance of school
Y= a+ b X
dependent variables. That is, if we increase distance of school by 1, there is- 0.002 changes in
Inferential Statistics
Table
From the table, we have t(199) = -0.937, p-value = 0.350 < α -value = 0.05. In this case,
we accept the null hypothesis and conclude that the mean score of the respondents in science is
Null Hypothesis (H0): The mean distance of home from school is 5 ( μ= 5).
Table
From the table, we have t(199) = 0.156, p-value = 0.876 >α -value = 0.05. In this case, we
accept the null hypothesis and conclude that the mean distance of home from school is around 5
Null Hypothesis (H0): There is no significant difference between the mean scores of respondents
Alternative Hypothesis (H1): There is significant difference between the mean scores of
Table 4
The results of the t-test showed that t(198) = 0.825, p-value = 0.411 > α-value = 0.05. In
this case, we must fail to reject the null hypothesis. The conclusion is that there is no significant
difference between the mean marks of respondents in mathematics based on their writing hand.
The mean mark of respondents in mathematics who are right-handed is 69.64 and that of
respondents who are left-handed is 68.95. These values are almost equal.
Null Hypothesis (H0): There is no significant difference in the mean weight of respondents based
on their gender.
Table 4
The results of the t-test showed that t(198) = 0.658, p-value = 0.511 > α-value = 0.05. In
this case, we must fail to reject the null hypothesis. The conclusion is that there is no significant
difference between the mean weight of respondents based on their gender. The mean weight of
respondents who are male is 39.92 and that of respondents who are female is 39.35. These values
Null Hypothesis (Ho): There is no difference between the mean score of pretest and posttest.
Alternative Hypothesis (Ha): There is difference between the mean scores of pretests and
posttest.
Table 1
From the table, we have t(49) = -4.993, p-value= 0.00 < α-value 0.05. The result Reject
null hypothesis and accept alternative hypothesis and conclusion is there is significant difference
between the mean scores in pretest and posttest. From the descriptive table, we have mean scores
in pretest and posttest 60.94 and 71.42 respectively. Mean score of Posttest is 10.48 more than
Pretest.
Perform F-test (ANOVA) analysis of any two applicable variables.
Table 1
Parameters df F Sig.
Total 199
From the table we have F(2, 197)= 0.164, p-value= 0.849> α-value = 0.05. in this case,
we have accepted hypothesis and conclude that there are no significant differences of distance of
home from school based on the occupation of parents. i.e occupation of parent has no effect on
the distance of home from school We can see the descriptive values in the following tables.
N Mean Std. Deviation
From table we have descriptive values, Government job (N = 90, M= 5.02, SD= 1.767),
Private (N =70, M= 4.94, SD= 1.948), and Physical Tool (N = 40, M= 5.15, SD= 1.718). The
data shows that each category has almost equal mean distance of home from school. It leads to
conclude that there is no effect of occupation of parents to Distance of home from school.
Ho: There is no significant difference between the observed and expected gender of respondents
Ha: There is significant difference between the observed and expected gender of respondents
Table 1
we retain null hypothesis and conclude that there is no significant difference between observed
and expected frequencies between male and female Respondent. In this case, The observe
Table 2
Null Hypothesis (Ho): There is no significant association between the gender and Writing hand
of respondent
Alternative Hypothesis (Ha): There is significant association between the gender and Writing
hand of respondent
Table 1
Test of Independence of Association between Gender and Writing hand of respondent
a. 0 cells (0.0%) have expected count less than 5. The minimum expected count is 43.5.
From the above table, we have χ2(1) = 0.138, p-value =0..669> α- value=0.05. therefore, we
fail to reject the null hypothesis and conclude that there is no association between the
variables. That is The observed frequencies are close enough to the expected frequencies, so
we cannot say that there is a significant difference between them. The data does not support
Table 2
Right Left
handed hand
Gender of Male Count 45 42 87
Count
Count
Count
From the above table, we have Male respondents who is right hand (Expected =43.5,
observed =45), Left hand (Expected = 43.5, observed = 42). Similarly, female respondents who
is right hand (Expected = 56.5, observed = 55), and left hand (Excepted = 56.5, observed = 58).
Null Hypothesis (Ho): There is no significance between mean score of mathematics and science.
Alternative Hypothesis (H1): There is significance between mean score of mathematics and
science.
Table 1
From a table, the p-value for the correlation coefficient is 0.444, which is greater than the
significance level of 0.05. Therefore, we fail to reject the null hypothesis and conclude that there
Perform Mann Whitney U Test of two variables if you have data which is not normally
distributed (Optional)
Task – III
Descriptive Statistics
Central Tendencies
Correlaton
#correlation
> cor(Marks_in_Mathematics,Marks_in_science)
[1] -0.05445055
> cor(Weight,Distance_of_home_from_school)
[1] -0.00787271
> cor(Distance_of_home_from_school,Marks_in_Mathematics)
[1] 0.0640648
> cor(Distance_of_home_from_school,Marks_in_science)
[1] 0.01897551
> cor(Weight,Marks_in_Mathematics)
[1] -0.03757749
> cor(Weight,Marks_in_science)
[1] -0.008848629
Measure Of Dispersion
Range
> range(Marks_in_Mathematics)
[1] 55 86
> range(Marks_in_science)
[1] 53 82
> range(Weight)
[1] 23 57
> range(Distance_of_home_from_school)
[1] 0 10
Variance
var(Weight)
[1] 37.33766
> var(Distance_of_home_from_school)
[1] 3.29608
> var(Marks_in_Mathematics)
[1] 34.94269
> var(Marks_in_science)
[1] 24.85726
> var(Weight)
[1] 37.33766
Mean Deviation
> mad(Weight)
[1] 5.9304
> mad(Marks_in_Mathematics)
[1] 6.6717
> mad(Marks_in_science)
[1] 5.9304
> mad(Distance_of_home_from_school)
[1] 1.4826
Standard Deviation
> sd(Weight)
[1] 6.110455
> sd(Marks_in_Mathematics)
[1] 5.911234
> sd(Marks_in_science)
[1] 4.985706
> sd(Distance_of_home_from_school)
[1] 1.815511
• Plots (1/1) (Scatter, Bar Plot, Violin Plot, Histogram, Line Plot, Box Plot, etc.)
Scatter Plot
Figure 1
Line graph
Figure 5
Figure: 8
-0.19174303 0.08493209
sample estimates:
cor
-0.05445055
-0.1754040 0.1016937
sample estimates:
cor
-0.03757749
-0.1474087 0.1300520
sample estimates:
cor
-0.008848629
t-test
data: Marks_in_Mathematics
68.60426 Inf
sample estimates:
mean of x
69.295
> t.test(Weight, mu = 55, conf.level = 0.95, alt = "greater")
data: Weight
38.88098 Inf
sample estimates:
mean of x
39.595
data: Distance_of_home_from_school
-Inf 5.232147
sample estimates:
mean of x
5.02
data: Marks_in_Mathematics
-Inf 69.98574
sample estimates:
mean of x
69.295
data: Marks_in_Mathematics
68.47075 70.11925
sample estimates:
mean of x
69.295
data: Marks_in_science
64.2598 65.6502
sample estimates:
mean of x
64.955
alternative hypothesis: true difference in means between group Left hand and group Right handed is not
equal to 0
-2.866647 1.486647
sample estimates:
68.95 69.64
alternative hypothesis: true difference in means between group Left hand and group Right handed is not
equal to 0
-0.3875033 0.9475033
sample estimates:
>
ANOVA
aov(Distance_of_home_from_school~Caste)
Call:
Terms:
Caste Residuals
Occupation_OF_Parrent Residuals
#One-way ANOVA
> aov(Marks_in_Mathematics~Caste)
Call:
Terms:
Caste Residuals
> n = aov(Marks_in_Mathematics~Caste)
> TukeyHSD(n)
$Caste
> plot(TukeyHSD(n))
>
plot(TukeyHSD(n))
> aov(Weight~Caste)
Call:
Terms:
Caste Residuals
> n = aov(Weight~Caste)
> TukeyHSD(n)
$Caste
> plot(TukeyHSD(n))