
APPLIED STATISTICS (BUM2413), SEMESTER 1 2022/2023

ASSIGNMENT BUM2413

APPLIED STATISTICS

SEMESTER II 2022/2023

SECTION: 14P GROUP NAME: SIGMASQUARE


GROUP MEMBERS (State the PIC) STUDENT ID

1. DANIAL WAFI BIN RAMLI CB21066

2. AIDIEL IZZAHARUDDIN BIN ABU BAKAR CB21079

3. MOHAMAD HATTA RAHMAN BIN BORHANUDDIN CB21080

4. HASNHIAH AULIA BINTI ABDUL HARIS CB21031

5. ROYSTON OSCAR MORRIS CB21097

6.

LECTURER: CIK NUR ZAHIRAH BINTI MD. NOOR

SUBMISSION DATE:

FOR EXAMINER USE ONLY

Question   Marks   Your Marks     Question   Marks   Your Marks
1          2                      8          2
2          1                      9          2
3          2                      10         4
4          7                      11         24
5          3                      12         1
6          4
7          8
TOTAL      60


1. Identify a topic/ problem that you are interested to study. Provide a brief description of
your study and state at least ONE (1) study objective.

Title: "The Impact of Time Spent on Social Media on CGPA: A Study on UMP Students"

Brief Description:
This study aims to investigate the relationship between the time spent on social media and
the Cumulative Grade Point Average (CGPA) of Universiti Malaysia Pahang (UMP)
students. With the increasing use of social media platforms among students, it is important
to understand whether excessive time spent on social media negatively affects academic
performance, as measured by CGPA.

Study Objective:
The objective of this study is to examine the potential impact of time spent on social
media on UMP students' CGPA. Specifically, the study aims to analyze whether there is a
correlation between the amount of time students spend on social media (categorized as 1-2
hours, 3-4 hours, 5-6 hours, and more) and their CGPA. The study seeks to determine if
increased time spent on social media is associated with a decrease in CGPA among UMP
students, thereby highlighting the potential negative effects of excessive social media
usage on academic performance.

(2 Marks)
2. State the population of the study.

- All Universiti Malaysia Pahang students.


(1 Mark)
3. Determine a single quantitative variable that is related to your chosen problem. Identify
the type of level of measurement for the variable.
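● A single quantitative variable that describes the problem is Cumulative Grade Point
Average (CGPA). CGPA is a numerical value on a 0.00 to 4.00 scale on which differences
between values are meaningful, so its level of measurement is interval-level rather
than ordinal-level.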

(2 Marks)
4. Divide the data collected into two significant groups (e.g.: gender (male/female), faculty,
year of study, etc.) that related to the study. The sample size is at least 50 observation for
each group.

(i) State the name of the groups.


- Pekan and Gambang (1 Mark)

(ii) Present the data collected according to the groups in a table.


No. Pekan Gambang

1. 3.95 3.95

2. 3.43 3.40

3. 3.49 3.25

4. 3.50 3.10

5. 3.00 3.59

6. 3.56 3.14

7. 3.71 3.05

8. 3.52 3.00

9. 3.65 2.86

10. 3.48 4.00

11. 3.35 3.56

12. 3.48 3.44

13. 3.41 3.97

14. 3.00 3.32

15. 3.10 3.53

16. 3.50 3.40

17. 2.96 2.90

18. 3.30 3.48

19. 3.50 3.67

20. 3.00 3.30

21. 3.30 3.00

22. 3.72 3.75

23. 2.62 3.70

24. 3.20 3.50

25. 3.22 3.33

26. 3.10 3.69


27. 3.20 3.88

28. 3.49 3.77

29. 3.40 3.52

30. 3.00 2.55

31. 3.00 2.87

32. 3.33 3.89

33. 3.00 3.88

34. 3.21 3.12

35. 3.02 3.33

36. 3.00 3.50

37. 3.04 3.78

38. 3.10 3.45

39. 3.06 3.40

40. 3.40 3.90

41. 3.40 3.50

42. 3.00 3.00

43. 3.60 3.55

44. 3.70 2.90

45. 3.05 3.01

46. 2.83 3.11

47. 3.40 4.00

48. 3.29 3.22

49. 3.33 2.56

50. 3.00 3.37

(2 Marks)


(iii) Identify the method of data collection being used. Provide the significant
evidence.
- Questionnaires and surveys.
We've created a Google form for students to fill out regarding how many credit
hours they've taken this semester. We also include questions about which campus
they are from, as well as their current CGPA, in the form.
(2 Marks)
(iv) State the sampling method you use to collect the data. Explain the sampling method
process.
- Voluntary sampling method.
We use a Google form to create and edit questions, which we then share with
students via social media platforms such as WhatsApp, Telegram, and Instagram.
This method provides a larger sample size and a quick response.
(2 Marks)
5. For each set of data, obtain the descriptive statistics using Microsoft Excel. Then,
summarise the measures of central tendency and measures of variation in the following
table.


Group Name           Measures of central tendency    Measures of variation

Group 1 (Pekan)      Mean = 3.2923                   Standard Deviation = 0.2931
                     Median = 3.3150                 Sample Variance = 0.0859
                     Mode = 3.0000                   Range = 1.3800
                     Midrange = 3.3100               Coefficient of Variation = 8.9026%

Group 2 (Gambang)    Mean = 3.4227                   Standard Deviation = 0.3628
                     Median = 3.4650                 Sample Variance = 0.1316
                     Mode = 3.5000                   Range = 1.4500
                     Midrange = 3.2750               Coefficient of Variation = 10.5998%
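
The report obtains these measures from Microsoft Excel. As a cross-check only, the same quantities can be sketched in Python with the standard `statistics` module. The function names `describe` and `cv_percent` and the short `example` list are illustrative assumptions, not part of the report, and the example values are made up rather than taken from the survey.

```python
import statistics


def describe(values):
    """Return measures of central tendency and variation for one group."""
    mean = statistics.mean(values)
    std_dev = statistics.stdev(values)  # sample standard deviation (divides by n - 1)
    return {
        "mean": mean,
        "median": statistics.median(values),
        "modes": statistics.multimode(values),  # every most-frequent value
        "midrange": (min(values) + max(values)) / 2,
        "std_dev": std_dev,
        "variance": statistics.variance(values),  # sample variance (divides by n - 1)
        "range": max(values) - min(values),
        "cv_percent": std_dev / mean * 100,
    }


def cv_percent(std_dev, mean):
    """Coefficient of variation as a percentage of the mean."""
    return std_dev / mean * 100


# Made-up values for illustration only, not the survey data.
example = [3.0, 3.0, 3.5, 4.0]
summary = describe(example)
print(summary)

# The reported standard deviations and means reproduce the reported CVs.
print(round(cv_percent(0.2931, 3.2923), 4))
print(round(cv_percent(0.3628, 3.4227), 4))
```

Passing a full column of CGPA values to `describe` gives every entry of one table row in a single call.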

(3 Marks)
6. Compare and comment the measures of central tendency and measures of variation
between Group 1 and Group 2.

● For group 1 (Pekan), the measures of central tendency are ordered mode < mean < median.
Because the mean is smaller than the median, the distribution of data is left-skewed.
For group 2 (Gambang), the order is mean < median < mode, so the distribution of data
is also left-skewed.
● Based on the measures of central tendency, the most frequently occurring CGPA
for group 1 (Pekan) is 3.0000, while for group 2 (Gambang) it is 3.5000.
● Based on the measures of central tendency, the average CGPA for group 1 (Pekan)
is 3.2923 and for group 2 (Gambang) is 3.4227, so Gambang has the higher average.
● The median for group 1 (Pekan) is 3.3150, while for group 2 (Gambang) it is
3.4650.
● Based on the measures of variation, the variance of group 1 (Pekan) is smaller than
that of group 2 (Gambang): 0.0859 for group 1 (Pekan) and 0.1316 for group 2
(Gambang). We conclude that the CGPA of group 1 (Pekan) is less spread out than that
of group 2 (Gambang).
● Based on the measures of variation, the coefficient of variation for group 1 (Pekan)
is smaller than that of group 2 (Gambang): 8.9026% for group 1 (Pekan) and 10.5998%
for group 2 (Gambang). Group 1 (Pekan) therefore has less variability relative to its
mean than group 2 (Gambang).

(4 Marks)


7. Construct box plots for the two sets of data on the same axis. Identify the shape of
distribution for each boxplot. Compare and comment on the average and variability of
the boxplots.

Gambang Pekan

Minimum 2.55 2.62

Q1 3.13 3.02

Q2(Median) 3.47 3.30

Q3 3.70 3.49

Maximum 4.00 4.00

IQR 0.57 0.47

Lower Limit 2.28 2.32

Upper Limit 4.56 4.20
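
The lower and upper limits in the table follow the usual boxplot rule, Q1 - 1.5 × IQR and Q3 + 1.5 × IQR. A minimal Python sketch of that arithmetic is given here, using the quartiles reported above. The function names are illustrative, and the `method="inclusive"` setting assumes the quartiles were obtained with an Excel-style inclusive quartile rule, which the report does not state.

```python
import statistics


def five_number_summary(values):
    """Minimum, Q1, median, Q3 and maximum (inclusive quartile method assumed)."""
    q1, q2, q3 = statistics.quantiles(values, n=4, method="inclusive")
    return min(values), q1, q2, q3, max(values)


def fences(q1, q3):
    """Return (IQR, lower limit, upper limit) for outlier detection."""
    iqr = q3 - q1
    return iqr, q1 - 1.5 * iqr, q3 + 1.5 * iqr


# Quartiles as reported in the table above.
gambang = fences(3.13, 3.70)
pekan = fences(3.02, 3.49)
print([round(x, 3) for x in gambang])
print([round(x, 3) for x in pekan])
```

Any observation outside the two limits would be drawn as an outlier. Both groups have their minimum and maximum inside the limits, so neither boxplot shows outliers.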

Figure ?: Boxplot of CGPA of students at Gambang and Pekan


Shape of distribution:
Gambang: Left-skewed distribution (the median, 3.47, is closer to Q3 than to Q1)
Pekan: Left-skewed distribution (the median, 3.30, is closer to Q3 than to Q1)

Average:
Mean Gambang (3.4227) > Mean Pekan (3.2923)

Variability:
Pekan is more consistent than Gambang because the interquartile range for
Pekan (0.47) is lower than that for Gambang (0.57).

(8 Marks)
8. What is the best measure of central tendency to describe your data? Give a reason.
Mean. The mean is the average CGPA. It is calculated by summing all the values and
dividing the sum by the total number of values, which is 118. Comparing the mean CGPA
indicates whether the time spent on social media may or may not affect students' CGPA.

(2 Marks)
9. What is the best measure of variation to describe your data? Give a reason.
Interquartile Range (IQR). It represents the spread of the middle 50% of the CGPA values
in the dataset and is useful in identifying outliers, which helps to interpret how the
CGPA earned varies with the amount of time spent on social media.
(2 Marks)
10. Construct a normal probability plot for each data set. Do the data appear to come from
an approximately normal distribution?


Figure ?: Normal Probability Plot of CGPA of UMP students
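
A normal probability plot pairs each sorted observation with the z-score expected at its rank; the points fall close to a straight line when the data are approximately normal. The sketch below shows one way to compute those pairs in Python. The plotting position (i - 0.5)/n and the function names are assumptions for illustration, since the report does not say which convention was used for its figure.

```python
from statistics import NormalDist


def normal_scores(n):
    """Expected z-scores for ranks 1..n, using plotting position (i - 0.5) / n."""
    std_normal = NormalDist()
    return [std_normal.inv_cdf((i - 0.5) / n) for i in range(1, n + 1)]


def probability_plot_points(values):
    """Pair each sorted observation with its expected z-score."""
    ordered = sorted(values)
    return list(zip(normal_scores(len(ordered)), ordered))


# Made-up values for illustration only, not the survey data.
for z, cgpa in probability_plot_points([3.5, 3.0, 4.0]):
    print(round(z, 4), cgpa)
```

Plotting the z-scores on one axis against the CGPA values on the other gives the normal probability plot for a group.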

(4 Marks)

11. In Chapter 3, you have learnt statistical hypothesis testing concerning a parameter(s) of
one and two populations. Hypothesis testing is one of the inferential statistics in statistical
analysis. The parameters are the population mean, proportion, variance and standard
deviation. Assuming that the data obtained in (4) is normally distributed population,
answer the following questions using P-value approach and Microsoft Excel.
(NOTE: Create your own hypothesised mean with justification, may use the overall mean of
the data)
a. Create a situation and conduct a hypothesis testing for one population mean
from one of the groups.

STEP 1: Formulate the hypothesis

H0: µ ≥ 3.0 (Claim)

H1: µ < 3.0

STEP 2: Key in data


Figure 5: Measure of central tendency and variation for Gambang

STEP 3: Calculate the Z-test statistic and P-value

Z = (sample mean - µ0) / standard error
  = (3.422679 - 3.0) / 0.048478
  = 8.7190
P-value = P(Z < 8.7190) ≈ 1 (left-tailed test)

STEP 4: Make a decision to reject or not reject H0.


Since P-value (≈ 1) > α (0.05), do not reject H0.

STEP 5: Conclusion
At α = 0.05, there is not enough evidence to reject the claim that the mean CGPA of
Gambang students is at least 3.0.
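
The calculation in Step 3 can be reproduced outside Excel. The following is a minimal Python sketch of a left-tailed Z-test using the sample mean, hypothesised mean and standard error quoted above; the function name `z_test_left_tailed` is an illustrative assumption.

```python
from statistics import NormalDist


def z_test_left_tailed(sample_mean, mu0, standard_error):
    """Return (z, p_value) for H0: mu >= mu0 against H1: mu < mu0."""
    z = (sample_mean - mu0) / standard_error
    p_value = NormalDist().cdf(z)  # area to the left of z under the standard normal
    return z, p_value


# Values quoted in Step 3 for the Gambang group.
z, p = z_test_left_tailed(3.422679, 3.0, 0.048478)
alpha = 0.05
print(round(z, 4), round(p, 4))
print("reject H0" if p < alpha else "do not reject H0")
```

Because the sample mean lies far above the hypothesised mean, nearly all of the standard normal area is to the left of z, which is why the P-value is effectively 1.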

(8 Marks)

b. Choose one probability sampling method to select less than 30 data from each
group.


i. Identify which sampling method you choose to select the data and explain the
sampling method process.

- Using the systematic sampling method to choose 30 CGPA values from each of
Pekan and Gambang.
(2 Marks)
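
Systematic sampling generally picks a random starting point and then takes every k-th observation, where k is the population size divided by the sample size. The sketch below illustrates that general procedure in Python; it is not necessarily the exact procedure used in this report. The function name, the seed and the sample size of 25 are assumptions chosen for illustration.

```python
import random


def systematic_sample(values, n, seed=None):
    """Select n items by taking every k-th value after a random start."""
    if not 0 < n <= len(values):
        raise ValueError("n must be between 1 and len(values)")
    k = len(values) // n                      # sampling interval
    start = random.Random(seed).randrange(k)  # random start among the first k items
    return values[start::k][:n]


# Illustration with observation numbers 1..50 rather than CGPA values.
observation_numbers = list(range(1, 51))
chosen = systematic_sample(observation_numbers, 25, seed=1)
print(chosen)
```

With 50 observations and a sample of 25, the interval is k = 2, so every second observation is selected after the random start.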
ii. Present the selected data in a table.

Group 1 Group 2

2.86 2.83

2.9 3

3 3

3.1 3

3.14 3

3.3 3.05

3.4 3.1

3.45 3.2

3.5 3.22

3.5 3.3

3.53 3.33

3.58 3.35

3.69 3.4

3.75 3.43

3.88 3.48

3.9 3.5

4 3.5

2.55 3.6

2.87 3.71

3 3.8


3.01 2.62

3.11 2.88

3.22 3

3.32 3

3.37 3

3.4 3.02

3.45 3.06

3.5 3.1

3.5 3.2

3.55 3.29

(2 Marks)

c. Create a situation to conduct a hypothesis testing using the data selected in (b) to
compare two population means between the groups.
(12 Marks)
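
One way a comparison of two population means could be set up is sketched below in Python. It computes only the test statistic and degrees of freedom for a two-sample t-test that does not assume equal variances (Welch's form). That choice of test and the function name `welch_t` are assumptions for illustration; the report itself does not state a test or a result for this part.

```python
import math
import statistics


def welch_t(sample_a, sample_b):
    """Return (t, df) for a two-sample t-test without assuming equal variances."""
    n_a, n_b = len(sample_a), len(sample_b)
    a = statistics.variance(sample_a) / n_a  # squared standard error, group A
    b = statistics.variance(sample_b) / n_b  # squared standard error, group B
    t = (statistics.mean(sample_a) - statistics.mean(sample_b)) / math.sqrt(a + b)
    df = (a + b) ** 2 / (a ** 2 / (n_a - 1) + b ** 2 / (n_b - 1))
    return t, df


# Made-up values for illustration only, not the selected CGPA data.
t, df = welch_t([1.0, 2.0, 3.0, 4.0, 5.0], [2.0, 4.0, 6.0, 8.0, 10.0])
print(round(t, 4), round(df, 4))
```

The two columns of the table in (b)(ii) can be passed as `sample_a` and `sample_b`; the P-value for the resulting t and df would then come from a t-distribution table or from Excel.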

12. Based on your problem/ topic stated in (1), give any relevant conclusion for the study.
(1 Marks)
