You are on page 1of 2

NATIONAL ECONOMICS UNIVERSITY

BUSINESS STATISTICS

Group Mid-term Exam


(Submission date: handout in the class, Wed, 20th Sep 2023)

Question 1

Consider the variable income in gss.sav file (the variable is total family income in the year before
the survey).
1. Make a frequency table for the variable. Does the frequency table make sense? Does it
make sense to make a histogram of the variable? A bar chart?
2. What is the scale of measurement for the variable?
3. What descriptive statistics are appropriate for describing this variable and why? Does it
make sense to compute a mean?
4. Discuss the advantages and disadvantages of recording income in this manner. Describe
other ways of recording income and the problem associated with each of them.
Ngân Hà, Hiền, Thảo, Khoa
Question 2
In the gss.sav file, the variable tvhours tells you how many hours per day GSS respondents say
they watch TV. 1. The value strikes strange is 12. From 0-10, the frequency is majorly high. However, from 11-24, the number
reduces significantly but the number of people watching TV 12 hours per day is suddenly high.
1. Make a frequency table of the hours of television watched. Do any of the values strike you
as strange? Explain.
6%
2. Based on the frequency table, answer the following questions: Of the people who answered
the question, what percentage don’t watch any television? What percentage watch two 53.1%
hours or less? Five hours or more? Of the people who watch TV, what percentage watch
one hour? What percentage watch four hours or less? > 5 hours: 100 -83.3 = 16.7% < 4 hours: 83.3%
1 hour: 20.9%
3. From the frequency table, estimate the 25th, 50th, 75th, 95th percentiles. What is the value
for the Median, Mode?
4. there are some missing values: 9,13,16,17,18,19,21,23
4. Make a bar chart of the hours of TV watched. What problem do you see with this display?
5. Make a histogram of the hours of TV watched. What causes all of the values to be clumped
together? Compare this histogram to the bar chart you generated in question 2d. Which is
a better display for these data?
5. Causes of all of the values to be clumped together: the hours people watch TV always >0, most people watch TV for 1-4 hours and
very little people watch TV for more than 10 hours
Question 3

Find a data set which is related to a specific organisational problem (either at the macro or micro
level) and apply all possible descriptive statistical techniques that you think suitable to the
5. Histogram is better to display the data because we can see all the missing values clearly and also the shape of the chart
problem. Write a short report, which includes the objectives of your analysis, the research
questions, the analytical techniques you apply to address to the research questions and your
findings. The maximum length of the report is 5 pages including Tables and Figures.

Hint: draw necessary outputs from SPSS that you find valuable to illustrate for your answer.

bài 2.3

You might also like