
1. The variables chosen


The variables chosen are population, the number of people who know English, total income
in 2015 for the population aged 15 years and over in private households, the number of
people who have a Bachelor’s degree, and income taxes (average amount, $). They are
recorded for 5 areas (the City of Toronto and four of its neighbourhoods): City of Toronto,
Agincourt North, Agincourt South-Malvern West, Alderwood, and Annex.
The data for all five variables are ratio data. The ratio scale is a quantitative scale, and
population is a typical example of it: there is a true zero and there are equal intervals
between neighbouring points. On a ratio scale, a zero means a total absence of the quantity
being measured. In summary, the data for the 5 variables can be categorized, ranked, and
evenly spaced, and they have a natural zero. Thus, the data for the 5 variables are ratio data.

2. A summary table
A summary table of 5 variables chosen: Population, the number of people who know English,
the total income statistics in 2015 for the population aged 15 years and over in private
households, the number of people who have Bachelor’s degree, the income taxes: average
amount ($)

- For population, the City of Toronto has the highest value and Alderwood has the lowest.
The same holds for the number of people who know English, for total income (in 2015) for
the population aged 15 years and over in private households, and for the number of people
who have a Bachelor’s degree.
- For income taxes (average amount, $), Annex has the highest value and Agincourt North
has the lowest.
- In summary, the City of Toronto has the highest value and Alderwood the lowest in 4 of
the 5 categories; average income tax is the exception.

3. The mean, median and mode


- In terms of the population variable, the mean is 565,404; the median is 29,113; there is
no mode.
- In terms of the number of people who know English, the mean is 518,282; the median is
22,595; there is no mode.
- In terms of total income in 2015 for the population aged 15 years and over, the mean is
475,350; the median is 25,005; there is no mode.
- In terms of the number of people who have a Bachelor’s degree, the mean is 89,650; the
median is 3,270; there is no mode.
- In terms of income taxes (average amount, $), the mean is 17,741; the median is 11,626;
there is no mode.
- In terms of the data for the City of Toronto, the mean is 1,597,198; the median is
2,294,785; there is no mode.
- In terms of the data for Agincourt North, the mean is 17,306; the median is 22,595; there
is no mode.
- In terms of the data for Agincourt South-Malvern West, the mean is 14,920; the median is
19,990; there is no mode.
- In terms of the data for Alderwood, the mean is 9,386; the median is 11,570; there is no
mode.
- In terms of the data for Annex, the mean is 27,618; the median is 28,440; there is no mode.

Characteristic                    City of Toronto  Agincourt North  Agincourt South-Malvern West  Alderwood  Annex   Mean     Median  Mode
Population, 2016                  2,731,571        29,113           23,757                        12,054     30,526  565,404  29,113  #N/A
English                           2,508,815        22,595           19,990                        11,570     28,440  518,282  22,595  #N/A
Total - Income statistics         2,294,785        25,005           20,400                        10,265     26,295  475,350  25,005  #N/A
Bachelor's degree                 433,620          3,090            3,270                         1,415      6,855   89,650   3,270   #N/A
Income taxes: Average amount ($)  17,197           6,726            7,185                         11,626     45,973  17,741   11,626  #N/A
Mean                              1,597,198        17,306           14,920                        9,386      27,618
Median                            2,294,785        22,595           19,990                        11,570     28,440
Mode                              #N/A             #N/A             #N/A                          #N/A       #N/A
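
The row statistics in the table can be reproduced with a short script. This is a minimal sketch using only Python's standard library, not the spreadsheet formulas the table was presumably built with. The names DATA, mode_or_none and summarize are illustrative, and treating "no repeated value" as "no mode" is an assumption that mirrors the #N/A entries.

```python
from collections import Counter
from statistics import mean, median

# Values copied from the summary table, in the order:
# City of Toronto, Agincourt North, Agincourt South-Malvern West, Alderwood, Annex
DATA = {
    "Population, 2016": [2731571, 29113, 23757, 12054, 30526],
    "English": [2508815, 22595, 19990, 11570, 28440],
    "Total - Income statistics": [2294785, 25005, 20400, 10265, 26295],
    "Bachelor's degree": [433620, 3090, 3270, 1415, 6855],
    "Income taxes: Average amount ($)": [17197, 6726, 7185, 11626, 45973],
}


def mode_or_none(values):
    """Return the most frequent value, or None when no value repeats
    (assumed here to be the case a spreadsheet reports as #N/A)."""
    value, count = Counter(values).most_common(1)[0]
    return value if count > 1 else None


def summarize(values):
    """Return (mean rounded to the nearest integer, median, mode)."""
    return round(mean(values)), median(values), mode_or_none(values)


for name, values in DATA.items():
    print(name, summarize(values))
```

The column statistics (one area across all five characteristics) follow the same pattern if the values are regrouped by area instead of by characteristic.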

4. All the data is quantitative data.


The reason is that every value is recorded as a count or a number, so each observation in
the data set has a numerical value.
Population, the number of people who know English, and the number of people who have a
Bachelor’s degree are discrete data, because discrete data consist of whole-number counts
only and cannot be subdivided.
Total income in 2015 for the population aged 15 years and over in private households
and income taxes (average amount, $) are continuous data, which take numeric values
that can be broken down into smaller units. Besides, continuous data can be placed on a
measurement scale.
5. The variance and standard deviation

- The skewness of each of the 5 variables is higher than 1, which means the distributions
are highly skewed.
- For all 5 variables, the distribution is positively skewed: a skewness above 1 indicates a
longer or fatter tail on the right side of the distribution. This agrees with the summary
table, where every mean is larger than the corresponding median.
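
The variance, standard deviation and skewness can be checked with a short script. This is a minimal sketch that assumes the sample versions of these statistics (n - 1 denominator) and the adjusted Fisher-Pearson skewness coefficient, which is the form commonly used by spreadsheet SKEW functions. The report does not say which formula it used, and the names DATA and sample_skewness are illustrative.

```python
from statistics import mean, stdev, variance

# Values copied from the summary table, in the order:
# City of Toronto, Agincourt North, Agincourt South-Malvern West, Alderwood, Annex
DATA = {
    "Population, 2016": [2731571, 29113, 23757, 12054, 30526],
    "English": [2508815, 22595, 19990, 11570, 28440],
    "Total - Income statistics": [2294785, 25005, 20400, 10265, 26295],
    "Bachelor's degree": [433620, 3090, 3270, 1415, 6855],
    "Income taxes: Average amount ($)": [17197, 6726, 7185, 11626, 45973],
}


def sample_skewness(values):
    """Adjusted Fisher-Pearson skewness:
    n / ((n - 1) * (n - 2)) * sum(((x - mean) / s) ** 3),
    where s is the sample standard deviation. Needs at least 3 values."""
    n = len(values)
    m = mean(values)
    s = stdev(values)
    return n / ((n - 1) * (n - 2)) * sum(((x - m) / s) ** 3 for x in values)


for name, values in DATA.items():
    print(name)
    print("  variance:", variance(values))
    print("  standard deviation:", stdev(values))
    print("  skewness:", sample_skewness(values))
```

A positive result means the long tail is on the right (a few unusually large values); a negative result means it is on the left.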

6. The range, interquartile range, mean absolute deviation, standard deviation using
the empirical rule and Chebyshev’s theorem
The data for the 5 variables are not normally distributed, so the empirical rule does not
apply and we follow Chebyshev’s theorem instead.
7. Determine the probability of the 5 variables
Following Chebyshev’s inequality, we can bound the probability that a random variable X is
within k standard deviations of the mean: it is at least 1 - 1/k^2. In this case, since
k = 2, the probability that X is within 2 standard deviations of the mean is at least
1 - 1/4 = 0.75. This result illustrates that we do not know the exact probability that X is
within 2 standard deviations of the mean, but it cannot be less than 0.75.
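
The bound can be written as a one-line function. This is a minimal sketch; the function name is illustrative.

```python
def chebyshev_lower_bound(k):
    """Lower bound on P(|X - mean| < k * sd) given by Chebyshev's inequality."""
    if k <= 1:
        # For k <= 1 the bound is zero or negative and says nothing useful.
        raise ValueError("the bound is only informative for k > 1")
    return 1 - 1 / k**2


print(chebyshev_lower_bound(2))  # 0.75
```

The bound holds for any distribution with a finite mean and variance, which is why it can be used when the data are not normal.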

8. Is the sample size appropriate for the data set? Explain why?
The population size of the data set is 25. With a confidence interval of 0.5 and a confidence
level of 95%, the required sample size is 25, so the sample size is appropriate for the data set.
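
One way to arrive at this figure is Cochran's sample-size formula with a finite population correction. This is a sketch under assumptions the text does not state: the "confidence interval" of 0.5 is read as a margin of error of 0.5 percentage points (0.005), the proportion is set to the most conservative value p = 0.5, and z = 1.96 is used for the 95% confidence level. The function name is illustrative.

```python
from math import ceil


def sample_size(population, margin, z=1.96, p=0.5):
    """Cochran's formula with finite population correction.

    population: size of the finite population
    margin:     margin of error as a proportion (0.005 = 0.5 percentage points)
    z:          z-value for the confidence level (1.96 for 95%)
    p:          assumed proportion; 0.5 gives the largest sample size
    """
    n0 = z**2 * p * (1 - p) / margin**2        # size for an infinite population
    n = n0 / (1 + (n0 - 1) / population)       # finite population correction
    return ceil(n)


print(sample_size(population=25, margin=0.005))  # 25
```

With a margin this tight and a population this small, the formula asks for the whole population.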

9. Calculate the z-scores for the variables
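
A z-score expresses each value as the number of standard deviations it lies from the mean of its variable, z = (x - mean) / s. This is a minimal sketch, assuming the sample standard deviation (n - 1 denominator) is intended; the names AREAS, DATA and z_scores are illustrative.

```python
from statistics import mean, stdev

AREAS = [
    "City of Toronto",
    "Agincourt North",
    "Agincourt South-Malvern West",
    "Alderwood",
    "Annex",
]

# Values copied from the summary table, in the same order as AREAS
DATA = {
    "Population, 2016": [2731571, 29113, 23757, 12054, 30526],
    "English": [2508815, 22595, 19990, 11570, 28440],
    "Total - Income statistics": [2294785, 25005, 20400, 10265, 26295],
    "Bachelor's degree": [433620, 3090, 3270, 1415, 6855],
    "Income taxes: Average amount ($)": [17197, 6726, 7185, 11626, 45973],
}


def z_scores(values):
    """Return (x - mean) / s for each value, with s the sample standard deviation."""
    m = mean(values)
    s = stdev(values)
    return [(x - m) / s for x in values]


for name, values in DATA.items():
    print(name)
    for area, z in zip(AREAS, z_scores(values)):
        print(f"  {area}: {z:.2f}")
```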

10. Describe the distribution of the sample’s mean for the 5 variables using the
central limit theorem
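The central limit theorem says that the distribution of the sample mean approaches a
normal distribution as the sample size grows, whatever the shape of the underlying
distribution. A central limit theorem word problem will also often contain the phrase
“assume the variable is normally distributed”, so the sample’s mean for each of the 5
variables is assumed to be normally distributed.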
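
The effect of sample size on the distribution of the sample mean can be illustrated with a small simulation. This is only a sketch on synthetic data: it draws from an exponential distribution (chosen here because it is strongly right-skewed, not because the report's variables follow it) and compares the skewness of the sample means for samples of size 5 and 50. All names and parameters are illustrative.

```python
import random


def skewness(values):
    """Population-style skewness: third central moment / (second central moment) ** 1.5."""
    n = len(values)
    m = sum(values) / n
    m2 = sum((x - m) ** 2 for x in values) / n
    m3 = sum((x - m) ** 3 for x in values) / n
    return m3 / m2**1.5


def sample_mean_skewness(sample_size, draws=5000, seed=0):
    """Skewness of the means of `draws` samples of `sample_size` exponential values."""
    rng = random.Random(seed)  # fixed seed so the run is repeatable
    means = [
        sum(rng.expovariate(1.0) for _ in range(sample_size)) / sample_size
        for _ in range(draws)
    ]
    return skewness(means)


print("n = 5: ", sample_mean_skewness(5))
print("n = 50:", sample_mean_skewness(50))
```

The skewness of the sample means should come out clearly smaller for the larger sample size, which is the central limit theorem at work: the means look more normal as n grows.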

11. The population mean for all variables is 25.


12. Determine the sample size needed to estimate the population mean for 5
variables by year.
The sample size needed should be at least 30.
