Professional Documents
Culture Documents
2. A summary table
A summary table of 5 variables chosen: Population, the number of people who know English,
the total income statistics in 2015 for the population aged 15 years and over in private
households, the number of people who have Bachelor’s degree, the income taxes: average
amount ($)
-
- For the category of population, city of Toronto has the highest number, Alderwood has the
lowest number. For the category of the number of people who know English, city of Toronto
has the highest number, Alderwood has the lowest number. For the category of Total-income
(in 2015) for the population aged 15 years and over in private households, city of Toronto has
the highest number, Alderwood has the lowest number. For the category of Bachelor’s
degree, city of Toronto has the highest number, Alderwood has the lowest number. For the
category of income taxes – average amount ($), Annex has the highest number, Agincourt
North has the lowest number.
- City of Toronto has the highest number for the 5 categories and Alderwood has the lowest
number of the 5 categories.
Agincour
t South-
City of Agincourt Malvern
Characteristic Toronto North West Alderwood Annex Mean Median Mode
Population,
2016 2,731,571 29,113 23,757 12,054 30,526 565,404 29,113 #N/A
Total - Income
statistics 2,294,785 25,005 20,400 10,265 26,295 475,350 25,005 #N/A
Bachelor's
degree 433620 3090 3270 1415 6855 89650 3270 #N/A
Income taxes:
Average
amount ($) 17,197 6,726 7,185 11,626 45,973 17,741 11,626 #N/A
-
- The skewness of 5 variables are higher than 1, which means the distribution is highly
skewed.
- Of all 5 variables, the graphs are negatively skewed as negative skew refers to a longer or
fatter tail on the left side of the distribution.
6. The range, interquartile range, mean absolute deviation, standard deviation using
the empirical rule and Chebyshev’s theorem
The data of 5 variables are not normally distributed, so we follow Chebyshev’s theorem.
7. Determine the probability of the 5 variables
We can estimate the probability that a random variable X is within k standard deviations of
the mean following Chebyshev’s Inequality. In this case, since k=2, the probability that X is
within 2 standard deviations from the mean is at least 0.75. This result illustrates that we
don't know the exact probability that X is within 2 standard deviations of the mean, but such
probability must be greater than 0.75.
8. Is the sample size appropriate for the data set? Explain why?
The population size of the data set is 25, if the confidence Interval is 0.5, the confidence level
is 95%, the result of sample size is 25, so the sample size is appropriate for the data set.
10. Describe the distribution of the sample’s mean for the 5 variables using the
central limit theorem
A Central Limit Theorem word issue will likely contain the phrase “assume
the variable is normally distributed”, so the sample’s mean for 5 variables using the central
limit theorem is assumed to be normally distributed.