Basic Stats, Hypothesis and Normal Distribution
Quantitative variables
E.g. number of children in family
Score in maths
Sales in $ for a product
Height/weight of student
Per capita income

Continuous: infinite number of values
E.g. sales in $, per capita income, weight/height of student, etc.
Discrete: limited (countable) number of values
E.g. number of children in family, number of cars in city
Qualitative variables / classification variables / categorical variables
Defined or limited number of levels
E.g. gender, size of T-shirt, winning position, colors

Nominal: levels cannot be ordered
E.g. gender, color
Ordinal: levels can be ordered
E.g. size of T-shirt: S < M < XL < XXL
Winning position: 1st > 2nd runner-up > 3rd runner-up
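
The difference between ordinal and nominal levels can be sketched in a few lines of Python. The names `SIZE_ORDER` and `sort_by_level` are illustrative and not part of the original material; the size levels are the ones listed in the T-shirt example.

```python
# Ordinal variable: the levels have a meaningful order.
SIZE_ORDER = ["S", "M", "XL", "XXL"]  # levels as listed in the text

def sort_by_level(values, order):
    """Sort ordinal values by their position in the ordered list of levels."""
    rank = {level: i for i, level in enumerate(order)}
    return sorted(values, key=rank.__getitem__)

shirts = ["XL", "S", "XXL", "M", "S"]
sorted_shirts = sort_by_level(shirts, SIZE_ORDER)

# Nominal variable: levels can only be counted, not ranked.
colors = ["red", "blue", "red", "green"]
color_counts = {c: colors.count(c) for c in set(colors)}
```

Sorting is meaningful only for the ordinal variable; for the nominal one, a frequency count per level is the natural summary.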
Score data for a class
id Gender race ses schtyp prog read write math science socst
70 Male 4 1 1 1 57 52 41 47 57
121 Female 4 2 1 3 68 59 53 63 61
86 Male 4 3 1 1 44 33 54 58 31
141 Male 4 3 1 3 63 44 47 53 56
172 Male 4 2 1 2 47 52 57 53 61
113 Male 4 2 1 2 44 52 51 63 61
50 Male 3 2 1 1 50 59 42 53 61
11 Male 1 2 1 2 34 46 45 39 36
84 Male 4 2 1 1 63 57 54 58 51
48 Male 3 2 1 2 57 55 52 50 51
75 Male 4 2 1 3 60 46 51 53 61
60 Male 4 2 1 2 57 65 51 63 61
95 Male 4 3 1 2 73 60 71 61 71
104 Male 4 3 1 2 54 63 57 55 46
38 Male 3 1 1 2 45 57 50 31 56
115 Male 4 1 1 1 42 49 43 50 56
76 Male 4 3 1 2 47 52 51 50 56
195 Male 4 2 2 1 57 57 60 58 56
114 Male 4 3 1 2 68 65 62 55 61
85 Male 4 2 1 1 55 39 57 53 46
167 Male 4 2 1 1 63 49 35 66 41
143 Male 4 2 1 3 63 63 75 72 66
41 Male 3 2 1 2 50 40 45 55 56
20 Male 1 3 1 2 60 52 57 61 61
12 Male 1 2 1 3 37 44 45 39 46
53 Male 3 2 1 3 34 37 46 39 31
154 Male 4 3 1 2 65 65 66 61 66
178 Male 4 2 2 3 47 57 57 58 46
196 Male 4 3 2 2 44 38 49 39 46
29 Male 2 1 1 1 52 44 49 55 41
126 Male 4 2 1 1 42 31 57 47 51
103 Male 4 3 1 2 76 52 64 64 61
192 Male 4 3 2 2 65 67 63 66 71
150 Male 4 2 1 3 42 41 57 72 31
199 Male 4 3 2 2 52 59 50 61 61
144 Male 4 3 1 1 60 65 58 61 66
200 Male 4 2 2 2 68 54 75 66 66
80 Male 4 3 1 2 65 62 68 66 66
16 Male 1 1 1 3 47 31 44 36 36
153 Male 4 2 1 3 39 31 40 39 51
176 Male 4 2 2 2 47 47 41 42 51
177 Male 4 2 2 2 55 59 62 58 51
168 Male 4 2 1 2 52 54 57 55 51
40 Male 3 1 1 1 42 41 43 50 41
62 Male 4 3 1 1 65 65 48 63 66
169 Male 4 1 1 1 55 59 63 69 46
49 Male 3 3 1 3 50 40 39 49 47
136 Male 4 2 1 2 65 59 70 63 51
189 Male 4 2 2 2 47 59 63 53 46
7 Male 1 2 1 2 57 54 59 47 51
27 Male 2 2 1 2 53 61 61 57 56
128 Male 4 3 1 2 39 33 38 47 41
21 Male 1 2 1 1 44 44 61 50 46
183 Male 4 2 2 2 63 59 49 55 71
132 Male 4 2 1 2 73 62 73 69 66
15 Male 1 3 1 3 39 39 44 26 42
67 Male 4 1 1 3 37 37 42 33 32
22 Male 1 2 1 3 42 39 39 56 46
185 Male 4 2 2 2 63 57 55 58 41
9 Male 1 2 1 3 48 49 52 44 51
181 Male 4 2 2 2 50 46 45 58 61
170 Male 4 3 1 2 47 62 61 69 66
134 Male 4 1 1 1 44 44 39 34 46
108 Male 4 2 1 1 34 33 41 36 36
197 Male 4 3 2 2 50 42 50 36 61
140 Male 4 2 1 3 44 41 40 50 26
171 Male 4 2 1 2 60 54 60 55 66
107 Male 4 1 1 3 47 39 47 42 26
81 Male 4 1 1 2 63 43 59 65 44
18 Male 1 2 1 3 50 33 49 44 36
155 Male 4 2 1 1 44 44 46 39 51
97 Male 4 3 1 2 60 54 58 58 61
68 Male 4 2 1 2 73 67 71 63 66
157 Male 4 2 1 1 68 59 58 74 66
56 Male 4 2 1 3 55 45 46 58 51
5 Male 1 1 1 2 47 40 43 45 31
159 Male 4 3 1 2 55 61 54 49 61
123 Male 4 3 1 1 68 59 56 63 66
164 Male 4 2 1 3 31 36 46 39 46
14 Male 1 3 1 2 47 41 54 42 56
127 Male 4 3 1 2 63 59 57 55 56
165 Male 4 1 1 3 36 49 54 61 36
174 Male 4 2 2 2 68 59 71 66 56
3 Male 1 1 1 2 63 65 48 63 56
58 Male 4 2 1 3 55 41 40 44 41
146 Male 4 3 1 2 55 62 64 63 66
102 Male 4 3 1 2 52 41 51 53 56
117 Male 4 3 1 3 34 49 39 42 56
133 Male 4 2 1 3 50 31 40 34 31
94 Male 4 3 1 2 55 49 61 61 56
24 Male 2 2 1 2 52 62 66 47 46
149 Male 4 1 1 1 63 49 49 66 46
82 Female 4 3 1 2 68 62 65 69 61
8 Female 1 1 1 2 39 44 52 44 48
129 Female 4 1 1 1 44 44 46 47 51
173 Female 4 1 1 1 50 62 61 63 51
57 Female 4 2 1 2 71 65 72 66 56
100 Female 4 3 1 2 63 65 71 69 71
1 Female 1 1 1 3 34 44 40 39 41
194 Female 4 3 2 2 63 63 69 61 61
88 Female 4 3 1 2 68 60 64 69 66
99 Female 4 3 1 1 47 59 56 66 61
47 Female 3 1 1 2 47 46 49 33 41
120 Female 4 3 1 2 63 52 54 50 51
166 Female 4 2 1 2 52 59 53 61 51
65 Female 4 2 1 2 55 54 66 42 56
101 Female 4 3 1 2 60 62 67 50 56
89 Female 4 1 1 3 35 35 40 51 33
54 Female 3 1 2 1 47 54 46 50 56
180 Female 4 3 2 2 71 65 69 58 71
162 Female 4 2 1 3 57 52 40 61 56
4 Female 1 1 1 2 44 50 41 39 51
131 Female 4 3 1 2 65 59 57 46 66
125 Female 4 1 1 2 68 65 58 59 56
34 Female 1 3 2 2 73 61 57 55 66
106 Female 4 2 1 3 36 44 37 42 41
130 Female 4 3 1 1 43 54 55 55 46
93 Female 4 3 1 2 73 67 62 58 66
163 Female 4 1 1 2 52 57 64 58 56
37 Female 3 1 1 3 41 47 40 39 51
35 Female 1 1 2 1 60 54 50 50 51
87 Female 4 2 1 1 50 52 46 50 56
73 Female 4 2 1 2 50 52 53 39 56
151 Female 4 2 1 3 47 46 52 48 46
44 Female 3 1 1 3 47 62 45 34 46
152 Female 4 3 1 2 55 57 56 58 61
105 Female 4 2 1 2 50 41 45 44 56
28 Female 2 2 1 1 39 53 54 50 41
91 Female 4 3 1 3 50 49 56 47 46
45 Female 3 1 1 3 34 35 41 29 26
116 Female 4 2 1 2 57 59 54 50 56
33 Female 2 1 1 2 57 65 72 54 56
66 Female 4 2 1 3 68 62 56 50 51
72 Female 4 2 1 3 42 54 47 47 46
77 Female 4 1 1 2 61 59 49 44 66
61 Female 4 3 1 2 76 63 60 67 66
190 Female 4 2 2 2 47 59 54 58 46
42 Female 3 2 1 3 46 52 55 44 56
2 Female 1 2 1 3 39 41 33 42 41
55 Female 3 2 2 2 52 49 49 44 61
19 Female 1 1 1 1 28 46 43 44 51
90 Female 4 3 1 2 42 54 50 50 52
142 Female 4 2 1 3 47 42 52 39 51
17 Female 1 2 1 2 47 57 48 44 41
122 Female 4 2 1 2 52 59 58 53 66
191 Female 4 3 2 2 47 52 43 48 61
83 Female 4 2 1 3 50 62 41 55 31
182 Female 4 2 2 2 44 52 43 44 51
6 Female 1 1 1 2 47 41 46 40 41
46 Female 3 1 1 2 45 55 44 34 41
43 Female 3 1 1 2 47 37 43 42 46
96 Female 4 3 1 2 65 54 61 58 56
138 Female 4 2 1 3 43 57 40 50 51
10 Female 1 2 1 1 47 54 49 53 61
71 Female 4 2 1 1 57 62 56 58 66
139 Female 4 2 1 2 68 59 61 55 71
110 Female 4 2 1 3 52 55 50 54 61
148 Female 4 2 1 3 42 57 51 47 61
109 Female 4 2 1 1 42 39 42 42 41
39 Female 3 3 1 2 66 67 67 61 66
147 Female 4 1 1 2 47 62 53 53 61
74 Female 4 2 1 2 57 50 50 51 58
198 Female 4 3 2 2 47 61 51 63 31
161 Female 4 1 1 2 57 62 72 61 61
112 Female 4 2 1 2 52 59 48 55 61
69 Female 4 1 1 3 44 44 40 40 31
156 Female 4 2 1 2 50 59 53 61 61
111 Female 4 1 1 1 39 54 39 47 36
186 Female 4 2 2 2 57 62 63 55 41
98 Female 4 1 1 3 57 60 51 53 37
119 Female 4 1 1 1 42 57 45 50 43
13 Female 1 2 1 3 47 46 39 47 61
51 Female 3 3 1 1 42 36 42 31 39
26 Female 2 3 1 2 60 59 62 61 51
36 Female 3 1 1 1 44 49 44 35 51
135 Female 4 1 1 2 63 60 65 54 66
59 Female 4 2 1 2 65 67 63 55 71
78 Female 4 2 1 2 39 54 54 53 41
64 Female 4 3 1 3 50 52 45 58 36
63 Female 4 1 1 1 52 65 60 56 51
79 Female 4 2 1 2 60 62 49 50 51
193 Female 4 2 2 2 44 49 48 39 51
92 Female 4 3 1 1 52 67 57 63 61
160 Female 4 2 1 2 55 65 55 50 61
32 Female 2 3 1 3 50 67 66 66 56
23 Female 2 1 1 2 65 65 64 58 71
158 Female 4 2 1 1 52 54 55 53 51
25 Female 2 2 1 1 47 44 42 42 36
188 Female 4 3 2 2 63 62 56 55 61
52 Female 3 1 1 2 50 46 53 53 66
124 Female 4 1 1 3 42 54 41 42 41
175 Female 4 3 2 1 36 57 42 50 41
184 Female 4 2 2 3 50 52 53 55 56
30 Female 2 3 1 2 41 59 42 34 51
179 Female 4 2 2 2 47 65 60 50 56
31 Female 2 2 2 1 55 59 52 42 56
145 Female 4 2 1 3 42 46 38 36 46
187 Female 4 2 2 1 57 41 57 55 52
118 Female 4 2 1 1 55 62 58 58 61
137 Female 4 3 1 2 63 65 65 53 61
Variable types
Qualitative variable: Gender
Continuous variable: Score of read, write, math, science

Read variable
Measures of central tendency: average (mean), middle value (median), value with maximum replication (mode)
Dispersion: spread of the observations around the mean of the data (standard deviation)
Read scores sorted in ascending order (observation number, value)

Percentiles/Quartiles   Value   Obs number
Q1 (25%)                44      = 200*0.25 = 50th
Q2/Median (50%)         50      = 200*0.5 = 100th
Q3 (75%)                60      = 200*0.75 = 150th

1 28
2 31
3 34
4 34
5 34
6 34
7 34
8 34
9 35
10 36
11 36
12 36
13 37
14 37
15 39
16 39
17 39
18 39
19 39
20 39
21 39
22 39
23 41
24 41
25 42
26 42
27 42
28 42
29 42
30 42
31 42
32 42
33 42
34 42
35 42
36 42
37 42
38 43
39 43
40 44
41 44
42 44
43 44
44 44
45 44
46 44
47 44
48 44
49 44
50 44 Q1 (25% quartile)
51 44
52 44
53 45
54 45
55 46
56 47
57 47
58 47
59 47
60 47
61 47
62 47
63 47
64 47
65 47
66 47
67 47
68 47
69 47
70 47
71 47
72 47
73 47
74 47
75 47
76 47
77 47
78 47
79 47
80 47
81 47
82 47
83 48
84 50
85 50
86 50
87 50
88 50
89 50
90 50
91 50
92 50
93 50
94 50
95 50
96 50
97 50
98 50
99 50
100 50 Median or Q2 (50% quartile)
101 50
102 52
103 52
104 52
105 52
106 52
107 52
108 52
109 52
110 52
111 52
112 52
113 52
114 52
115 52
116 53
117 54
118 55
119 55
120 55
121 55
122 55
123 55
124 55
125 55
126 55
127 55
128 55
129 55
130 55
131 57
132 57
133 57
134 57
135 57
136 57
137 57
138 57
139 57
140 57
141 57
142 57
143 57
144 57
145 60
146 60
147 60
148 60
149 60
150 60 Q3 (75% quartile)
151 60
152 60
153 60
154 61
155 63
156 63
157 63
158 63
159 63
160 63
161 63
162 63
163 63
164 63
165 63
166 63
167 63
168 63
169 63
170 63
171 65
172 65
173 65
174 65
175 65
176 65
177 65
178 65
179 65
180 66
181 68
182 68
183 68
184 68
185 68
186 68
187 68
188 68
189 68
190 68
191 68
192 71
193 71
194 73
195 73
196 73
197 73
198 73
199 76
200 76
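
The quartile lookup used for the sorted read scores (observation number = n * percentile, then read off the value at that position) can be sketched as follows. The function name `value_at_percentile` is hypothetical, and rounding the position up when n * percentile is not a whole number is an assumption, since the sheet only shows whole-number positions.

```python
import math

def value_at_percentile(values, p):
    """Return the value at observation number n*p in the sorted data.

    Mirrors the sheet's rule (e.g. 200 * 0.25 = 50th observation).
    Rounding up when n*p is not a whole number is an assumption.
    """
    ordered = sorted(values)
    position = max(1, math.ceil(len(ordered) * p))  # 1-based observation number
    return ordered[position - 1]

scores = [57, 68, 44, 63, 47, 44, 50, 34]  # first eight read scores from the data
q1 = value_at_percentile(scores, 0.25)
q2 = value_at_percentile(scores, 0.50)
q3 = value_at_percentile(scores, 0.75)
```

Statistical libraries usually interpolate between neighbouring observations, so their quartiles may differ slightly from this position-based lookup.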
Read variable: measures of central tendency and dispersion

              Mean    Median   Mode   STD
Read Scores   52.23   50       47     10.25
C13: Average (mean)
D13: Middle value (median)
E13: Value with maximum replication (mode)
F13: Standard deviation (STD), which measures how far the observations typically lie from the mean of the data.
For example, the first observation of read has a value of 57 and the mean read score is 52.23, so its deviation is
57 - 52.23 = 4.77 (about 5). For STD, the deviation of every data point from the mean is squared, the squared
deviations are averaged, and the square root of that average is taken.
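
As a minimal sketch, the four measures can be computed with Python's standard `statistics` module on the first ten read scores from the data. The source does not say whether its STD of 10.25 divides by n or by n - 1, so both variants are shown; the variable names are illustrative.

```python
import statistics

read_sample = [57, 68, 44, 63, 47, 44, 50, 34, 63, 57]  # first ten read scores

mean = statistics.mean(read_sample)
median = statistics.median(read_sample)
modes = statistics.multimode(read_sample)  # all most-frequent values

# Standard deviation by hand: square the deviations, average, take the root.
squared_devs = [(x - mean) ** 2 for x in read_sample]
population_std = (sum(squared_devs) / len(read_sample)) ** 0.5
sample_std = (sum(squared_devs) / (len(read_sample) - 1)) ** 0.5
```

On this small sample three values tie for the mode, which is why `multimode` is used rather than a single-value mode.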
Normal distribution or a bell curve

Mean = Median = Mode

Confidence interval: represents the central range of the data distribution. It is used in hypothesis testing: if a value lies in this range, the value is taken to belong to this distribution.

Hypothesis testing terms:
- H0 and H1 hypothesis
- Critical region
- Significance level (5%)
- Type I & II error
- One-tail and two-tail test

Rejection area: H0 is rejected for values at this level, i.e. between -2 and -3 Stdev. Here Frequency = 0.

[Histogram of the data with the bell curve; vertical axis: Frequency]
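
The idea of a rejection area at a 5% significance level can be sketched with `statistics.NormalDist` from the Python standard library. The function `in_rejection_region` is a hypothetical helper, the mean and STD are the read-score values from the summary table, and treating the one-tail case as a lower-tail test is an assumption made for illustration.

```python
from statistics import NormalDist

def in_rejection_region(value, mean, std, alpha=0.05, two_tailed=True):
    """Return True if value falls in the critical region of N(mean, std)."""
    z = (value - mean) / std
    standard = NormalDist()  # mean 0, standard deviation 1
    if two_tailed:
        critical = standard.inv_cdf(1 - alpha / 2)  # about 1.96 for alpha = 0.05
        return abs(z) > critical
    critical = standard.inv_cdf(alpha)  # lower tail only, about -1.645
    return z < critical

# Mean and STD of the read scores, taken from the summary table.
READ_MEAN, READ_STD = 52.23, 10.25
low_score_rejected = in_rejection_region(28, READ_MEAN, READ_STD)
typical_score_rejected = in_rejection_region(57, READ_MEAN, READ_STD)
```

A score of 28 lies more than two standard deviations below the mean and falls in the rejection area, while 57 lies well inside the central range.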
P14: Go to Data tab -> Data Analysis -> Histogram, then select the data range and bin range to populate the
frequency.
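
Outside a spreadsheet, the same frequency table can be built by hand. The sketch below assumes the spreadsheet convention that each bin counts the values up to and including its upper bound, with a final "More" bin for anything larger; `histogram_counts` and the bin values are illustrative choices, not taken from the sheet.

```python
from bisect import bisect_left

def histogram_counts(values, bin_upper_bounds):
    """Count values per bin; a value v goes to the first bin with v <= bound.

    Values above the last bound are counted in a final "More" bin,
    which is how the spreadsheet Histogram tool is assumed to behave here.
    """
    bounds = sorted(bin_upper_bounds)
    counts = [0] * (len(bounds) + 1)
    for v in values:
        counts[bisect_left(bounds, v)] += 1
    return counts

read_sample = [57, 68, 44, 63, 47, 44, 50, 34, 63, 57]  # first ten read scores
bins = [40, 50, 60, 70]                                   # illustrative bin range
frequency = histogram_counts(read_sample, bins)
```

Each entry of `frequency` corresponds to one bin in order, followed by the "More" count.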
Covariance and corrected sum of squares calculation between Maths (x) and Science (y)

id    Gender   race  ses  schtyp  prog  read  write  math  science  socst  (x - x_mean)*(y - y_mean)  (unlabeled)
121   Female   4     2    1       3     68    59     53    63       61     3.95825                    65.163
82    Female   4     3    1       2     68    62     65    69       61     211.88825                  1845.183
8     Female   1     1    1       2     39    44     52    44       48     5.06325                    -10.707
129   Female   4     1    1       1     44    44     46    47       51     32.22825                   -43.407
173   Female   4     1    1       1     50    62     61    63       51     93.15825                   -121.017
57    Female   4     2    1       2     71    65     72    66       56     273.87325                  998.593
100   Female   4     3    1       2     63    65     71    69       71     314.78825                  5839.823
1     Female   1     1    1       3     34    44     40    39       41     162.48825                  -1848.567
194   Female   4     3    2       2     63    63     69    61       61     149.64825                  1264.023
88    Female   4     3    1       2     68    60     64    69       66     194.73825                  2676.843
99    Female   4     3    1       1     47    59     56    66       61     47.47325                   441.003

Corrected SS   11642.35
Covariance     58.50

The per-row products are consistent with x_mean = 52.645 and y_mean = 51.85. Corrected SS is the sum of these products over all 200 observations (only an excerpt of rows is shown), and the covariance divides it by n - 1:
$\mathrm{Cov}(x, y) = \frac{1}{n-1} \sum_{i=1}^{n} (x_i - \bar{x})(y_i - \bar{y}) = 11642.35 / 199 = 58.50$
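
The covariance calculation between Maths (x) and Science (y) can be reproduced with a short sketch. The function names are hypothetical. The excerpt covariance below uses only the eleven rows shown, so it will not equal the full-data figure. The means 52.645 and 51.85 used in the single-row check are inferred from the per-row products in the sheet rather than stated in the source.

```python
def corrected_sum_of_products(xs, ys):
    """Sum of (x - x_mean) * (y - y_mean) over paired observations."""
    x_mean = sum(xs) / len(xs)
    y_mean = sum(ys) / len(ys)
    return sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))

def sample_covariance(xs, ys):
    """Corrected sum of products divided by n - 1."""
    return corrected_sum_of_products(xs, ys) / (len(xs) - 1)

# Math (x) and science (y) scores of the eleven rows shown in the sheet.
math_scores = [53, 65, 52, 46, 61, 72, 71, 40, 69, 64, 56]
science_scores = [63, 69, 44, 47, 63, 66, 69, 39, 61, 69, 66]
cov_excerpt = sample_covariance(math_scores, science_scores)

# Single-row check against the sheet, using the inferred full-data means.
first_row_product = (53 - 52.645) * (63 - 51.85)
```

A positive covariance indicates that students with higher maths scores tend to have higher science scores as well.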