Professional Documents
Culture Documents
▪ What is inferential
Statistics?
▪ Making inferences
about populations
based on samples
Descriptive Vs Inferential Statistics
It is given that μ = 4 minutes. To do any calculations, you must know λ, the decay parameter. λ = 1/μ .
Therefore, λ = 1/4 = 0.25
For example, f(5) = 0.072. The postal clerk spends five minutes with the customers.
Exponential Distribution
import numpy as np
import seaborn as sns
scale = 1 / 3.3
draws = np.random.exponential(scale, size = 1_000_000)
sns.kdeplot(draws, shade=True, color='xkcd:lightish blue')
Normal(Gaussian) Distribution
• For what data, Normal
Distribution fits
– When probability of
occurrence of extreme value
from mean is low
• Example data where Normal
distribution fits
– Body temperature
– People's height
– Car mileage
– IQ scores
– Error distribution of
observed values of sensors
• Why to fit distribution
– To infer the occurrence of
events
Sample Distribution
• What is Conjecture?
– Any statement which is either true or false
• What is Statistical Hypothesis?
– Conjecture that can be tested experiments / observations
• Eg:
– Given drug X, and disease d, X is effective in treating d
– Avg monthly salary of an Indian is 10k
– Avg monthly salary of an Indian and a Chinese is the
same
– Performance of Algorithms A and B are statistically the
same
Statistical Hypothesis Testing
Hypothesis Testing using Z-Test to test
population mean
Suppose later that further testing shows that the machine was
working properly, what type of error did the employee make
(Type 1 or Type 2)?
P(x<2.31) =0.9896
Z-Table
Steps of Z-test for left tail to
population mean
Note:
Step 1: Formulate H0 and H1 The objective is to reject
H0: PM=50 (PM denotes Population mean) null hypothesis when
H1: PM <50 population mean is
significantly less than50
Step 2: Select Significance Level
alpha = 5%
• The major difference between the Z-Test and T-Test is that the population
standard deviation/variance would be given for Z-Test where as the sample
standard deviation/variance would be given for T-Test. If the sample
standard deviation/variance is not given in the question while calculating T
statistics, you need to measure the same (standard deviation) from the
given data.