Professional Documents
Culture Documents
Statistics All Formulas by Pranav Popat
Statistics All Formulas by Pranav Popat
com
All formulas
Ca foundation statistics
(100 formulas)
Use this PDF with Formula Revision Marathon
Marathon Link: https://youtu.be/cZJHcOf6OUs
› Class Limits (CL) : CL is the minimum and maximum value the class interval may
contain
CL = UCL – LCL
UCL : Upper Class Limit
LCL : Lower Class Limit
Class Boundary (CB) : These are actual class limits of a class interval
Mid Point of Class Interval : average of LCL and UCL or average of LCB and UCB
LCL+UCL
2
LCB+UCB
2
Mid point is also called as Mid Value of a class or Class Mark
UCB - LCB
Frequency Density
Frequency of a class
Class length of that class
Relative Frequency
Frequency of class
Total Frequency of distribution
Percentage Frequency
Frequency of class
×100
Total Frequency of distribution
x1 + x2 + x3 + ... + xn x
x= x=
n n
fx
x=
N
Here,
𝑥 will be equal to mid point of class interval when frequency distribution is grouped.
N is number of observations which is also equal to total of all frequencies.
fd
x = A+ C
N
where d = x − A
C
A = assumed mean, C = class length
(x − x ) = 0
Combined AM
– If there are two groups containing 𝑛1 & 𝑛2 observations with AMs as 𝑥1 & 𝑥2
respectively then the combined AM is given by
n1 x1 + n2 x2
xc =
n1 + n2
– Step 2: Calculate N/2 and check between which class boundaries it falls and call it as
Median Class.
13
Nu − Nl
For a set of observations, the sum of absolute deviations is minimum, when the
deviations are taken from the median.
Quartiles
th
1
Q1 = (n + 1) term
4
th
2
Q2 = (n + 1) term
4
th
3
Q3 = (n + 1) term
4
Deciles
th
1
D1 = (n + 1) term
10
th
2
D2 = (n + 1) term
10
th
9
Q9 = (n + 1) term
10
Percentiles
th
1
P1 = (n + 1) term
100
th
2
P2 = (n + 1) term
100
th
99
P99 = (n + 1) term
100
N
– For first Quartile 𝑸𝟏
− Nl
› Find 𝑄1 class Q1 = l1 + 4 C
› Use Formula : Nu − Nl
3N
– For third Quartile 𝑸𝟑 − Nl
› Find 𝑄3 class Q3 = l1 + 4 C
Nu − Nl
18
› Use Formula :
– Apply formula
f 0 − f −1
Mo = l1 + C
2 f 0 − f −1 − f1
G = ( x1 x2 ... xn )
1/ n
𝑓1 𝑓2 𝑓3 𝑓𝑛 1/𝑁
𝐺 = (𝑥1 × 𝑥2 × 𝑥3 × ⋯ × 𝑥𝑛 )
𝑛
𝐻=
Σ(1ൗ𝑥)
𝑁
𝐻=
𝑓ൗ
Σ( 𝑥)
Combined HM
𝑛1 +𝑛2
𝑥ҧ𝑐 = 𝑛 𝑛2
1
+
𝐻1 𝐻2
L– S
L = Largest Observation
S = Smallest Observation
L– S
L = UCB of last class interval
S = LCB of first class interval
Coefficient of Range
L−S
100
L+S
1
𝑀𝐷𝐴 = Σ 𝑥 − 𝐴
𝑛
1
𝑀𝐷𝐴 = Σ𝑓 𝑥 − 𝐴
𝑁
Here A is mean or median or mode as given in question
(x − x ) 2
x = SDx =
n
2
x
−(x)
2
x = SDx =
n
f (x − x ) 2
x = SDx =
N
2
fx
−(x)
2
x = SDx =
N
Coefficient of Variation
SDx
100
x
|a −b|
SD =
2
n −1 2
s=
12
Combined SD
n s + n2 s2 + n d + n2 d 2
2 2 2 2
SDc = 1 1 1 1
n1 + n2
d1 = xc − x1
d 2 = xc − x2
Quartile Deviation
Q3 − Q1
QDx =
2
Q3 − Q1
100
Q3 + Q1
4 SD = 5MD = 6QD
SD : MD : QD = 15 :12 :10
q
p
Downloaded From www.castudynotes.com
51
stats Formula marathon
QUIZ NO. 1 CA Foundation Paper 3
Downloaded From www.castudynotes.com
Formula Marathon for Maths, LR and Stats by CAPRANAV
Addition Theorem 1
In case of two mutually exclusive events A and B
P A∪B = P A+B = P A or B = P A + P B
Addition Theorem 2
In case of two or more mutually exclusive events
𝑃 𝐴1 ∪ 𝐴2 ∪ 𝐴3 ∪ ⋯ = 𝑃 𝐴1 + 𝑃 𝐴2 + 𝑃 𝐴3 + …
Addition Theorem 3
For any two events
𝑃 𝐴 ∪ 𝐵 = 𝑃 𝐴 + 𝑃 𝐵 − 𝑃(𝐴 ∩ 𝐵)
Addition Theorem 4
In case of any three events
𝑷 𝑨∪𝑩∪𝑪
= 𝑃 𝐴 + 𝑃 𝐵 + 𝑃 𝐶 − 𝑃 𝐴∩𝐵 − 𝑃 𝐵∩𝐶
− 𝑃 𝐴 ∩ 𝐶 + 𝑃(𝐴 ∩ 𝐵 ∩ 𝐶)
𝐵 𝑃 𝐵∩𝐴
𝑃 =
𝐴 𝑃 𝐴
𝑝𝑟𝑜𝑣𝑖𝑑𝑒𝑑 𝑃(𝐴) ≠ 0
𝐴 𝑃 𝐴∩𝐵
𝑃 =
𝐵 𝑃 𝐵
𝑝𝑟𝑜𝑣𝑖𝑑𝑒𝑑 𝑃(𝐵) ≠ 0
Compound Theorem
In case of two dependent events
Compound Theorem
In case of two independent events
E ( x) = pi xi
E ( x) =
𝑉 𝑥 = 𝜎 2
𝐸 𝑥−𝜇 2 = 2
𝐸(𝑥) −𝜇 2
𝑛 𝑥 𝑛−𝑥
𝑓 𝑥 = 𝑃 𝑋 = 𝑥 = 𝐶𝑥 𝑝 𝑞
𝜇 = 𝑛𝑝
𝜎 2 = 𝑛𝑝𝑞
−𝑚 𝑥
(𝑒 . 𝑚 )
𝑓 𝑥 = 𝑃 𝑋=𝑥 =
𝑥!
𝜇=𝑚
𝜎 2 =𝑚
𝜎= 𝑚
1 −(
𝑥−𝜇 2 1
) ×
𝑓 𝑥 = × 𝑒 𝜎 2
𝜎 2𝜋
MD = 0.8 𝜎
𝑄1 = 𝜇 − 0.675𝜎
𝑄3 = 𝜇 + 0.675𝜎
𝑄𝐷 = 0.675𝜎
Points of Inflexion
𝜇−𝜎 & 𝜇+ 𝜎
Ratio between
QD : MD : SD = 10:12:15
Mean μ 0
Standard Deviation σ 1
Z Value
𝑥−𝜇
𝑍=
𝜎
From To Area/Probability
μ +σ 34.135%
+σ +2σ 13.59%
+2σ +3σ 2.14%
3σ ∞ 0.135%
𝐶𝑜𝑣 𝑥, 𝑦
𝑟𝑥𝑦 =
𝜎𝑥 . 𝜎𝑦
Covariance
𝛴 𝑥 − 𝑥ҧ 𝑦 − 𝑦ത 𝛴𝑥𝑦
𝐶𝑜𝑣 𝑥, 𝑦 = 𝑜𝑟 − 𝑥.ҧ 𝑦ത
𝑛 𝑛
6𝛴𝑑 2
𝑟𝑅 = 1 −
𝑛 𝑛2 −1
𝑡 3 −𝑡
𝛴
12
2𝑐 − 𝑚
𝑟𝑐 = ± ±
𝑚
Regression Coefficients
𝜎𝑦 𝐶𝑜𝑣(𝑥,𝑦)
𝑏𝑦𝑥 = 𝑟. or 𝑏𝑦𝑥 =
𝜎𝑥 𝜎𝑥 2
𝜎𝑥 𝐶𝑜𝑣(𝑥,𝑦)
𝑏𝑥𝑦 = 𝑟. or 𝑏𝑥𝑦 =
𝜎𝑦 𝜎𝑦 2
𝑟 = ± 𝑏𝑥𝑦 × 𝑏𝑦𝑥
𝑏𝑥𝑦 , 𝑏𝑦𝑥 𝑎𝑛𝑑 𝑟 𝑎𝑙𝑙 𝑤𝑖𝑙𝑙 ℎ𝑎𝑣𝑒 𝑠𝑎𝑚𝑒 𝑠𝑖𝑔𝑛
then,
change of scale of y
bvu = byx
change of scale of x
change of scale of x
buv = bxy
change of scale of y
( x, y )
Coefficient of determination
Explained Variance
Accounted Variance
2
𝑟
Coefficient of non-determination
Unexplained Variance
Unaccounted Variance
2
1−𝑟
1− r2
0.6745
N
– where, 𝒓 = correlation coefficient, 𝑵 = no. of pairs of observation
r PE
𝑃𝑛
Price Relative Price Relative =
𝑃0
Quantity 𝑄𝑛
Quantity Relative =
Relative 𝑄0
𝑉𝑛 𝑃𝑛 𝑄𝑛
Value Relative Value Relative = =
𝑉0 𝑃0 𝑄0
𝑃1 𝑃2 𝑃3 𝑃𝑛
Link relative
, , ,…,
𝑃0 𝑃1 𝑃2 𝑃𝑛−1
Same can be created for quantities also
When the above relatives are in respect to a fixed base period these are
also called the chain relatives
Chain relatives 𝑃1 𝑃2 𝑃3 𝑃𝑛
, , ,…,
93
𝑃0 𝑃0 𝑃0 𝑃0
› Under this method, we invert the actual price for each variable into percentage
of the base period. These percentages are called relatives.
› Formula
𝑃𝑛
Σ
𝑃0
N
› Laspeyres’ Index
Σ𝑃𝑛 𝑄0
× 100
Σ𝑃0 𝑄0
› Passche’s Index
Σ𝑃𝑛 𝑄𝑛
× 100
Σ𝑃0 𝑄𝑛
› Marshall-Edgeworth Index
Σ𝑃𝑛 (𝑄0 + 𝑄𝑛 )
× 100
Σ𝑃0 (𝑄0 + 𝑄𝑛 )
› Fisher’s Index
(GM of Laspeyres’ and Paasche’s)
Σ𝑃𝑛 𝑄0 Σ𝑃𝑛 𝑄𝑛
× × 100
Σ𝑃0 𝑄0 Σ𝑃0 𝑄𝑛
Current Value
Deflated Value =
Price Index of the current year