Professional Documents
Culture Documents
Reference Excel Project File
Reference Excel Project File
ntitative Techniques
& Semester- 1
Veerendra Anchan
INTRODUCTION
For this project we got the opportunity to apply and understand the conce
and theories of statistics taught to us in class in an effective manner. We ha
created a questionnaire and collected the responses from numerous peop
primarily in order to apply the concepts. Our faculty Mr. Veerendra Ancha
gave us the opportunity to statistically analyse the airlines industry in pre a
post covid times. We have been successful in doing an interpretation of th
relationship between the pre and post covid situation to the best of our
abilities with the help of the MS-Excel. We have used quantitative tools an
techniques such as Graphs ,Measures Of Central Tendency, Measure Of
Dispersion ,Descriptive Statistics, Correlation, ANOVA test, Chi sqaured te
Mann Whitney U test and Regression to analyse the inter-relation between
dependent and independent variables. We Group no.5 have decided to tak
survey on impact of covid on airlines industry as it suffered a major loss du
the worldwide lockdown and it has surprisingly recovered rapidly from it
downfall.
nderstand the concepts
ctive manner. We have
om numerous people,
Mr. Veerendra Anchan
nes industry in pre and
n interpretation of the
on to the best of our
quantitative tools and
ndency, Measure Of
test, Chi sqaured test,
er-relation between the
have decided to take a
ered a major loss due to
vered rapidly from its
How much do you usually
Name Gender Age spend while travelling
domestically.
3-6 3-6
6-9 3-6
3-6 0-3
6-9 0-3
0-3 0-3
9-12 0-3
9-12 9-12
3-6 0-3
0-3 0-3
3-6 0-3
9-12 0-3
0-3 0-3
9-12 0-3
0-3 0-3
9-12 9-12
3-6 0-3
0-3 0-3
3-6 0-3
6-9 3-6
0-3 0-3
0-3 0-3
0-3 0-3
3-6 0-3
0-3 0-3
3-6 0-3
3-6 0-3
3-6 0-3
3-6 0-3
0-3 0-3
3-6 0-3
0-3 0-3
3-6 0-3
0-3 0-3
3-6 0-3
6-9 3-6
0-3 0-3
6-9 0-3
0-3 0-3
0-3 0-3
0-3 0-3
3-6 0-3
0-3 0-3
9-12 3-6
3-6 0-3
6-9 3-6
3-6 0-3
0-3 0-3
3-6 0-3
9-12 0-3
0-3 0-3
0-3 0-3
0-3 0-3
0-3 3-6
0-3 0-3
9-12 3-6
0-3 0-3
0-3 0-3
0-3 0-3
9-12 6-9
9-12 3-6
3-6 0-3
3-6 0-3
3-6 0-3
0-3 0-3
0-3 0-3
3-6 0-3
3-6 0-3
3-6 3-6
6-9 3-6
3-6 0-3
3-6 0-3
6-9 6-9
3-6 0-3
0-3 0-3
3-6 0-3
3-6 3-6
3-6 0-3
0-3 0-3
3-6 6-9
3-6 0-3
0-3 0-3
0-3 0-3
3-6 0-3
3-6 3-6
0-3 0-3
3-6 0-3
3-6 0-3
3-6 3-6
3-6 3-6
0-3 0-3
3-6 0-3
0-3 0-3
0-3 0-3
0-3 0-3
0-3 0-3
0-3 0-3
0-3 0-3
6-9 0-3
0-3 0-3
0-3 0-3
3-6 0-3
9-12 3-6
3-6 0-3
0-3 0-3
0-3 0-3
3-6 0-3
0-3 0-3
0-3 0-3
0-3 0-3
3-6 0-3
3-6 0-3
0-3 0-3
9-12 0-3
0-3 0-3
0-3 0-3
0-3 0-3
6-9 0-3
3-6 0-3
0-3 0-3
3-6 0-3
3-6 0-3
9-12 6-9
6-9 0-3
9-12 6-9
0-3 0-3
9-12 3-6
3-6 0-3
3-6 0-3
9-12 9-12
3-6 0-3
3-6 0-3
0-3 0-3
3-6 0-3
3-6 0-3
6-9 0-3
9-12 0-3
6-9 3-6
9-12 0-3
3-6 0-3
6-9 0-3
9-12 0-3
3-6 0-3
0-3 0-3
3-6 0-3
3-6 0-3
0-3 0-3
3-6 0-3
0-3 3-6
6-9 0-3
3-6 0-3
9-12 3-6
3-6 0-3
3-6 0-3
0-3 0-3
3-6 0-3
6-9 0-3
3-6 0-3
3-6 0-3
0-3 0-3
How often do you plan to travel post
Covid per year?
6-9
6-9
3-6
0-3
6-9
6-9
9-12
3-6
0-3 How often p
3-6 140
9-12
3-6
9-12 120
0-3
9-12
100
0-3
3-6
3-6 80
Frequency
3-6
0-3
3-6 59
60
3-6
3-6
0-3 40
3-6
3-6
3-6 20
3-6
0-3
0
3-6 0-3
6-9
3-6
0-3
3-6
6-9
0-3
6-9
0-3
3-6
0-3
3-6
0-3
9-12
3-6 How often
6-9
3-6 140
0-3
120
100
How often
140
0-3
120
9-12
3-6
0-3 100
0-3
3-6
Frequency
0-3 80
9-12
0-3 60
0-3 52
0-3
9-12 40
9-12
3-6
3-6 20
3-6
3-6 0
0-3 0-3
3-6
3-6
6-9
9-12
6-9
0-3
9-12
3-6
0-3
0-3
3-6
6-9
0-3
6-9
3-6
3-6
0-3
3-6
9-12
3-6
9-12
3-6
6-9
6-9
0-3
0-3
0-3
0-3
0-3
0-3
0-3
3-6
6-9
3-6
3-6
6-9
9-12
3-6
0-3
0-3
3-6
3-6
3-6
3-6
0-3
3-6
0-3
9-12
0-3
0-3
0-3
6-9
3-6
0-3
0-3
3-6
9-12
0-3
9-12
0-3
9-12
3-6
3-6
9-12
3-6
3-6
0-3
0-3
3-6
6-9
9-12
6-9
6-9
3-6
9-12
9-12
0-3
0-3
3-6
0-3
0-3
3-6
0-3
3-6
3-6
9-12
3-6
3-6
3-6
3-6
3-6
0-3
3-6
3-6
How often did you travel by air before covid in a year? Frequency
0-3 59
3-6 64
6-9 16
9-12 20
120
100
80
Frequency
64
59
60
40
20
20 16
0
0-3 3-6 6-9 9-12
How often do you plan to travel post cost covid per year? Frequency
0-3 52
3-6 64
6-9 20
9-12 23
120
100
How often people plan to travel post covid per year
140
120
100
Frequency
80
64
60
52
40
23
20
20
0
0-3 3-6 6-9 9-12
140
131
120
100
Frequency
80
60
40
20
20
20 5
0
0-3 3-6 6-9
9-12
How often did you travel by air before covid in the year 2020?
0-3
3-6
6-9
9-12
140
131
120
100
Frequency
80
60
40
20
20
5 3
0
0-3 3-6 6-9 9-12
he year 2020
5 3
6-9 9-12
How satisfied were you
with the services provided
Bins
by the airlines before
covid?
4 2 Bins Frequency
4 3 2 2
4 4 3 24
3 5 4 105
5 5 28
4
1
4
4
4
5
4
4
3
5
4
4
3
4
4
4 Bins Frequency
3 2 2
3 3 24
3 4 105
4 5 28
3
3
5
4
4
5
5
4
4
4
4
3
5
4
4
4
4
4
4
4
4
3
4
4
4
3
3
4
4
4
4
4
4
4
4
4
4
4
4
4
4
5
3
4
4
4
4
4
4
5
5
4
4
4
4
4
4
4
4
3
4
5
4
4
5
4
4
4
4
4
5
4
4
4
4
3
4
4
5
5
4
5
4
3
5
5
3
4
4
4
4
3
3
3
4
4
4
5
4
3
4
3
5
4
5
4
4
4
4
5
4
4
2
5
4
4
4
4
4
4
5
4
4
4
5
4
3
5
4
4
5
5
4
3
Less than type Ogive
120.00%
100.00%
Cumulative %
1.26% 80.00%
16.35%
Frequency
82.39% 60.00%
100.00%
40.00%
20.00%
0.00%
2 3 4 5
Cumulative %
80.00%
Frequency
60.00%
40.00%
Frequency
60.00%
40.00%
20.00%
0.00%
2 3 4 5
Bins
Column G
How satisfied were you
with the services provided
Bins
by the airlines during
covid?
3 2 Bins
4 3 2
3 4 3
3 5 4
4 5
4
1
4
3
3
5
3
4
3
1
4
4
3
3
3
3
3
3 Bins
3 2
4 3
3 4
3 5
5
3
4
3
3
3
3
4
3
4
4
3
3
5
3
4
3
4
4
4
3
4
4
3
3
3
3
4
4
4
3
4
3
3
4
3
3
3
3
4
3
3
4
4
4
4
3
2
3
3
3
4
3
3
3
4
4
3
3
3
4
4
4
3
4
3
3
3
3
4
3
3
4
4
4
4
2
5
2
5
3
3
5
3
3
4
3
3
4
3
4
3
2
4
4
1
3
4
3
2
5
5
5
4
4
1
5
4
3
4
3
5
2
4
3
3
4
2
5
3
4
4
5
3
3
3
5
2
3
4
4
3
Less than type Ogive
120.00%
Frequency
14 100.00% 60.00%
40.00%
20.00%
0.00%
2 3 4 5
Bins
Cumulative %
60.00%
40.00%
20.00%
0.00%
2 3 4 5
Frequen
60.00%
40.00%
20.00%
0.00%
2 3 4 5
Bins
Column Z Column Z
s
5
5
5
How satisfied were you with
the services provided by the How satisfied were you with the services provided by the airlines before covid?
airlines before covid?
4
4 Mean 3.99371069182
4 Standard Error 0.05086377165
3 Median 4
5 Mode 4
4 Standard Deviati 0.64136775669
1 Sample Variance 0.41135259932
4 Kurtosis 2.70135192787
4 Skewness -0.72335468608
4 Range 4
5 Minimum 1
4 Maximum 5
4 Sum 635
3 Count 159
5
4
4
3
4
4
4
3
3
3
4
3
3
5
4
4
5
5
4
4
4
4
3
5
4
4
4
4
4
4
4
4
3
4
4
4
3
3
4
4
4
4
4
4
4
4
4
4
4
4
4
4
5
3
4
4
4
4
4
4
5
5
4
4
4
4
4
4
4
4
3
4
5
4
4
5
4
4
4
4
4
5
4
4
4
4
3
4
4
5
5
4
5
4
3
5
5
3
4
4
4
4
3
3
3
4
4
4
5
4
3
4
3
5
4
5
4
4
4
4
5
4
4
2
5
4
4
4
4
4
4
5
4
4
4
5
4
3
5
4
4
5
5
4
3
How satisfied were you with the
he airlines before covid? services provided by the airlines How satisfied were you with the services provided by the a
during covid?
3
4 Mean 3.42767296
3 Standard Error 0.06523772
3 Median 3
4 Mode 3
4 Standard Deviati 0.82261638
1 Sample Variance 0.67669772
4 Kurtosis 0.765642
3 Skewness -0.2488947
3 Range 4
5 Minimum 1
3 Maximum 5
4 Sum 545
3 Count 159
1
4
4
3
3
3
3
3
3
3
4
3
3
5
3
4
3
3
3
3
4
3
4
4
3
3
5
3
4
3
4
4
4
3
4
4
3
3
3
3
4
4
4
3
4
3
3
4
3
3
3
3
4
3
3
4
4
4
4
3
2
3
3
3
4
3
3
3
4
4
3
3
3
4
4
4
3
4
3
3
3
3
4
3
3
4
4
4
4
2
5
2
5
3
3
5
3
3
4
3
3
4
3
4
3
2
4
4
1
3
4
3
2
5
5
5
4
4
1
5
4
3
4
3
5
2
4
3
3
4
2
5
3
4
4
5
3
3
3
5
2
3
4
4
3
s provided by the airlines during covid?
How much do you usually
Age(X) spend while travelling
domestically.(Y) Karl Pearson Correlation Coefficien
18 10000 Karl Pearson’s coefficient of correlation is defined as a linear correlation coefficie
18 5000 the value range of -1 to +1. Value of -1 signifies strong negative correlation while +1 in
17 10000
57 10000 Standard Deviation(X)= 13.62
24 10000 Standard Deviation(Y)= 5217.40
28 10000
36 15000 Covariance(X,Y)= -561.21
44 10000
40 5000 Karl Pearson's Correlation Coefficient= -0.0080
18 10000
15 15000 There is a very week negative relation between Age and Expenditure i.e -0.8%.
21 10000 So there is hardly any impact on expenditure due to change in age.
31 20000
33 10000 Scatter Diagram showing relationship between Age and Expenditure
23 5000
60 10000
17 20000
24 10000
15 20000
24 5000
43 20000
45 15000
51 10000
48 5000
26 10000
38 20000
25 20000
34 10000
24 20000
33 10000
29 10000
59 20000
43 10000
47 10000
23 15000
50 20000
44 10000
45 20000
24 10000
16 10000
37 10000
59 20000
22 20000
32 10000
40 10000
44 20000
32 15000
41 5000
38 5000
47 10000
35 15000
42 10000
48 15000
20 10000
51 5000
53 10000
42 10000
47 10000
59 10000
47 20000
17 10000
15 5000,
15 15000
39 10000
17 20000
60 5000
25 15000
24 10000
27 10000
21 10000
43 15000
47 15000
29 10000
58 10000
55 10000
38 5000
34 5000
46 10000
23 5000
54 10000
15 5000
43 20000
15 10000
55 10000
41 5000
23 10000
60 15000
50 10000
37 10000
41 10000
26 10000
58 10000
33 5000
54 5000
54 5000
23 10000
37 5000
47 10000
30 10000
32 20000
48 5000
25 15000
33 5000
16 10000
46 5000
52 10000
57 10000
47 5000
54 10000
39 15000
43 15000
50 20000
42 10000
41 5000
29 5000
16 10000
17 15000
25 15000
34 15000
27 20000
28 15000
58 20000
15 20000
58 20000
15 15000
16 20000
18 20000
36 20000
29 10000
57 20000
59 15000
26 10000
26 20000
40 10000
16 20000
53 20000
21 20000
32 20000
33 5000
50 20000
33 20000
40 15000
55 20000
26 20000
31 10000
25 15000
47 15000
19 10000
59 20000
21 15000
43 20000
53 15000
34 20000
36 10000
47 20000
30 10000
19 5000
52 10000
36 20000
18 10000
18 5000
17 10000
57 10000
24 10000
28 10000
36 15000
44 10000
40 5000
18 10000
15 15000
21 10000
31 20000
33 10000
23 5000
60 10000
17 20000
24 10000
15 20000
24 5000
43 20000
45 15000
51 10000
48 5000
26 10000
38 20000
25 20000
34 10000
24 20000
33 10000
29 10000
59 20000
43 10000
47 10000
23 15000
50 20000
44 10000
45 20000
24 10000
16 10000
37 10000
59 20000
22 20000
32 10000
40 10000
44 20000
32 15000
41 5000
38 5000
47 10000
35 15000
42 10000
48 15000
20 10000
51 5000
53 10000
42 10000
47 10000
59 10000
47 20000
17 10000
15 5000,
15 15000
39 10000
17 20000
60 5000
25 15000
24 10000
27 10000
21 10000
43 15000
47 15000
29 10000
58 10000
55 10000
38 5000
34 5000
46 10000
23 5000
54 10000
15 5000
43 20000
15 10000
55 10000
41 5000
23 10000
60 15000
50 10000
37 10000
41 10000
26 10000
58 10000
33 5000
54 5000
54 5000
23 10000
37 5000
47 10000
30 10000
32 20000
48 5000
25 15000
33 5000
16 10000
46 5000
52 10000
57 10000
47 5000
54 10000
39 15000
43 15000
50 20000
42 10000
41 5000
29 5000
16 10000
17 15000
25 15000
34 15000
27 20000
28 15000
58 20000
15 20000
58 20000
15 15000
16 20000
18 20000
36 20000
29 10000
57 20000
59 15000
26 10000
26 20000
40 10000
16 20000
53 20000
21 20000
32 20000
33 5000
50 20000
33 20000
40 15000
55 20000
26 20000
31 10000
25 15000
47 15000
19 10000
59 20000
21 15000
43 20000
53 15000
34 20000
36 10000
47 20000
30 10000
19 5000
52 10000
36 20000
orrelation Coefficient
as a linear correlation coefficient that falls in correlation.
ong negative correlation while +1 indicates strong positive
ANOVA
Source of Variation SS df MS
Between Groups 50.53785 1 50.537854889589
Within Groups 343.6404 632 0.54373477618496
F P-value F crit
92.9457836855 1.29705E-20 3.85621452
Check:
p-Value 5.31%
Test Statistic 16.732
Step 4 Determine the average rank and sum of rank per group
Group Avg.rank
Male 165.7060302
Female 149.1218487
Step 11 Determine Z
Z -1.856301322 =M32/L45
The decision is to fail to reject the null hypothesis because the p value (6.34%)
greater than 5%
Therefore, we can assume that the gender of a person and how satisfied they
were with the services provided by the airlines before covid.
ann Whitney U Test
=COUNTIF(B:B,I4)
=COUNTIF(B:B,I5)
=SUMIF(M4:M5)
=M4*(M4+1)/2
=M5*(M5+1)/2
(U)
=N21-M11
=N22-M12
=M26-$M$4*$M$5/2
=M27-$M$4*$M$5/2
(T1)
COUNTIF($D$1:D1,D2)>0,"", COUNTIF(D:D,D2))
er tied rank
="", "", (E2^3-E2)/12)
=SUM(F:F)
(M4*M5/M6*(M6-1)*((M6^3-M6)/12-M41))
ORM.S.DIST(ABS(L48),TRUE))
45 20000
24 10000
16 10000
37 10000
59 20000
22 20000
32 10000
40 10000
44 20000
32 15000
41 5000
38 5000
47 10000
35 15000
42 10000
48 15000
20 10000
51 5000
53 10000
42 10000
47 10000
59 10000
47 20000
17 10000
15 5000
15 15000
39 10000
17 20000
60 5000
25 15000
24 10000
27 10000
21 10000
43 15000
47 15000
29 10000
58 10000
55 10000
38 5000
34 5000
46 10000
23 5000
54 10000
15 5000
43 20000
15 10000
55 10000
41 5000
23 10000
60 15000
50 10000
37 10000
41 10000
26 10000
58 10000
33 5000
54 5000
54 5000
23 10000
37 5000
47 10000
30 10000
32 20000
48 5000
25 15000
33 5000
16 10000
46 5000
52 10000
57 10000
47 5000
54 10000
39 15000
43 15000
50 20000
42 10000
41 5000
29 5000
16 10000
17 15000
25 15000
34 15000
27 20000
28 15000
58 20000
15 20000
58 20000
15 15000
16 20000
18 20000
36 20000
29 10000
57 20000
59 15000
26 10000
26 20000
40 10000
16 20000
53 20000
21 20000
32 20000
33 5000
50 20000
33 20000
40 15000
55 20000
26 20000
31 10000
25 15000
47 15000
19 10000
59 20000
21 15000
43 20000
53 15000
34 20000
36 10000
47 20000
30 10000
19 5000
52 10000
36 20000
18 10000
18 5000
17 10000
57 10000
24 10000
28 10000
36 15000
44 10000
40 5000
18 10000
15 15000
21 10000
31 20000
33 10000
23 5000
60 10000
17 20000
24 10000
15 20000
24 5000
43 20000
45 15000
51 10000
48 5000
26 10000
38 20000
25 20000
34 10000
24 20000
33 10000
29 10000
59 20000
43 10000
47 10000
23 15000
50 20000
44 10000
45 20000
24 10000
16 10000
37 10000
59 20000
22 20000
32 10000
40 10000
44 20000
32 15000
41 5000
38 5000
47 10000
35 15000
42 10000
48 15000
20 10000
51 5000
53 10000
42 10000
47 10000
59 10000
47 20000
17 10000
15 5000
15 15000
39 10000
17 20000
60 5000
25 15000
24 10000
27 10000
21 10000
43 15000
47 15000
29 10000
58 10000
55 10000
38 5000
34 5000
46 10000
23 5000
54 10000
15 5000
43 20000
15 10000
55 10000
41 5000
23 10000
60 15000
50 10000
37 10000
41 10000
26 10000
58 10000
33 5000
54 5000
54 5000
23 10000
37 5000
47 10000
30 10000
32 20000
48 5000
25 15000
33 5000
16 10000
46 5000
52 10000
57 10000
47 5000
54 10000
39 15000
43 15000
50 20000
42 10000
41 5000
29 5000
16 10000
17 15000
25 15000
34 15000
27 20000
28 15000
58 20000
15 20000
58 20000
15 15000
16 20000
18 20000
36 20000
29 10000
57 20000
59 15000
26 10000
26 20000
40 10000
16 20000
53 20000
21 20000
32 20000
33 5000
50 20000
33 20000
40 15000
55 20000
26 20000
31 10000
25 15000
47 15000
19 10000
59 20000
21 15000
43 20000
53 15000
34 20000
36 10000
47 20000
30 10000
19 5000
52 10000
36 20000
P-value Lower 95% Upper 95% Lower 95.0% Upper 95.0%
8.08E-51 32.0008085 39.7966287 32.00080847 39.7966287
0.91209 -0.00027183 0.00030418 -0.00027183 0.00030418
At first we did the descriptive analysis of the whole data by representing various samples of variables in di
forms including pie charts, histograms, and ogive. We even found measures of central tendency and measu
dispersion on the Likert scale data sample. Then we used parametric and non-parametric tools for analyzin
and finding relations between dependent and independent variables.
First, we used Karl Pearson’s correlation coefficient to find the relation between the age of people and thei
on travel by flights which concluded that there is a very weak relation of 0.8% between the two variables.
We then used the ANOVA test to check if the means of two or more groups are significantly different from
other or not. ANOVA checks the impact of one or more factors by comparing the means of different sampl
doing the ANOVA test on Likert scale data we concluded that covid had a significant impact on the satisfa
of people from airlines services.
Further, we used The Chi-Square Test is done to check whether the hypothesis of the variables is independ
each other or not. Here we analyze if the hypothesis of the amount of money a person usually spends while
and how often that person plans on travelling post-COVID per year, are independent of each other is true o
and by using the test, we conclude that the hypothesis is true.
Then we did The Mann-Whitney U test is used to compare differences between two independent groups w
dependent variable is either ordinal or continuous, but not normally distributed. By doing the test it can be
that that the gender of a person (independent variable) and how satisfied they were with the services provi
airlines before covid (dependent variable) do differ based on gender.
Lastly, we did Regression which is a statistical method used in finance, investing, and other disciplines tha
to determine the strength and character of the relationship between one dependent variable (usually denote
and a series of other variables (known as independent variables). We can therefore conclude that the depen
variable(How much do you usually spend while travelling domestically.) is not that affected by the indepe
variable(age) by looking at the above data.
Lastly, we did Regression which is a statistical method used in finance, investing, and other disciplines tha
to determine the strength and character of the relationship between one dependent variable (usually denote
and a series of other variables (known as independent variables). We can therefore conclude that the depen
variable(How much do you usually spend while travelling domestically.) is not that affected by the indepe
variable(age) by looking at the above data.
amples of variables in different
tral tendency and measures of
metric tools for analyzing our data
he variables is independent of
son usually spends while travelling
ent of each other is true or false