Business Analytics Assignment Riya Mathew 19021141088: STR (Crew - Data)
RIYA MATHEW
19021141088
Import “crew data.csv” from MS Teams > Files and answer the following questions.
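A minimal sketch of the import step, assuming the file has already been downloaded from Teams to a local path. The real file is not reproduced here, so the snippet writes a three-row stand-in whose rows and header spelling are made up for illustration. It shows why a column headed "Job code" appears as Job.code in the commands that follow.

```r
# Hypothetical stand-in for "crew data.csv": three made-up rows in a temp file.
csv_path <- tempfile(fileext = ".csv")
writeLines(c(
  "Job code,Salary,bonus",
  "PILOT1,65000,3000",
  "FLTAT1,21000,500",
  "FLTAT2,25000,800"
), csv_path)

# read.csv() uses check.names = TRUE by default, so the header "Job code"
# is converted to the syntactically valid name "Job.code".
Crew.data <- read.csv(csv_path, stringsAsFactors = TRUE)
str(Crew.data)
```

With the real file, csv_path would simply be the location where "crew data.csv" was saved.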
> str(Crew.data)
> summary(Crew.data$bonus)
> sd(Crew.data$bonus)
[1] 2552.178
> var(Crew.data$bonus)
[1] 6513610
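The two figures above can be cross-checked, since the variance is the square of the standard deviation. The sketch below uses only the printed values, so a small difference from rounding is expected. The vector x is made up to show the same identity on raw data.

```r
# Values as printed in the console output above.
sd_bonus  <- 2552.178
var_bonus <- 6513610

# Squaring the standard deviation should recover the variance,
# up to rounding in the printed figures.
abs(sd_bonus^2 - var_bonus) < 5

# The same identity on a small made-up vector (not the crew data).
x <- c(500, 800, 3000, 1200)
all.equal(var(x), sd(x)^2)
```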
3. How many groups does the variable “Job code” contain?
> Crew.data%>%count(Job.code)
Job.code n
1 FLTAT1 14
2 FLTAT2 18
3 FLTAT3 12
4 PILOT1 8
5 PILOT2 9
6 PILOT3 8
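The count table has one row per group, so the answer can be read off as six. As a sketch, the number of groups can also be computed directly. The vector job_code below is rebuilt from the counts printed above and stands in for Crew.data$Job.code.

```r
# Rebuild the Job.code column from the counts printed above.
job_code <- rep(
  c("FLTAT1", "FLTAT2", "FLTAT3", "PILOT1", "PILOT2", "PILOT3"),
  times = c(14, 18, 12, 8, 9, 8)
)

n_groups <- length(unique(job_code))   # number of distinct job codes
n_groups
nlevels(factor(job_code))              # same answer via factor levels
length(job_code)                       # total number of crew records
```

On the imported data the equivalent call would be length(unique(Crew.data$Job.code)).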
> table(Crew.data$Job.code)
> Emptb=table(Crew.data$Job.code)
> Emptb
> class(Emptb)
[1] "table"
> Empf=as.data.frame(Emptb)
> Empf
Var1 Freq
1 FLTAT1 14
2 FLTAT2 18
3 FLTAT3 12
4 PILOT1 8
5 PILOT2 9
6 PILOT3 8
> names(Empf)=c("Jobcat","count")
> Empf
Jobcat count
1 FLTAT1 14
2 FLTAT2 18
3 FLTAT3 12
4 PILOT1 8
5 PILOT2 9
6 PILOT3 8
Using dplyr
> Crew.data%>%count(Job.code)
Job.code n
1 FLTAT1 14
2 FLTAT2 18
3 FLTAT3 12
4 PILOT1 8
5 PILOT2 9
6 PILOT3 8
> Crew.data%>%group_by(Job.code)%>%summarise(count=n())
> Crew.data%>%group_by(Job.code)%>%summarise(mean(Salary))
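For comparison, a base R sketch of the same group-wise mean, which runs without loading dplyr. The data frame toy and its values are made up for illustration and are not the crew data.

```r
# Made-up rows for illustration only; these are not the crew data.
toy <- data.frame(
  Job.code = c("PILOT1", "PILOT1", "FLTAT1", "FLTAT1", "FLTAT1"),
  Salary   = c(70000, 80000, 21000, 24000, 27000)
)

# Base R counterpart of group_by(Job.code) %>% summarise(mean(Salary)).
mean_by_job <- aggregate(Salary ~ Job.code, data = toy, FUN = mean)
names(mean_by_job)[2] <- "mean_salary"   # give the summary column a clear name
mean_by_job

# tapply() returns the same group means as a named vector.
tapply(toy$Salary, toy$Job.code, mean)
```

Note that the %>% pipelines in this transcript assume library(dplyr) was loaded earlier in the session.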
> summary(Crew.data$Salary)
> table(Crew.data$Salary)
21000 22000 23000 24000 25000 26000 27000 28000 29000 30000 32000 33000 34000 35000
    1     2     1     1     2     1     1     2     2     1     1     3     4     3
36000 37000 38000 41000 42000 43000 44000 45000 47000 48000 65000 66000 68000 69000
    2     2     3     2     1     1     3     2     2     1     1     1     1     1
71000 72000 73000 75000 76000 77000 78000 81000 82000 83000 86000 92000 93000 94000
    1     2     1     1     1     1     1     1     1     2     1     1     1     1
 95000 100000 105000 108000 112000
     1      1      1      1      1
> Emptb=table(Crew.data$Salary)
> Emptb
21000 22000 23000 24000 25000 26000 27000 28000 29000 30000 32000 33000 34000 35000
    1     2     1     1     2     1     1     2     2     1     1     3     4     3
36000 37000 38000 41000 42000 43000 44000 45000 47000 48000 65000 66000 68000 69000
    2     2     3     2     1     1     3     2     2     1     1     1     1     1
71000 72000 73000 75000 76000 77000 78000 81000 82000 83000 86000 92000 93000 94000
    1     2     1     1     1     1     1     1     1     2     1     1     1     1
 95000 100000 105000 108000 112000
     1      1      1      1      1
> Empf=as.data.frame(Emptb)
> Empf
Var1 Freq
1 21000 1
2 22000 2
3 23000 1
4 24000 1
5 25000 2
6 26000 1
7 27000 1
8 28000 2
9 29000 2
10 30000 1
11 32000 1
12 33000 3
13 34000 4
14 35000 3
15 36000 2
16 37000 2
17 38000 3
18 41000 2
19 42000 1
20 43000 1
21 44000 3
22 45000 2
23 47000 2
24 48000 1
25 65000 1
26 66000 1
27 68000 1
28 69000 1
29 71000 1
30 72000 2
31 73000 1
32 75000 1
33 76000 1
34 77000 1
35 78000 1
36 81000 1
37 82000 1
38 83000 2
39 86000 1
40 92000 1
41 93000 1
42 94000 1
43 95000 1
44 100000 1
45 105000 1
46 108000 1
47 112000 1
> names(Empf)=c("Salary","count")
> Empf
Salary count
1 21000 1
2 22000 2
3 23000 1
4 24000 1
5 25000 2
6 26000 1
7 27000 1
8 28000 2
9 29000 2
10 30000 1
11 32000 1
12 33000 3
13 34000 4
14 35000 3
15 36000 2
16 37000 2
17 38000 3
18 41000 2
19 42000 1
20 43000 1
21 44000 3
22 45000 2
23 47000 2
24 48000 1
25 65000 1
26 66000 1
27 68000 1
28 69000 1
29 71000 1
30 72000 2
31 73000 1
32 75000 1
33 76000 1
34 77000 1
35 78000 1
36 81000 1
37 82000 1
38 83000 2
39 86000 1
40 92000 1
41 93000 1
42 94000 1
43 95000 1
44 100000 1
45 105000 1
46 108000 1
47 112000 1
Using dplyr
> Crew.data%>%count(Salary)
Salary n
1 21000 1
2 22000 2
3 23000 1
4 24000 1
5 25000 2
6 26000 1
7 27000 1
8 28000 2
9 29000 2
10 30000 1
11 32000 1
12 33000 3
13 34000 4
14 35000 3
15 36000 2
16 37000 2
17 38000 3
18 41000 2
19 42000 1
20 43000 1
21 44000 3
22 45000 2
23 47000 2
24 48000 1
25 65000 1
26 66000 1
27 68000 1
28 69000 1
29 71000 1
30 72000 2
31 73000 1
32 75000 1
33 76000 1
34 77000 1
35 78000 1
36 81000 1
37 82000 1
38 83000 2
39 86000 1
40 92000 1
41 93000 1
42 94000 1
43 95000 1
44 100000 1
45 105000 1
46 108000 1
47 112000 1
> Crew.data%>%group_by(Salary)%>%summarise(count=n())
> Crew.data%>%group_by(Salary)%>%summarise(mean(Salary))
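Salary takes 47 distinct values across 69 records, so most counts in the tables above are 1. One possible way to condense the table is to group salaries into bands with cut(). This is a sketch only: the vector salary is rebuilt from the counts printed above, and the 20000-wide bands are an arbitrary choice rather than part of the assignment.

```r
# Rebuild the Salary column from the frequency table printed above.
values <- c(
  21000, 22000, 23000, 24000, 25000, 26000, 27000, 28000, 29000, 30000,
  32000, 33000, 34000, 35000, 36000, 37000, 38000, 41000, 42000, 43000,
  44000, 45000, 47000, 48000, 65000, 66000, 68000, 69000, 71000, 72000,
  73000, 75000, 76000, 77000, 78000, 81000, 82000, 83000, 86000, 92000,
  93000, 94000, 95000, 100000, 105000, 108000, 112000
)
counts <- c(
  1, 2, 1, 1, 2, 1, 1, 2, 2, 1,
  1, 3, 4, 3, 2, 2, 3, 2, 1, 1,
  3, 2, 2, 1, 1, 1, 1, 1, 1, 2,
  1, 1, 1, 1, 1, 1, 1, 2, 1, 1,
  1, 1, 1, 1, 1, 1, 1
)
salary <- rep(values, times = counts)

# Assumed band width of 20000; dig.lab = 6 keeps the labels out of
# scientific notation.
bands <- cut(salary, breaks = seq(20000, 120000, by = 20000), dig.lab = 6)
band_counts <- table(bands)
band_counts
```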
> summary(mtcars$mpg)
> summary(mtcars$cyl)
> summary(mtcars$disp)
> summary(mtcars$hp)
Min. 1st Qu. Median Mean 3rd Qu. Max.
52.0 96.5 123.0 146.7 180.0 335.0
> summary(mtcars$drat)
> summary(mtcars$wt)
> summary(mtcars$qsec)
> summary(mtcars$vs)
> summary(mtcars$am)
> summary(mtcars$gear)
> summary(mtcars$carb)
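The eleven calls above can be collapsed into one. As a sketch, sapply() applies summary() to every column of mtcars and returns the six statistics per variable as a matrix. The name all_summaries is introduced here for illustration.

```r
# mtcars ships with base R (datasets package), so no import is needed.
all_summaries <- sapply(mtcars, summary)   # six statistics per variable
round(all_summaries, 1)

# Pull out a single variable to compare with the hp output above.
all_summaries[, "hp"]
```

Calling summary(mtcars) directly prints the same statistics in a column-per-variable layout.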