You are on page 1of 2

List of Figures

SALARY ANALYSIS

Sl No Figure Name Page no


Fig1 Interaction Plot 5
Fig2 Point Plot 5

PRINCIPAL COMPONENT ANALYSIS

Sl No Figure Name Page no


Fig1 - Fig 17 Histogram and boxplot of each column variable 12-17
Fig 18 Point Plot 18
Fig 19 Correlation Heat map 19
Fig 20 Box plot of original Data 22
Fig 21 Box plot of scaled Data 22
Fig 22 Heat Map Correlation between components and features 30
Fig 23 Cumulative variance Plot 30
Fig 24 Scree Plot 32
Fig 25 Comparison between Cumulative and Individual explained Variance 32

List of Tables

SALARY ANALYSIS

Sl No Name Page no
Table 1 Dataset Sample 3
Table.2 Data information 4
Table.3 Descriptive Summary of the data 4
Table.4 ANOVA for Education 5
Table.5 ANOVA for Occupation 5
Table.6 Tukey HSD for Education 6
Table.7 and 8 Two way Anova 7

PRINCIPAL COMPONENT ANALYSIS

Sl No Name Page no
Table 1 Dataset Description 9
Table.2 Dataset Sample 9
Table.3 Data information 10
Table.4 Descriptive Summary of the data 11
Table.5 Scaled Data 20
Table.6 Correlation Table 21
Table.7 and 8 PCA component Table 29
Table9 Cumulative explained Variance 31

2|Page
SALARY ANALYSIS

EXECUTIVE SUMMARY

Salary is hypothesized to depend on educational qualification and Occupation. To understand the


dependency, the salaries of 40 individuals [SalaryData.csv] are collected and each person’s
educational qualification and Occupation are noted. Educational qualification is at three levels, High
school graduate, Bachelor, and Doctorate. Occupation is at four levels, Administrative and clerical,
Sales, Professional or specialty, and Executive or managerial. A different number of observations
are in each level of education – Occupation combination.
[Assume that the data follows a normal distribution. In reality, the normality assumption may not
always hold if the sample size is small.]
DATA DESCRIPTION

1. Education - Three Education level of Employee are considered in given sample.


1.1 Doctorate
1.2 Bachelors
1.3 High school graduate
2. Occupation - Four Occupation of individuals are considered in given sample.
2.1 Administrative and clerical
2.2 Sales
2.3 Professional or specialty
2.4 Executive or managerial
3. Salary – Earnings of an individual.

DATASET SAMPLE

Out of 40 rows, We have considered only first 5 rows details to check the data.
All this variables details has been explained under Data description.

Table 1. Dataset Sample

EXPLORATORY DATA ANALYSIS

From Table 2,

We can conclude that there are total 40 rows and 3 columns in the dataset. Salary columns is
of Integer type and Education and Occupation are of Object type.

3|Page

You might also like