You are on page 1of 5

Data Analysis for Managers

CIA 1

Report on Data Visualization

MBA PROGRAMME
SCHOOL OF BUSINESS AND MANAGEMENT
CHRIST (DEEMED TO BE UNIVERSITY), BANGALORE

Name Reg. No.


Aiswarya R 2027953
NOMINAL DATA

Nominal data can be analyzed using grouping method. Different variables can be grouped together
into categories, and each category, the frequency and percentage can be calculated. This can also be
presented visually, such as by using a pie chart.

Data of people in Indian Prison

The dataset provides the details of the number of people currently at different prisons throughout
India. The data set also has the information regarding the nationality of the prisoners. This
information was used to plot a pie chart on how many of the prisoners were Indians and foreigners.

Number of people in Indian prison

30%

Foreigners in Indian prison


Indians in Indian prison

70%

Source: Indian Prison Statistics. (2017, September 5). Kaggle.

https://www.kaggle.com/rajanand/prison-in-india

Inference drawn

From the above pie chart, it can be inferred that there is a significant number of foreigners in Indian
prisons. Almost 70% of the prison inmates are Indians while the rest 30% inmates are foreigners who
had either come to India for travel or others who had indulged in crimes that happened in India.

Conclusion / Recommendation

it can be concluded that there is significant number of foreigners are inmate in Indian prisons, from
this we understand that Indian legislative system is and has always been one of the best in the
world, it does not favour anyone or any country rather is fair for everyone irrespective of whether
the individual is a tourist or an official.

30% is an alarming rate and so it is recommended that India create strict laws for other nationalities
who visit India and make certain pacts with respective embassy’s regarding the issue, so that the
crime rates can be henceforth reduced to a great extend.
ORDINAL DATA
Ordinal data is a categorical or statistical data type in which the variables have natural, ordered
categories and the distances are unknown

Types of coffee students prefer

This dataset includes information on food choices, nutrition, preferences, childhood favorites, and
other information from college students. There are 126 responses from students. The coffee
choice of students are selected and represented below

Espresso Cappuccino Americano Doppio Latte


(1) (2) (3) (4) (5)

Types of coffee students prefer


45
40
35
30
No. of students

25
20
15
10
5
0
Espresso Cappuccino Americano Doppio Latte

Source: Food choices. (2017, April 23). Kaggle. https://www.kaggle.com/borapajo/food-

choices?select=food_coded.csv

Inference drawn

From the above data it can be inferred that Latte is the most preferred type of coffee among
students, followed by Cappuccino and then Americano and so on. Espresso is found to be the least
preferred type of coffee, which is popular in countries like Italy.

Conclusion / Recommendation

The above data concludes that students do not prefer high caffeine coffees like espresso and rather
go for milder ones like latte or cappuccino. Coffee is certainly an important part in one’s daily routine
especially for students as it help keep stress away and refreshes the body and mind

It is recommended to have coffee everyday as it boosts energy and also it has been proved that
coffee consumption decreases mortality rate and reduces risks for certain diseases.
INTERVAL DATA

An interval scale is one where there is order and the difference between two values is meaningful

Exam scores of 100 students

The dataset contains exam scores of 100 students in a class test. And the histogram represented
below shows the results of these students

Source: Kaggle: Your Home for Data Science. (2019, August 2). Kaggle.

https://www.kaggle.com/tanmoyie/comments

Inference drawn

It can be understood from the above graph that majority of the students have scored an above
average score of 50-80 marks. Bare minimal students have secured a score below 30 which would be
considered as failure. This shows the most students are efficient to a level more than the average
found among students from different institutions.

Conclusion / Recommendation

Marking is scale that determines the performance of a students and how well he/she has
represented what has been taught. A histogram like the one represented above shows the
performance of the whole class in a glance and also helps faculty to decide on what is lacking or how
well has knowledge transmission taken place.

It is recommended that faculty keeps a regular track of the class performance and also individual
performance of students which will help students in prospering better.
RATIO DATA

Ratio Data is defined as quantitative data, having the same properties as interval data, with an equal
and definitive ratio between each data and absolute “zero” being treated as a point of origin.

BMI of 30 patients in order to track their diebetes

The dataset shows the BMI information of different patients in a hospital, and how a change in
height and weight affects the a persons BMI

Bmi of 30 pati ents


180
160
140
120
weight in pound

100
80
60
40
20
0
62 64 66 68 70 72 74 76
height in inch

Inference drawn

From the above graph it can be inferred that almost all the patients have their BMI between the
range of 15 to 25 which is calculated from their respective heights and weights

Conclusion / Recommendation

BMI is considered to be a very important risk factor for diabetes, obesity and heart diseases. The
units of measurement in this dataset for Age is years, Height is inches and Weight is pounds. The
BMI of a healthy person is supposed to be below 25, and thus all the patients are found to be
healthy from the above dataset based graph.

Source: bmi_data. (2019, November 19). Kaggle. https://www.kaggle.com/freego1/bmi-data

You might also like