You are on page 1of 6

Data Analysis for Managers

CIA 1

Report on Data Visualization

MBA PROGRAMME
SCHOOL OF BUSINESS AND MANAGEMENT
CHRIST (DEEMED TO BE UNIVERSITY), BANGALORE

Section L

Name Reg. No.


Aneesh Gopinath 2027914
 NOMINAL DATA
Nominal data can be analyzed using the grouping method. The variables can be grouped together
into categories, and for each category, the frequency or percentage can be calculated. The data can
also be presented visually, such as by using a pie chart.

List of people who survived the Titanic

The dataset provides the details of all the passenger who travelled in the Titanic ship. Titanic was a
British passenger liner operated by the White Star Line that sank in the North Atlantic Ocean in
the early morning hours of 15 April 1912

Source: Titanic Dataset. (n.d.). Retrieved July 17, 2020, from


https://www.kaggle.com/c/titanic-dataset/data?select=titanic_test.csv

chart of people who survived the titanic


non survivors survivors

39%

61%

Inference drawn

The dataset consists of all the passengers who traveled on the RMS Titanic ship in 1912. This data
represents the number of people who lost life in the tragic incident that took place in the North
Atlantic ocean. The titanic has a passenger capacity of 2,435 and was sailing on full size the day it
sank. It can be seen that almost 61% of the people who were on board did not survive the accident,
and the rest 39% managed to survive. Of the people who survived, it was mostly females and
children who survived as they were given the 1 st priority during the incident.

Conclusion / Recommendation

It can be concluded that the RMS Titanic did not have enough of life-saving equipment and other
lifeboats to accommodate all the passengers in the ship during the evacuation process. This resulted
in the massive loss of lives of people traveling in the ship.

It is recommended that every cruise ship accommodate only a certain number of people that it can
handle during an evacuation process so that we won't have to see another disaster like what
happened with the RMS Titanic. Titanic was known as the unsinkable cruise ship, and that claim
turned out to be wrong in its very first trip. Thus we should understand that safety should always be
given importance.
 ORDINAL DATA
Ordinal data is a categorical, statistical data type where the variables have natural, ordered
categories and the distances between the groups are not known

Dataset on cooking preference of graduate students

The dataset provides a survey of 125 graduate students on how much they prefer cooking compared
to eating from outside or takeaways. The feedback from these students have been recorded by a
study and is represented in a Likert scale of 1-5

Source: BoraPajo. (2017, April 23). Food choices. Retrieved July 17, 2020, from
https://www.kaggle.com/borapajo/food-choices?select=food_coded.csv

Love cooking Like cooking Prefer cooking Dislike cooking Hate cooking
(1) (2) (3) (4) (5)

Survey on how many students prefer cooking


70

60

50

40

30

20

10

0
love cooking like cooking prefer cooking dislike cooking hate cooking

Inference drawn

The X-axis represents the responses from 126 graduate students, who were asked to answer a
questionnaire on how much like to cook compared to eating outside or getting takeaways. These
responses were presented in the form of a Likert of 5 scales from how much they love cooking to
how much they hate cooking.

Y-axis represents the number of respondents who participated in the survey

From the above graph, it can be inferred that out of the 126 graduate students, the majority of them
preferred cooking that eating from outside or getting a takeaway. On further analysing we can
understand that there were only 15 students who said that they love to cook, while the highest
number of students counting to 60 said that they like cooking. 12 and 11 students responded that
they dislike and hate cooking, respectively. Then there was a group of 23 people who had a neutral
opinion on the preference for cooking.
Conclusion / Recommendation

we can thereby conclude that students do prefer cooking by themselves than going out or getting a
takeaway. And that only a small percent of students do not prefer cooking.

It is recommended that students have an excellent physical and mental health. In order to attain
this, it is necessary to eat healthily, and this can be achieved only by practicing healthy eating habits.
Thus cooking can always be regarded as a better option than outside food as they contain many
preservatives and unhealthy ingredients.

 INTERVAL DATA
An interval scale is one where there is order, and the difference between the two values is
meaningful.

Number of fatalities a due to Novel Coronavirus


This dataset has daily level information on the number of affected cases, deaths, and recovery
from 2019 novel coronavirus. It is a time-series data and so the number of cases on any given
day is the cumulative number.

Source: Novel Corona Virus, 2019 Dataset. (2020, July 15). Kaggle.

https://www.kaggle.com/sudalairajkumar/novel-corona-virus-2019-dataset
Inference drawn

The X-axis represents the different age intervals with a class interval of 10years. The results of 820
individuals have been recorded so far and are represented in the above graph.

The y-axis represents the number of lives lost due to the Novel coronavirus in India.

From the above graph, it can be understood that the Novel Coronavirus has mainly been affected
severely in people of the age group from 50-60 years olds. Novel coronavirus has the least affected
for the very olds and also infants and children below the age of 20. As people of age 20 to 70 would
have to go out for various purposes, these are the age group which is most prone towards getting
the virus, and that is what leads to the high number of fatalities in this age group.

Conclusion / Recommendation

It can be concluded that the highest number of fatalities are found in adults between the age of 20
to 70. As these are the working group and they have more probability of falling ill compared to
minors and old age.
It is recommended that everyone follows government instructions and stay at home and stay safe
during the global pandemic. And in case of any symptoms, it is advised to be in quarantine and to
inform the authorities so that we together can stop the communal spread of the Novel Coronavirus.
 RATIO DATA

Ratio Data is defined as quantitative data, having the same properties as interval data, with an equal
and definitive ratio between each data and absolute “zero” being treated as a point of origin.

BMI of 30 patients in order to track their diebetes

The dataset shows the BMI information of different patients in a hospital, and how a change in
height and weight affects the a persons BMI

Source: bmi_data. (2019, November 19). Kaggle. https://www.kaggle.com/freego1/bmi-data

Calculation of BMI of 30 Patients


180
160
140
weight in pounds

120
100
80
60
40
20
0
63 64 65 66 67 68 69 70 71 72
Height in inch

Inference drawn

From the above graph it can be inferred that almost all the patients have their BMI between the
range of 15 to 25 which is calculated from their respective heights and weights

Conclusion / Recommendation

BMI is considered to be a very important risk factor for diabetes, obesity and heart diseases. The
units of measurement in this dataset for Age is years, Height is inches and Weight is pounds. The
BMI of a healthy person is supposed to be below 25, and thus all the patients are found to be
healthy from the above dataset based graph.

You might also like