
Assignment 1

Report on Data Visualization


Name: Abin Thomas
Register No: 2027918
Date: 19/July/2020

Topic: Data Visualization

Objective: Identify/transform data on four levels of measurement (Nominal, Ordinal, Interval, and Ratio) and apply data visualization using a Bar Chart, Pie Chart, Histogram, and Scatter diagram.

Source of Data:
www.niti.gov.in
www.rbidocs.rbi.org.in
www.kaggle.com

1) Nominal Data Representation
The total number of students in each ethnicity group.

1.1) Data for data visualization.

Race/Ethnicity    Number of Students    Relative Distribution
group A           89                    9%
group B           190                   19%
group C           319                   32%
group D           262                   26%
group E           140                   14%
Total             1000                  100%

1.2) Chart

[Pie chart: Students & Ethnicity Relation. Shares: group A 0.09, group B 0.19, group C 0.32, group D 0.26, group E 0.14]

1.3) Inference
We have the nominal distribution of students across the different ethnicity groups, represented graphically with a pie chart. Group C has the highest strength, at 32% of all students, while group A has the lowest, at just 9%. Of the remaining groups, group B accounts for 19%, group D for 26%, and group E for 14%.
Hence it is concluded that the highest number of students are from group C, which suggests a noticeable effort by group C towards students' education.
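
The relative distribution reported above can be reproduced from the raw counts. The following is a minimal sketch using only the figures from the table in 1.1; the names `counts` and `relative_distribution` are illustrative and not part of the original report.

```python
# Student counts per group, copied from the table in section 1.1.
counts = {
    "group A": 89,
    "group B": 190,
    "group C": 319,
    "group D": 262,
    "group E": 140,
}


def relative_distribution(counts):
    """Return each category's share of the total as a whole-number percentage."""
    total = sum(counts.values())
    return {group: round(100 * n / total) for group, n in counts.items()}


shares = relative_distribution(counts)

# For nominal data only counting is meaningful, so the summary is limited
# to the largest and smallest categories.
largest = max(counts, key=counts.get)
smallest = min(counts, key=counts.get)

for group, pct in shares.items():
    print(f"{group}: {pct}%")
print("largest:", largest, "| smallest:", smallest)
```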

2) Ordinal Data Representation

Ratings given to a chocolate bar by its customers.

2.1) Data for data visualization.


Rating Frequency
Disappointing 4
Unpleasant 45
Satisfactory 741
Premium 1003
Elite 2

2.2) Chart

[Horizontal bar chart: Chocolate Rating. Y-axis: Rating; x-axis: Frequency (0 to 1200). Bars: Elite 2, Premium 1003, Satisfactory 741, Unpleasant 45, Disappointing 4]

2.3) Inference
We have the ordinal distribution of ratings for the chocolate bar given by over 1700
customers, along with information on their regional origin, the percentage of cocoa, the variety of
chocolate bean used, and where the beans were grown. The chocolate was rated
Disappointing by 4 people, Unpleasant by 45, Satisfactory by 741,
Premium by 1003, and Elite by 2.

It can be concluded that the company has taken efforts to keep up its standards. The rating
given most often by customers is Premium, which reflects the premium quality of the product,
i.e., its superior flavor development, character, and style. A large number of people gave a rating
of Satisfactory, which indicates a praiseworthy, well-made product with special qualities. It also
points to the further effort needed to bring the product up to premium quality.
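
Because the ratings are ordered, the mode and the median category are both meaningful summaries. The sketch below derives them from the frequency table in 2.1; it assumes the ratings rank from Disappointing (lowest) to Elite (highest), and the function names are illustrative only.

```python
# Rating frequencies from section 2.1, listed from lowest to highest rank
# (the ordering is an assumption based on the rating labels).
ratings = [
    ("Disappointing", 4),
    ("Unpleasant", 45),
    ("Satisfactory", 741),
    ("Premium", 1003),
    ("Elite", 2),
]


def modal_category(freqs):
    """Return the label with the highest frequency."""
    return max(freqs, key=lambda pair: pair[1])[0]


def median_category(freqs):
    """Return the label containing the middle observation of the ordered data."""
    total = sum(n for _, n in freqs)
    middle = (total + 1) / 2  # position of the median observation
    running = 0
    for label, n in freqs:
        running += n
        if running >= middle:
            return label


total_ratings = sum(n for _, n in ratings)
mode_label = modal_category(ratings)
median_label = median_category(ratings)
print(total_ratings, mode_label, median_label)
```

A mean is deliberately not computed here, since the distance between two neighbouring ratings is not defined for ordinal data.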

3) Ratio Data Representation
The relation between the literacy rate and the sex ratio in the country.

3.1) Data for data visualization.


Literacy Rate in % (Person)    Sex Ratio
86.6 876
67 993
65.4 938
72.2 958
61.8 918
86 818
70.3 991
88.7 973
78 919
75.6 879
82.8 972
67.2 889
66.4 948
75.4 973
94 1084
91.8 946
69.3 931
82.3 929
79.2 985
74.4 989
91.3 976
79.6 931
86.2 868
72.9 979
85.8 1037
75.8 895
66.1 928
81.4 890
80.1 996
87.2 960
67.7 912
78.8 963
76.3 950

3.2) Chart

[Scatter diagram: Literacy Rate & Sex Ratio. X-axis: Literacy Rate (60 to 100); y-axis: Sex Ratio (800 to 1100)]

3.3) Inference
The scatter diagram plotted above is based on the sex ratio and the literacy rate in India. The literacy
rate is plotted along the x-axis and the sex ratio along the y-axis. From the graph, it is clear that the
points for the states of India are widely scattered, which indicates the lack of a direct relationship
between the two. It is also found, however, that the state with the highest literacy rate (94%) has the
highest sex ratio (1084), and the state with the second-highest sex ratio (1037) has a literacy rate above 85%.

Hence it can be concluded that the sex ratio of a state is not directly related to its literacy rate;
rather, it may be influenced by a combination of many factors, which can include literacy as well as
social and cultural aspects.
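
The visual impression of a weak relationship can be checked numerically with the Pearson correlation coefficient, where a value near 0 indicates little linear association. This is a minimal sketch over the 33 pairs in the table in 3.1; the report itself does not compute this figure, and the helper `pearson_r` is an illustrative name.

```python
import math

# Literacy rate (%) and sex ratio, copied row by row from section 3.1.
literacy = [86.6, 67, 65.4, 72.2, 61.8, 86, 70.3, 88.7, 78, 75.6, 82.8,
            67.2, 66.4, 75.4, 94, 91.8, 69.3, 82.3, 79.2, 74.4, 91.3, 79.6,
            86.2, 72.9, 85.8, 75.8, 66.1, 81.4, 80.1, 87.2, 67.7, 78.8, 76.3]
sex_ratio = [876, 993, 938, 958, 918, 818, 991, 973, 919, 879, 972,
             889, 948, 973, 1084, 946, 931, 929, 985, 989, 976, 931,
             868, 979, 1037, 895, 928, 890, 996, 960, 912, 963, 950]


def pearson_r(xs, ys):
    """Pearson correlation: covariance divided by the product of the spreads."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)


r = pearson_r(literacy, sex_ratio)
print(f"Pearson r = {r:.3f}")
```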

4) Interval Data Representation
Marks scored by students in the subject Maths.

4.1) Data for data visualization.


Lowest 0
Highest 100
Class Intervals 10

Interval Frequency
00-10 2
11-20 2
21-30 12
31-40 34
41-50 100
51-60 189
61-70 270
71-80 215
81-90 126
91-100 50
More 0

4.2) Chart

[Histogram: Mark Distribution for Maths. X-axis: Marks in Maths; y-axis: Number of Students (0 to 300). Bars: 00-10: 2, 11-20: 2, 21-30: 12, 31-40: 34, 41-50: 100, 51-60: 189, 61-70: 270, 71-80: 215, 81-90: 126, 91-100: 50, More: 0]

4.3) Inference
The histogram is plotted based on the marks scored by the students in the subject Maths. The distribution is
made with 10 intervals. From the graph, it is found that the highest number of students, 270, falls in the
range 61-70, and the least number of students falls in the ranges 00-10 and 11-20, each with a
frequency of 2. It can therefore be concluded that the most common score range for the subject Maths
is 61-70.

Hence it can be concluded that proper efforts have been taken by the staff and the management in the
subject Maths; as a result, the majority of the students (661 of 1000 scored above 60) have got better marks in the subject.
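
The modal interval and the share of students in the upper intervals can be read directly off the frequency table in 4.1. The sketch below does this; treating "above 60" as the threshold for better marks is an assumption made here for illustration, not something the report defines.

```python
# Frequency table from section 4.1 (the empty "More" bucket is omitted).
marks = [
    ("00-10", 2), ("11-20", 2), ("21-30", 12), ("31-40", 34), ("41-50", 100),
    ("51-60", 189), ("61-70", 270), ("71-80", 215), ("81-90", 126), ("91-100", 50),
]

total_students = sum(n for _, n in marks)

# Modal interval: the class with the highest frequency.
modal_interval = max(marks, key=lambda pair: pair[1])[0]

# Students in intervals whose lower bound is above 60
# (hypothetical threshold chosen for this example).
above_60 = sum(n for label, n in marks if int(label.split("-")[0]) > 60)
share_above_60 = above_60 / total_students

print(modal_interval, above_60, f"{share_above_60:.1%}")
```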
