
Reg. No: 2127758

CIA – 1

REPORT ON DATA VISUALIZATION

Under the Guidance of

Dr. A.Subburaj

CHRIST (DEEMED TO BE UNIVERSITY), BANGALORE

AUG 2021

Done By:

SUREKHA Y

SECTION – J

ROLL NUMBER – 2127758


BIG MART SALES


Introduction:
The Big Mart sales data set contains variables of several kinds. From it, we can identify all
four data types (nominal, ordinal, interval, and ratio), and each can be represented in a
suitable graphical form.

Nominal:
From the data set, we can identify item type as a nominal variable. The sample data is
summarized in the table below, which lists each item type and the number of items of that type
available in the Big Mart sales data.

ITEM TYPE                FREQUENCY

Baking Goods             215
Breads                   60
Health and Hygiene       115
Canned                   122
Fruits and Vegetables    209
Household                126
Snack Foods              315

Table 1: Frequency distribution of Item type

Fig. 1: Pie chart of the number of items by item type

Inference: The graph is drawn from the table above. Because item type is qualitative data measured
on a nominal scale, it can be plotted as either a pie chart or a bar graph; a pie chart is used here.
The chart shows how the items available in Big Mart are divided among the item types.
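
The slice sizes of such a pie chart follow directly from the frequencies in Table 1. The following is a minimal sketch, assuming the counts are typed in by hand; the names `item_counts` and `pie_shares` are illustrative and not part of the dataset.

```python
# Frequencies copied from Table 1 (item type -> number of items).
item_counts = {
    "Baking Goods": 215,
    "Breads": 60,
    "Health and Hygiene": 115,
    "Canned": 122,
    "Fruits and Vegetables": 209,
    "Household": 126,
    "Snack Foods": 315,
}


def pie_shares(counts):
    """Return each category's share of the total as a percentage."""
    total = sum(counts.values())
    return {name: 100 * n / total for name, n in counts.items()}


shares = pie_shares(item_counts)

# Print the categories from the largest slice to the smallest.
for name, pct in sorted(shares.items(), key=lambda kv: kv[1], reverse=True):
    print(f"{name:<22} {pct:5.1f}%")
```

The resulting percentages could be passed as slice sizes to any plotting tool that draws pie charts.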

Data Source: https://www.kaggle.com/mragpavank/big-mart-sales-dataset


Ordinal:
From the dataset, we can identify outlet size (high, medium, small) as an ordinal variable.
The sample data is summarized in the frequency table below.

Outlet size    No. of outlets    Percentage

High           25                14%
Medium         78                44%
Small          73                41%
Total          176

Table 2: Frequency distribution of outlet size

Fig. 2: Bar graph of outlet sizes

Inference: Because outlet size is ordinal data, its categories have a natural order (small, then
medium, then high), and they can also be ranked by frequency. The tabulated data above can be
plotted as either a pie chart or a bar graph.

According to the bar graph, medium-size outlets are the most numerous, small-size outlets are
second, and high-size outlets are the fewest, i.e., medium size outlets > small size outlets >
high size outlets as per the sample data taken.
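
The percentages and the frequency ranking can be recomputed from the counts in Table 2. This is a small sketch under the assumption that the counts are entered manually; `SIZE_ORDER`, `outlet_counts`, and the other names are illustrative only.

```python
# Outlet sizes in their natural (ordinal) order, with counts from Table 2.
SIZE_ORDER = ["Small", "Medium", "High"]
outlet_counts = {"High": 25, "Medium": 78, "Small": 73}

total = sum(outlet_counts.values())

# Percentage of outlets in each size category, rounded to whole percent.
percentages = {
    size: round(100 * outlet_counts[size] / total) for size in SIZE_ORDER
}

# The same categories ranked from most to least frequent.
by_frequency = sorted(outlet_counts, key=outlet_counts.get, reverse=True)

for size in SIZE_ORDER:
    print(f"{size:<7} {outlet_counts[size]:3d} {percentages[size]:3d}%")
print("Ranked by frequency:", " > ".join(by_frequency))
```

Keeping `SIZE_ORDER` separate from the frequency ranking lets a bar graph show the bars in either order.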

Data Source: https://www.kaggle.com/mragpavank/big-mart-sales-dataset


Interval:
From the dataset, we can identify item visibility as an interval variable. Because item visibility
is interval data, the values must first be grouped into classes of equal width before they are
summarized in a frequency table.

Item visibility    Frequency    Percentage

0-0.08             174          70%
0.08-0.16          60           24%
0.16-0.24          11           4%
0.24-0.32          4            2%
Total              249

Table 3: Frequency distribution of item visibility

The grouped data above can be plotted as a histogram that displays the frequency of each class.

Analysis: The data range from Min = 0 to Max = 0.32. The histogram is right-skewed: most item
visibility values fall in the 0-0.08 class.

Fig. 3: Histogram of item visibility
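
The grouping step can be sketched as follows. The class boundaries are those of Table 3, but the `sample` values are hypothetical and only illustrate the procedure; the function name `class_frequencies` is likewise an assumption, not something from the dataset.

```python
from bisect import bisect_right

# Class boundaries taken from Table 3: four classes of width 0.08.
EDGES = [0.0, 0.08, 0.16, 0.24, 0.32]


def class_frequencies(values, edges):
    """Count how many values fall in each class [edges[i], edges[i+1])."""
    counts = [0] * (len(edges) - 1)
    for v in values:
        if not edges[0] <= v <= edges[-1]:
            raise ValueError(f"{v} lies outside the class range")
        # A value equal to a boundary goes to the upper class;
        # the maximum itself is kept in the last class.
        idx = min(bisect_right(edges, v) - 1, len(counts) - 1)
        counts[idx] += 1
    return counts


# Hypothetical visibility values, for illustration only (not from the dataset).
sample = [0.016, 0.019, 0.0, 0.054, 0.128, 0.075, 0.21, 0.30, 0.09, 0.03]
counts = class_frequencies(sample, EDGES)

for low, high, n in zip(EDGES, EDGES[1:], counts):
    print(f"{low:.2f}-{high:.2f}  {n:3d}  {100 * n / len(sample):3.0f}%")
```

Applied to the full item visibility column, the same function would give the class frequencies that a histogram displays as bar heights.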

Data Source: https://www.kaggle.com/mragpavank/big-mart-sales-dataset


Ratio:
From the dataset, we can identify price (Item_MRP) and sales (Item_Outlet_Sales) as ratio
variables. A sample of paired values is listed in the table below.

Item_MRP Item_Outlet_Sales
249.8092 3735.138
48.2692 443.4228
141.618 2097.27
53.8614 994.7052
51.4008 556.6088
57.6588 343.5528
96.9726 1076.5986
144.1102 2187.153
145.4786 1589.2646
119.6782 2145.2076
196.4426 1977.426
115.3492 1621.8888
54.3614 718.3982
113.2834 2303.668
230.5352 2748.4224
250.8724 3775.086
45.906 838.908
42.3112 1065.28
39.1164 308.9312

Table 4: Sample of item price (Item_MRP) and outlet sales (Item_Outlet_Sales)

Fig. 4: Scatter diagram of outlet sales against item price

Data Source: https://www.kaggle.com/mragpavank/big-mart-sales-dataset


Analysis: The data above can be plotted as a scatter diagram with a linear trend line, which
shows a positive correlation between item price and outlet sales.
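
The direction of the relationship can be checked numerically from the values in Table 4. The sketch below computes the Pearson correlation coefficient and an ordinary least-squares line; this is one common way to obtain a linear trend line and may differ from the method used to draw Fig. 4. The function name `pearson_r_and_line` is illustrative.

```python
from math import sqrt

# Paired values copied from Table 4.
mrp = [249.8092, 48.2692, 141.618, 53.8614, 51.4008, 57.6588, 96.9726,
       144.1102, 145.4786, 119.6782, 196.4426, 115.3492, 54.3614, 113.2834,
       230.5352, 250.8724, 45.906, 42.3112, 39.1164]
sales = [3735.138, 443.4228, 2097.27, 994.7052, 556.6088, 343.5528,
         1076.5986, 2187.153, 1589.2646, 2145.2076, 1977.426, 1621.8888,
         718.3982, 2303.668, 2748.4224, 3775.086, 838.908, 1065.28, 308.9312]


def pearson_r_and_line(x, y):
    """Return (r, slope, intercept) for the least-squares line y = slope*x + intercept."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    r = sxy / sqrt(sxx * syy)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return r, slope, intercept


r, slope, intercept = pearson_r_and_line(mrp, sales)
print(f"r = {r:.3f}, trend line: sales = {slope:.2f} * MRP + {intercept:.2f}")
```

A positive `r` together with a positive `slope` corresponds to the upward-sloping trend line described in the analysis.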

Conclusion:
The data type of each variable was identified, a graph suited to that data type was plotted, and
information was inferred from each graph.
