Professional Documents
Culture Documents
CIA 1
Report on Data Analysis & Data Visualization
By
Paul John
To
Prof Helen Josephine
August 2022
Introduction:
The following study was carried out to have a better understanding of data visualisation.
More emphasis is placed on the four measurements (nominal, ordinal, interval, and ratio) and
the classification of the data under these headings as a result. The identified data is then
visualised using a pie chart, bar graph, histogram, and scatter diagram, and an interpretation
is formed.
I chose a supermarket sales dataset for analysis, and data for a nominal, ordinal, interval and
ratio were taken from it. It is critical to preserve such datasheets so the store can track its
performance and determine how efficient it is at offering services.
There are four data measurement scales: Nominal, Ordinal, Interval, and Ratio.
1. Nominal data:
It is defined as data used for naming or labelling variables without any qualitative value.
We can’t say that one dataset is better than the other in nominal data or compare it. There is
no intrinsic to little data.
Total
Central
East
South
West
From the data, we can see which area has the highest sum of profit.
As per the data, we can see that west region has the highest profit volume followed by south
and east with the bit of change in value and the east we have second least volume of profit
2. Ordinal Data
Ordinal data is categorical data with a set order or a scale to it, and they are mutually
exclusive. The ordinal scale is used to represent the ordinal scale. The distinction between the
two groups is uncertain. They are equivalent, and one data can be regarded as superior to the
other. The ranking supplied can be considered ordinal data.
Total
800000
700000
600000
500000
400000
Total
300000
200000
100000
0
Central East South West
Here in this region, we can see the sales figures for the four areas with the highest volume
with the west leading with quite a high margin and the south with the most minor sales in
their region.
3. Interval data
It is defined as a type of data that is measured along a scale, in which each point is placed at
an equal distance from one another. Interval data always appears in the form of numbers.
Here the quantity interval can be seen in the following diagram on how the quantity
demanded is pictured with the help of a histogram as per the data. The most requested is the
sum between 3 and 5 with the highest number of amounts ordered.
4. Rational data
Ratio data is quantitative, having the same properties as interval data with an equal and
definite ratio between each information and absolute zero being treated as a point of origin.
Quantity Vs Sales
16
14
12
10
0
0 1000 2000 3000 4000 5000 6000 7000 8000 9000
In the above diagram, there is a difference between the sales Volume and the quality of items
being sold in the supermarket, The chart used in analysing the data is a scatter diagram. As
can be observed from the graph, we can see that even though the quantity demanded is 12, it
does not guarantee the sales with a profit, but with the amount of 8, the number of deals with
high volume can be achieved.