You are on page 1of 3

CIA 1

REPORT ON DATA VISUALIZATION


Insurance

Medical Cost Personal Datasets | Kaggle


From the data:
Age – ordinal
Frequenc
bin y
0-10 0
10-20 166
20-30 278
30-40 257
40-50 281
50-60 265
60-70 91
70-80 0
More 0

Frequency
300 278 281
257 265
250
Number of people

200
166
150

100 91

50
0 0 0
0
0-10 10-20 20-30 30-40 40-50 50-60 60-70 70-80 More
Age

Age is continuous variable. Age can be considered as nominal, ordinal or ratio


measurement based on the data interpretation. In the above the age is considered as
ordinal as age is categorized into ranges of 0-10, 10-20, etc., this gives precise number
of people who comes in the which age group range. The above bar graph represents
the number of people who took insurance lie in which age range. The people between
40-50 age range have taken the highest insurance and 60-70 age range has taken least
insurance.
Sex – nominal

Total

male; female;
676 662

Nominal variable does not have any evaluative distinction. Gender is nominal variable
as it is coded as 0 and 1, 1 indicating male and 0 indicating female or vice versa. There
can’t be any other variable other than male and female. The above pie chart represents
the number of female and male who has taken insurance. Male is highest who has
taken insurance.
BMI (Body mass index)-Interval:
Bin Frequency
<18.5 21
19-24.9 221
25-29.9 380
30-34.9 397
35< 319

Histogram
450 397
400 380
319
Number of people

350
300
250 221
200 Frequency
150
100
50 21
0
<18.5 19-24.9 25-29.9 30-34.9 35<
BMI
Body mass index is variable which can be considered as nominal, ordinal or interval.
Typically, it is measured in interval level such the table given above. BMI can be
collapsed into lower-level ordinal categories as obese, overweight, and underweight.
The above histogram represents the BMI of all the people who has taken the
insurance. It is seen that highest number of people BMI range in 30-34.9 who are
mostly obese and almost 21 people are below 18.5 nothing but they are underweight.

Children – ratio

Row Count of
Labels children
0 574
1 324
2 240
3 157
4 25
5 18
(blank)  
Grand
Total 1338

Total
700

600 574

500
Number of people

400
324 Total
300 240
200 157

100
25 18
0
0 1 2 3 4 5 (blank)
Number of childern

The number of children is a discrete data. The number of children is a ratio variable. A
family may not have any children thus, the zero has a definite value. From the above
data we can see that 574 people who bought the insurance doesn’t have children, this
is represented in a bar chart and 18 people have 5 children.

You might also like