
School of Computer Science Engineering and Technology

Program- MCA Type- Core


Course Code- CMCA 523 Course Name- Data Analysis using Python
Year- I Semester- I
Batch- 03

Lab Assignment 5
CO Mapping
Exp No.  Name  CO-1  CO-2  CO-3
5.  Measure of Dispersion, Variance, Standard Deviation, Covariance and Correlation for health_exp data set.  -

Objective: Use the `seaborn` library and the "health_exp" dataset to explore measures of dispersion,
covariance, and correlation as statistical techniques for Data Science.

Task 1: Load the "health_exp" dataset using Seaborn and examine its structure to understand the variables
and data types.
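
A minimal sketch for Task 1 follows. It assumes the dataset meant here is the one seaborn ships as `healthexp` (columns `Year`, `Country`, `Spending_USD`, `Life_Expectancy`), so that `Spending_USD` plays the role of "health_expenditure"; if your copy uses different names, substitute them. The helper names `load_health_data` and `summarize_structure` are illustrative, and the small `demo` frame holds made-up values so the inspection step can be tried offline.

```python
import pandas as pd


def load_health_data(name: str = "healthexp") -> pd.DataFrame:
    """Fetch the dataset through seaborn (downloads it on first use)."""
    import seaborn as sns  # imported lazily so the helper below works without it
    return sns.load_dataset(name)


def summarize_structure(df: pd.DataFrame) -> dict:
    """Collect the facts usually checked first: shape, dtypes, missing values."""
    return {
        "shape": df.shape,
        "dtypes": df.dtypes.astype(str).to_dict(),
        "missing": df.isna().sum().to_dict(),
    }


# Tiny stand-in frame with the assumed column layout (values are made up).
demo = pd.DataFrame(
    {
        "Year": [1970, 1970, 1971],
        "Country": ["Germany", "Japan", "Germany"],
        "Spending_USD": [252.3, 150.4, 298.3],
        "Life_Expectancy": [70.6, 72.0, 70.8],
    }
)

summary = summarize_structure(demo)
print(summary["shape"])
print(summary["dtypes"])
print(summary["missing"])
```

In the lab itself, `summarize_structure(load_health_data())` together with `df.head()` and `df.info()` would give the same overview for the real data.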

Task 2: Calculate the population standard deviation for the "health_expenditure" column. Explain the
significance of this value and how it reflects the dispersion of health expenditure across countries.
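
As a sketch of Task 2: the population standard deviation divides the summed squared deviations by N, which in pandas means passing `ddof=0` (the default is `ddof=1`). The figures below are made up so the result can be checked by hand; in the lab, the Series would be the spending column of the dataset (assumed here to be `Spending_USD`).

```python
import numpy as np
import pandas as pd


def population_std(values: pd.Series) -> float:
    """Population standard deviation: divide by N, not N - 1."""
    return float(values.std(ddof=0))  # pandas defaults to ddof=1, so set it explicitly


# Made-up spending figures, chosen so the arithmetic is easy to check by hand.
spending = pd.Series([2.0, 4.0, 4.0, 4.0, 5.0, 5.0, 7.0, 9.0], name="Spending_USD")

sigma = population_std(spending)
# Same quantity written out from the definition: sqrt(mean((x - mean)^2)).
manual = float(np.sqrt(((spending - spending.mean()) ** 2).mean()))
print(f"population std = {sigma:.2f}, manual = {manual:.2f}")
```

Here the mean is 5 and the squared deviations sum to 32, so the variance is 32 / 8 = 4 and the standard deviation is 2: a typical value lies about 2 units from the mean.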

Task 3: Randomly select a sample of 100 data points from the "health_expenditure" column. Calculate the
sample standard deviation for this sample and discuss why sample standard deviation is important for
studying smaller subsets of data.
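
One possible way to approach Task 3 is sketched below. It draws 100 values without replacement using `Series.sample` and computes the sample standard deviation with `ddof=1` (Bessel's correction). The `spending` Series is synthetic, generated only to make the example runnable; the helper name `sample_and_std` and the fixed seeds are illustrative choices.

```python
import numpy as np
import pandas as pd


def sample_and_std(values: pd.Series, n: int = 100, seed: int = 0):
    """Draw n values without replacement and return (sample, sample std)."""
    sample = values.sample(n=n, random_state=seed)
    return sample, float(sample.std(ddof=1))  # ddof=1 applies Bessel's correction


# Synthetic, right-skewed stand-in for the spending column (not real data).
rng = np.random.default_rng(42)
spending = pd.Series(rng.gamma(shape=2.0, scale=1000.0, size=500), name="Spending_USD")

sample, s = sample_and_std(spending, n=100, seed=0)
print(f"sample size = {len(sample)}, sample std = {s:.1f}")
print(f"population std of full column = {spending.std(ddof=0):.1f}")
```

Fixing `random_state` makes the draw reproducible, which helps when the result has to be verified during the lab. Changing the seed shows how much the sample estimate moves around the full-column value.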

Task 4: Create a histogram using Seaborn to visualize the distribution of "health_expenditure" in the entire
dataset. Interpret the shape of the distribution and how it relates to the standard deviation you calculated
earlier.

Task 5: Generate a box plot using Seaborn to examine the dispersion of "health_expenditure." Discuss what
the box plot reveals about the spread of health expenditure data.
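
For Task 5, a minimal sketch using `seaborn.boxplot` is given below. As before, `Spending_USD`, the helper name `plot_spending_box`, and the synthetic `demo` data are assumptions; replace them with the real dataset and column.

```python
import matplotlib
matplotlib.use("Agg")  # headless backend for scripts; drop this line in a notebook
import matplotlib.pyplot as plt
import numpy as np
import pandas as pd
import seaborn as sns


def plot_spending_box(df: pd.DataFrame, column: str = "Spending_USD"):
    """Horizontal box plot: box = Q1..Q3, line = median, points = outliers."""
    fig, ax = plt.subplots(figsize=(6, 2.5))
    sns.boxplot(data=df, x=column, ax=ax)
    ax.set_title(f"Spread of {column}")
    return ax


# Synthetic, right-skewed stand-in data (not the real dataset).
rng = np.random.default_rng(1)
demo = pd.DataFrame({"Spending_USD": rng.gamma(shape=2.0, scale=1000.0, size=300)})
ax = plot_spending_box(demo)
ax.figure.savefig("spending_box.png", dpi=100)
```

With seaborn's default settings the whiskers extend to the most extreme points within 1.5 × IQR of the box, and anything beyond is drawn as an individual point. Passing `y="Country"` as well would give one box per country for a side-by-side comparison.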

Task 6: Calculate the interquartile range (IQR) for "health_expenditure" and explain its significance in
understanding the dispersion of health expenditure data.
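
Task 6 can be sketched with `Series.quantile`: the IQR is Q3 − Q1, the width of the middle 50% of the data. The example values are made up and include one extreme value to show that the IQR is barely affected by it. The 1.5 × IQR fences are the usual box-plot convention, and the helper name `iqr_summary` is illustrative.

```python
import pandas as pd


def iqr_summary(values: pd.Series) -> dict:
    """Quartiles, IQR, and the conventional 1.5 * IQR outlier fences."""
    q1 = float(values.quantile(0.25))  # linear interpolation by default
    q3 = float(values.quantile(0.75))
    iqr = q3 - q1
    return {
        "q1": q1,
        "q3": q3,
        "iqr": iqr,
        "lower_fence": q1 - 1.5 * iqr,
        "upper_fence": q3 + 1.5 * iqr,
    }


# Made-up values; 100 is a deliberate outlier.
spending = pd.Series([1, 2, 3, 4, 5, 6, 7, 8, 100], dtype=float, name="Spending_USD")

result = iqr_summary(spending)
outliers = spending[(spending < result["lower_fence"]) | (spending > result["upper_fence"])]
print(result)
print(outliers.tolist())
```

For these values Q1 = 3 and Q3 = 7, so the IQR is 4 and the fences are −3 and 13; only the value 100 falls outside them. The standard deviation of the same data would be dominated by that single point, which is why the IQR is reported alongside it.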

Task 7: Calculate the coefficient of variation for the "health_expenditure" column and explain its purpose
in comparing the variability of health expenditure across different countries.
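
A sketch for Task 7: the coefficient of variation is the standard deviation divided by the mean, so it expresses spread relative to the typical level. Whether to use the population (`ddof=0`) or sample (`ddof=1`) standard deviation is a choice the assignment leaves open; it is exposed as a parameter here. The two "countries" and their values are made up, and `Country` / `Spending_USD` are assumed column names.

```python
import pandas as pd


def coefficient_of_variation(values: pd.Series, ddof: int = 1) -> float:
    """CV = std / mean (unitless; multiply by 100 for a percentage)."""
    return float(values.std(ddof=ddof) / values.mean())


# Made-up data: both groups have the same std but very different means.
demo = pd.DataFrame(
    {
        "Country": ["A", "A", "A", "B", "B", "B"],
        "Spending_USD": [10.0, 20.0, 30.0, 100.0, 110.0, 120.0],
    }
)

overall_cv = coefficient_of_variation(demo["Spending_USD"], ddof=0)
cv_by_country = demo.groupby("Country")["Spending_USD"].agg(
    lambda s: coefficient_of_variation(s, ddof=0)
)
print(f"overall CV = {overall_cv:.3f}")
print(cv_by_country.round(3))
```

Both groups have the same standard deviation (about 8.16), yet group A varies by roughly 41% of its mean and group B by about 7%. This is the comparison the CV makes possible when spending levels differ widely between countries.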
Submission Guidelines:

a) The assignment must be verified by the instructor during the lab. (Only submissions on LMS will
be considered.) Organize your code neatly with appropriate comments.

b) Zipped file must be saved as per the format RollNo_Lab# (Example: E21CSE632_Lab1).

c) Write your name and enrolment number inside the assignment file. Without them, your submission
won’t be considered for evaluation.

d) Submit the assignment through your own batch’s submission link on LMS. Submissions made to
another batch’s link will not be checked.
e) Late submission will incur a penalty.
f) Plagiarism will lead to negative grading.
