You are on page 1of 8

Data Science Active Learning Report

Data Science Active Learning Report - 2021

T.Y T9 Computer Science

Date of Submission : 4th of June, 2021 (Friday)

Members of the Group :

Rishi Acharya - PE10

1032171681

Chinmay Shrimali - PD02

1032171328

Rushit Patel - PA39

1033181248

Submitted to :

Prof. Shilpa Sonawani ma’am


Data Science Active Learning Report

Problem Statement :

Perform Data Analytics on Bank Marketing Data-set, using seaborn library to

visualize it.

Data Set :

Data set provided to us is a tabular form of data where attributes are distributed columns and

observations in row. Data set contains Fisical, Banking & Personal information of

respective leads.

Data info :

RangeIndex: 45211 entries, 0 to 45210

Data columns (total 17 columns)

Data Type: int64(7), object(10)


Data Science Active Learning Report

Library Used : Seaborn

1.Barplot of Balance and Duration :

It represents an estimate of central tendency for a numeric variable with the height of each rectangle and

provides some indication of the uncertainty around that estimate using error bars

2. Distplot of Balance :

The distplot() function combines the matplotlib hist function with the seaborn kdeplot() and rugplot() functions.
Data Science Active Learning Report

3.Joint Plot of Balance and Duration

Displays a relationship between 2 variables (bivariate) as well as 1D profiles (univariate) in the margins.
This plot is a convenience class that wraps JointGrid. 'hex' bins the data into hexagons with histograms in
the margins.
Data Science Active Learning Report

4. Box plot of Duration and Balance

A box plot (or box-and-whisker plot) shows the distribution of quantitative data in a way that facilitates
comparisons between variables or across levels of a categorical variable.

5. Point Plot of Duration and Balance

A point plot represents an estimate of central tendency for a numeric variable by the position of scatter
plot points and provides some indication of the uncertainty
Data Science Active Learning Report

Group Activity
Data Science Active Learning Report

7
Data Science Active Learning Report

Conclusion

In this activity we analysed the data of marketing for a bank, with the

help of data visualization using the Seaborn library.

We considered Duration and Balance as a key factor for our analyses.

These factors affect the profit cycle of banking institutions.

Considering the wide range of data we neglected some attributes as they

were only informative in nature.

You might also like