
Subunit - 2.

4
Data Exploration
● The goal of data exploration is to learn about characteristics and potential
problems of a data set without the need to formulate assumptions about the
data beforehand.
● To analyse the data, we need to visualise it in some user-friendly format so
that we can:
■ See the trends, relationships and patterns contained within the data.
■ Define a strategy for which model to use at a later stage.
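As a rough illustration of learning about the characteristics and potential problems of a data set, the sketch below summarises a small list of numbers using only the Python standard library. The `temperatures` data and the `summarise` helper are hypothetical and made up for this example; they are not part of the original notes.

```python
import statistics

# Hypothetical sample: daily temperatures with one missing reading (None)
# and one suspicious value (250.0). The data is made up for illustration.
temperatures = [21.5, 22.0, None, 23.1, 250.0, 22.4, 21.9]


def summarise(values):
    """Return basic facts about a list of numbers that may contain None."""
    present = [v for v in values if v is not None]
    return {
        "count": len(values),                    # total number of entries
        "missing": len(values) - len(present),   # entries with no reading
        "min": min(present),
        "max": max(present),
        "mean": statistics.mean(present),
        "median": statistics.median(present),
    }


summary = summarise(temperatures)
for name, value in summary.items():
    print(f"{name}: {value}")
```

A maximum far away from the median, as in this sample, is the kind of potential problem that exploration is meant to surface before any model is chosen.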
Data Visualisation

● Data visualisation refers to the process of representing data visually or
graphically, by using visual elements like charts, graphs, diagrams and maps.
● Data visualisation is important because:
■ It is a powerful way to represent a bulk of data in a collective visual form.
■ It is a way to explore data with presentable results.
■ It becomes easier to see the trends, relationships and patterns in the data
through data visualisation.
● To visualise data in Python, we use the matplotlib.pyplot library.
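A minimal sketch of plotting with matplotlib.pyplot might look like the following. The month numbers, sales figures and output file name are hypothetical values chosen for illustration, and the plot is saved to a file so that the example also runs where no display is available.

```python
import matplotlib.pyplot as plt

# Hypothetical data, made up for illustration.
months = [1, 2, 3, 4, 5]
sales = [120, 135, 128, 150, 165]

plt.figure()                          # start a fresh figure
plt.plot(months, sales, marker="o")   # line graph with a marker at each point
plt.title("Monthly sales")
plt.xlabel("Month")
plt.ylabel("Units sold")
plt.savefig("monthly_sales.png")      # assumed file name; writes the plot as an image
```

In an interactive session, `plt.show()` can be called instead of `plt.savefig(...)` to open the plot in a window.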
Ways to visualise data
Techniques   | Description | Format Examples
Charts       | These use an established pattern or theme for displaying data. These may or may not use axes. | Scatterplot, bubble chart
Graph        | These contain an X and a Y axis, with at least one showing numerical data. | Line graph, pie graph, bar graph, histogram
Maps         | These are used for visualising geospatial and geographic data. | Choropleth, heat map
Temporal     | These display data over a period of time, with a start and a finish time. | Timeline
Network      | These demonstrate data within a network. | Node-link diagram
Infographics | These visually display textual data in multiple aesthetically pleasing ways. | Word cloud
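Three of the formats named above (scatterplot, bar graph and histogram) can be sketched with matplotlib.pyplot as shown below. All data values, subject names and the output file name are hypothetical and made up for illustration.

```python
import matplotlib.pyplot as plt

# Hypothetical data, made up for illustration.
heights = [150, 152, 155, 158, 160, 161, 165, 168, 170, 175]
weights = [45, 48, 50, 54, 55, 58, 62, 65, 68, 74]
subjects = ["Maths", "Physics", "Chemistry"]
marks = [82, 74, 91]

# One figure with three plots placed side by side.
fig, axes = plt.subplots(1, 3, figsize=(12, 4))

# Scatterplot: one point per (height, weight) pair.
axes[0].scatter(heights, weights)
axes[0].set_title("Scatterplot")
axes[0].set_xlabel("Height (cm)")
axes[0].set_ylabel("Weight (kg)")

# Bar graph: one bar per category.
axes[1].bar(subjects, marks)
axes[1].set_title("Bar graph")
axes[1].set_ylabel("Marks")

# Histogram: counts of heights falling into 5 equal-width bins.
axes[2].hist(heights, bins=5)
axes[2].set_title("Histogram")
axes[2].set_xlabel("Height (cm)")
axes[2].set_ylabel("Frequency")

fig.tight_layout()
fig.savefig("chart_types.png")  # assumed file name
```

The bar graph compares separate categories, while the histogram groups a single numerical variable into ranges, which is why the two look similar but answer different questions.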
