You are on page 1of 10

Overview of Data Science

Data Science and Visualization


20AD2202
Data Science
A powerful new approach to
make discoveries from data

An automated way to analyze


enormous amounts of data and
extract information

A new discipline that combines the


aspects of statistics, mathematics,
programming, and visualization to turn
data into information

Data Science and Visualization


[20AD2202]
Domain Expertise and Scientific
Methods
 Data Scientists collect, explore, analyze, and
visualize data. They apply mathematical and
statistical models to find patterns and solutions in
the data.

 Data analysis can be:


◦ Descriptive: Study a dataset to decipher the details
◦ Predictive: Create a model based on existing information to
predict outcome and behavior
◦ Prescriptive: Suggest actions for a given situation using the
collected information

Data Science and Visualization


[20AD2202]
Data Processing and Analytics
 Modern tools and technologies have made data
processing and analytics faster and efficient.

 These technologies help Data Scientists to:


◦ Build and train machine learning models
◦ Manipulate data with technology
◦ Build data tools, applications, and services
◦ Extract information from data

 Data science without mathematics and Statistics may lead to


wrong interpretation of data

Data Science and Visualization


[20AD2202]
Skill Set of a Data Scientist
 A Data Scientist should be able to:
◦ Ask the right questions
◦ Understand data structure
◦ Interpret and wrangle data
◦ Apply statistical and mathematical methods
◦ Visualize data and communicate with stakeholders
◦ Work as a team player

Data Science and Visualization


[20AD2202]
WHAT IS DATA SCIENCE?
 Data science starts with data, which can
range from a simple array of a few numeric
observations to a complex matrix of millions
of observations with thousands of variables.

 Data science utilizes certain specialized


computational methods in order to discover
meaningful and useful structures within a
dataset.

Data Science and Visualization


[20AD2202]
WHAT IS DATA SCIENCE?
 The discipline of data science coexists and is
closely associated with a number of related areas
such as database systems, data engineering,
visualization, data analysis, experimentation, and
business intelligence (BI).

 We can further define data science by investigating


some of its key features and motivations

Data Science and Visualization [20AD2202]


Extracting Meaningful Patterns
 Knowledge discovery in databases is the
nontrivial process of identifying valid, novel,
potentially useful, and ultimately
understandable patterns or relationships
within a dataset in order to make important
decisions
 Data science involves inference and iteration

of many different hypotheses.

Data Science and Visualization


[20AD2202]
Data Science Models

Data Science and Visualization


[20AD2202]
3 Vs of Big Data

Data Science and Visualization


[20AD2202]

You might also like