You are on page 1of 32

Grade VIII

Getting started
with Data
Science

Version 1.0
Chapter 1: Introduction to data

At the end of this chapter, students will understand what is


data. They will know:

• What are qualitative and quantitative data?


• Difference between discrete and continuous data
• Real world applications of data
Chapter 1: What is data?
• We often use the term data to refer
to computer information
• This information is either
transmitted or stored
• Data comes in numerous forms
• Any kind of information may it be
in numbers or text, or pictures is
termed as Data
Types of data

Data comes in different types.


Some of the common types of data
include:
• Text
• Image
• Video
• Numbers
• Spreadsheets
• Sound
Qualitative vs Quantitative data

Qualitative Quantitative

• Qualitative data is the • Quantitative Data is the


data that is a descriptive data that is numerical
piece of information. information
• For example, "What a • For example, “1”, “3.65”
nice day it is" etc.
Quantitative
data can be of
two types
Discrete vs Continuous data
Discrete Continuous

• Can be expressed as a • Can be any value in an


specific value. interval
• For example, “Number of • For example, “The amount
months in a year“, of oxygen in the
“Number of members in a atmosphere”, “Age of
family” etc. members in a family”
Real world applications of data

Predicting interests Getting insights from


of the audience on customer reviews in Effective targeting of
different online video online stores, food the advertisements
streaming platforms delivery apps etc.
Chapter 2: Introduction to data science

At the end of this chapter, students will have a brief


introduction to data science. They will know:
• What is data science?
• Careers in data science
• What data science helps us achieve
What is data science?

Data when investigated


Extracting meaningful
and carefully analyzed,
insights from data is
provides insights which
known as data science.
enriches our daily lives
Careers in data science?
Learning data science offers multiple career options. Some of the common job
titles for data scientists include
• Data Scientist
• Business Intelligence Analyst
• Data Mining Engineer
• Data Architect
• Senior Data Scientist
Careers in data science?
Business Senior Data
Data Scientist Data Engineer Data Architect
Intelligence Analyst Scientist
• Data Scientists are • Business • Data Engineer • Data Architects • Senior Data
data enthusiasts Intelligence examines not only work closely with Scientists
who gather and Analysts use data the Data for their users, system anticipate the
analyze large sets to assess the own business but designers, and business's needs
of structured and market and find also that of third developers to in the future.
unstructured the latest parties. In create a blueprint Although they
data. They business trends in addition to mining that data might not be
analyze, process, the industry. This data, a data management involved in
and model data helps to develop a engineer creates systems use to gathering data,
and later interpret clearer picture of robust algorithms centralize, they play a high-
the results to how a company to help analyze integrate and level role in
create actionable should shape its the data further. maintain the data analyzing it.
plans for strategy. sources.
companies and
organizations.
How data science Simply stated, data science
helps us? helps us answer different types
of questions from data. Some
common questions to ask from
data are:
• Which class does this
belong to - A or B?
• Is this an outlier?
• What will probably be the
value of this variable?
• What should be done now?
Which class does this belong
to - A or B?
The answers to some questions can only
be from a definite number of options.
For example,
Q: Will it rain today?
A: Yes/No
Q: Will the weather be hot or cold?
A: Hot/Cold

To make such predictions, we use a


family of algorithms called classification
algorithms.
Is this an outlier?
In some cases, the objective is to find
outliers or anomalies in data that is
otherwise mostly consistent. Some
examples of anomaly detections are:
Q: Is this email normal or spam?
Q: You are checking your car tyre
pressure. Is the reading normal?
The algorithms that are used for these
types of questions are called anomaly
detection algorithms.
What will probably be the value of this
variable?

There are scenarios in which we must predict


numerical values of a variable based on
historic data. Some examples are:
Q: How much rainfall will we receive this year?
A: 100 mm
Q: How many runs will the winning team
score?
A: 320
The kind of algorithms that can predict these
values are called regression algorithms.
What should be done
now?

This question usually solves the


problems of autonomous robots or
self-driving cars that need to make
decisions based on changes in
external factors. Machine learning
helps to solve such problems with
the help of reinforcement learning.
Chapter 3: Data visualization

At the end of this chapter, students will have a brief introduction to data
visualizations. They will know:
• What is data visualization?
• Examples of data visualization
• The importance of visualization
• Collecting relevant data
• Asking the right question
• Predict an answer
What is data visualization?

The goal of data


Data visualization is
visualization is to Provides a way to see
the representation of
communicate and understand
data or information in
information clearly trends, outliers, and
a graph, chart, or
and efficiently to patterns in data.
other visual formats
users.
Common Common types of data visualizations are:
• Charts
types of data • Graphs
• Tables
visualizations? • Maps
• Histograms
Data visualization using Pie chart
Let us consider, we have the food When visualized using Pie Charts
item preference of 50 students. it will look like below

Food Number of Food preference

item students
Pizza 25 Dosa
30%

Pasta 10 Pizza

Dosa 15
50%

Pasta
20%
Data visualization using Line chart
Below data shows the number of When visualized using Line Chart
students present in the class for one it will look like below
week.
N u m b e r o f s tu d e n ts p re s e n t
Date Number of 60
students present
50
06-Apr 49
07-Apr 42 40

08-Apr 37 30
09-Apr 48
10-Apr 43 20

11-Apr 36 10

12-Apr 50
0
6 -Ap r 7 -Ap r 8 -Ap r 9 -Ap r 1 0 -Ap r 1 1 -Ap r 1 2 -Ap r
Data visualization using Bar chart
Below data shows the number of When visualized using Bar Chart
students present in the class for one it will look like below
week.

Date Number of Number of students present


60
students present
06-Apr 49 50

07-Apr 42 40

08-Apr 37 30
09-Apr 48
10-Apr 43 20

11-Apr 36 10

12-Apr 50 0
6-Apr 7-Apr 8-Apr 9-Apr 10-Apr 11-Apr 12-Apr
Pre-requisites for data collection

Quality of the Completeness Format of


Data of data data
Asking the right question?

• What exactly do you want to find out?


• Which statistical analysis techniques
do you want to apply?
• Classification
• Anomaly detection
• Regression
• Who are the final users of your
analysis results?
• What data visualizations should you
choose?
Chapter 4: Data science and AI

At the end of this chapter, students will have a brief introduction


applications of data science in AI. They will know
• Applications of data science
• Analytics on text data
• Analytics on image data
• Overview of AI
Some applications of data science

Digital Speech
Advertisements Recognition
What is text analytics?

Text analytics can be defined as


the process of collecting
unstructured text from various
sources and analyzing and
extracting relevant information
from it. It can also be used for
transforming it into structured
information that can then be
used in various other ways.
Analytics on image data

Image recognition can be


described as a process by which
we can process images for
identifying people, patterns,
logos, objects, or places.
Overview of AI

Artificial Intelligence is defined as


the science and engineering of
making intelligent machines. AI is a
branch of Computer Science which
deals with the research and design
of intelligent systems that can take
inputs from their environment and
takes actions based on it as a
human being would.
Goals of AI

LOGICAL REASONING KNOWLEDGE PLANNING AND NATURAL LANGUAGE PERCEPTION


REPRESENTATION NAVIGATION PROCESSING

EMERGENT
INTELLIGENCE
Thank You

You might also like