Welcome to Scribd!

Skip carousel

Introduction To Data Science

Uploaded by

Reeya Chumbar

0% found this document useful (0 votes)

5 views4 pages

Original Title

Introduction to data science

Copyright

Available Formats

DOCX, PDF, TXT or read online from Scribd

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Report this Document

Copyright:

Available Formats

Download as DOCX, PDF, TXT or read online from Scribd

Flag for inappropriate content

0% found this document useful (0 votes)

5 views4 pages

Introduction To Data Science

Uploaded by

Reeya Chumbar

Copyright:

Available Formats

Download as DOCX, PDF, TXT or read online from Scribd

Flag for inappropriate content

Jump to Page

You are on page 1of 4

Search inside document

Introduction to data science:

What is data science?

Data science in a nutshell is solving problems with data!

What is in data science?

Machine learning- Usage and development of algorithms that allow computers to learn and make
predictions

Applied statistics- Hypothesis Testing, Mathematical Modeling, Experimental design

Operational Research- Optimizing processes, resources, decision making, etc. within a business

Information theory- Quantification, storage and communication of information

Data engineering- Building and maintaining robust infrastructure for data pipelines (feeding and
analyzing large volumes of data)

Types of ML

There are three most common types of ML:

Regression (supervised)
Classification (supervised)

Clustering (supervised)

Regression:

Classification:

Clustering:
Basic statistics to describe data:

- When describing numerical data, we may utilize descriptive statistics

- Descriptive statistics provide summary information on the characteristics and distributions of
values within one or more datasets

Descriptive statistics:
- There are three main prominent areas:
- Distribution- frequency of each value occurring within the data
- Central tendency- the averages of the data
- Variability- how spread out the values are from central tendency

Distribution:

- Datasets are made up of distribution values, and we can summarize the frequency of each
possible value using numbers or percentages. This is usually done through a frequency table.
- The simple frequency table represents all values grouped together with their main
categories. We can easily identify the most popular group using this
- The group frequency table creates numerical groupings based on the amount of visits each
person had to the library. We can identify further information on the distribution of values,
ie. Here most people visit the library between 9 and 12 times.

Central tendency:
- Central tendency represents the center, or average of the dataset
- Mean, median and mode are mostly used for finding the average
- Mean: add up all values and divide by the amount of values
- Median: the exact middle value
- Mode: the most commonly found value

Variability:

- Variability tells us how spread out the values within the dataset are
- Range, standard deviation and variance are the most common metrics of variability
- Range- largest value minus smallest value
- Standard deviation- Average amount of variability within the dataset. High SD means high
variability, low SD means low variability
- Variance- Standard Deviation squared

Standard deviation:

- Standard deviation represents the dispersion of values from the mean

- The flatter the curve, the higher standard deviation there is
- The smaller the spread, the lower the standard deviation

Statistical Machine Learning
Document12 pages
Statistical Machine Learning
Deva Hema
100% (1)
Project 10 (Statistics
Document14 pages
Project 10 (Statistics
Arkin Dutta
No ratings yet
Retzlaff Criminal Complaint
Document8 pages
Retzlaff Criminal Complaint
FOX 11 News
100% (1)
Managerial Grid
Document10 pages
Managerial Grid
rony
No ratings yet
C4 Descriptive Statistics
Document34 pages
C4 Descriptive Statistics
NAVANEETH
No ratings yet
Worksheet - Money and Monetary Policy
Document3 pages
Worksheet - Money and Monetary Policy
brian
No ratings yet
Agriculture Science Sba
Document3 pages
Agriculture Science Sba
kevoy peart
No ratings yet
Get Set Go! - Pupils Book - Level 2 PDF
Document96 pages
Get Set Go! - Pupils Book - Level 2 PDF
Gráfico Práctica Digital
No ratings yet
Amazon Inc
Document15 pages
Amazon Inc
abdul basit
No ratings yet
Univariate Bivariate & Multivariate Analysis of Data
Document24 pages
Univariate Bivariate & Multivariate Analysis of Data
Leah Mae Agustin
No ratings yet
73 Azure Security Best Practices Everyone Must Follow - Skyhigh
Document6 pages
73 Azure Security Best Practices Everyone Must Follow - Skyhigh
Rohit Jain
No ratings yet
Module 3 Descriptive Statistics Final
Document15 pages
Module 3 Descriptive Statistics Final
Jordine Umayam
100% (1)
Digital Marketing Health Check Smart Insights Original
Document8 pages
Digital Marketing Health Check Smart Insights Original
Hong Bui
No ratings yet
Matrix - Constitutional Law Review I
Document32 pages
Matrix - Constitutional Law Review I
Dennie Vieve Idea
No ratings yet
Descriptive Statistics: Six Sigma Thinking, #3
From Everand
Descriptive Statistics: Six Sigma Thinking, #3
Sumeet Savant
No ratings yet
Summary Statistics and Visualization Techniques To Explore
Document30 pages
Summary Statistics and Visualization Techniques To Explore
Marshil Shibu
No ratings yet
Week 7 - Food Styling and Photography
Document25 pages
Week 7 - Food Styling and Photography
Kurt Francisco
No ratings yet
DTI Good Practice Guide For PV Installations PDF
Document28 pages
DTI Good Practice Guide For PV Installations PDF
Cliff
No ratings yet
DSBDL Asg 3 Write Up
Document6 pages
DSBDL Asg 3 Write Up
sdaradeyt
No ratings yet
DM 02 01 Data Undrestanding
Document35 pages
DM 02 01 Data Undrestanding
Pallavi Bharti
No ratings yet
Unit 1 - Business Statistics & Analytics
Document25 pages
Unit 1 - Business Statistics & Analytics
k89794
No ratings yet
Unit II
Document18 pages
Unit II
Shyam Maari
No ratings yet
It0089 Finalreviewer
Document143 pages
It0089 Finalreviewer
Karl Erol Pasion
100% (1)
Measures of Central Tendency
Document13 pages
Measures of Central Tendency
castorangela94
No ratings yet
ND RD
Document10 pages
ND RD
Vy Jessie
No ratings yet
Central Tendency: Mode, Median, and Mean
Document15 pages
Central Tendency: Mode, Median, and Mean
Krung Krung
No ratings yet
Business Analytics
Document40 pages
Business Analytics
vaishnavidevi dharmaraj
No ratings yet
Chandru
Document27 pages
Chandru
Sidhant Bhayana
No ratings yet
Research Presentation
Document29 pages
Research Presentation
Avitus Hamutenya
No ratings yet
Summarize Data Sets
Document12 pages
Summarize Data Sets
MBA LOGISTICS & SUPPLY CHAIN MANAGEMENT
No ratings yet
Financial Modelling: Statistical Measures
Document6 pages
Financial Modelling: Statistical Measures
Manfredi Soprani
No ratings yet
Data Analysis Procedure
Document7 pages
Data Analysis Procedure
JenniferTidalgo
No ratings yet
Statistics For Data Science
Document93 pages
Statistics For Data Science
Cesar Lobato
No ratings yet
SINGLE VARIABLE Notes 5.3 Year 10
Document9 pages
SINGLE VARIABLE Notes 5.3 Year 10
primalp6105
No ratings yet
Ai ML
Document2 pages
Ai ML
bharathma411
No ratings yet
Math Notes
Document2 pages
Math Notes
Kana
No ratings yet
Dsbda Unit 2
Document155 pages
Dsbda Unit 2
king maker
No ratings yet
Math4E Week 7 - Lecture 6
Document19 pages
Math4E Week 7 - Lecture 6
John Cris Lustria Püblico
No ratings yet
Summary of Chapter 12 and 13
Document8 pages
Summary of Chapter 12 and 13
Abdul Basit
No ratings yet
STATISTICS
Document4 pages
STATISTICS
Jhianne Estacoja
No ratings yet
Almendralejo Statistics
Document19 pages
Almendralejo Statistics
Rhywen Fronda Gille
No ratings yet
Measures of Central Tendency
Document5 pages
Measures of Central Tendency
MEL ABA
No ratings yet
Measures of Central Tendency: Mean, Mode, Median
Document30 pages
Measures of Central Tendency: Mean, Mode, Median
afnan nabi
No ratings yet
Measures of Dispersion Edit
Document8 pages
Measures of Dispersion Edit
Shimaa Kashef
No ratings yet
Data Analysis: Descriptive Statistics
Document8 pages
Data Analysis: Descriptive Statistics
Rajja Rashad
No ratings yet
Untitled
Document15 pages
Untitled
best barbie
No ratings yet
Measures of Central Tendency Mean, Mode, Median: DR. Hetal Koringa Assistant Professor
Document26 pages
Measures of Central Tendency Mean, Mode, Median: DR. Hetal Koringa Assistant Professor
Hetal Koringa patel
No ratings yet
RSU - Statistics - Lecture 3 - Final - myRSU
Document34 pages
RSU - Statistics - Lecture 3 - Final - myRSU
irina.mozajeva
No ratings yet
Educ 202
Document8 pages
Educ 202
Chris Patlingrao
No ratings yet
Contents UNIT 42
Document21 pages
Contents UNIT 42
zainabasim2003
No ratings yet
Statistics
Document5 pages
Statistics
Elene Grace Barte
No ratings yet
Presentation On Data Analysis: Submitted by
Document38 pages
Presentation On Data Analysis: Submitted by
Amisha Popli
No ratings yet
ASSIGNMEN4
Document15 pages
ASSIGNMEN4
Harshita Sharma
100% (1)
Data Management
Document48 pages
Data Management
Rikki Mae
No ratings yet
Mean
Document9 pages
Mean
Tejashwi Kumar
No ratings yet
Measures of Central Tendency
Document4 pages
Measures of Central Tendency
api-150547803
No ratings yet
Business Statistics & Analytics For Decision Making Assignment 1 Franklin Babu
Document9 pages
Business Statistics & Analytics For Decision Making Assignment 1 Franklin Babu
franklin
100% (1)
Chapter 2 4 - FBAS
Document17 pages
Chapter 2 4 - FBAS
Jasper Lagrimas
No ratings yet
Unit .......
Document45 pages
Unit .......
Rajan shah
No ratings yet
Ungrouped Data
Document3 pages
Ungrouped Data
zulheymey
No ratings yet
Central Tendency Measures BYJU's
Document6 pages
Central Tendency Measures BYJU's
Catherine Fetizanan
No ratings yet
Name: Roll No: Learning Centre: Subject: Mb0040 - Statistics For Management Date of Submission at The Learning Centre
Document23 pages
Name: Roll No: Learning Centre: Subject: Mb0040 - Statistics For Management Date of Submission at The Learning Centre
karkika9235
No ratings yet
Averages 2
Document6 pages
Averages 2
Tagalog, Cyril Dhune C.
No ratings yet
Tutorial 15
Document7 pages
Tutorial 15
Yeong Zi Ying
No ratings yet
Chapter 14
Document24 pages
Chapter 14
sai
No ratings yet
All The Statistical Concept You Required For Data Science
Document26 pages
All The Statistical Concept You Required For Data Science
Ashwin Chaudhari
No ratings yet
It0089 Finalreviewer
Document143 pages
It0089 Finalreviewer
Karl Erol Pasion
No ratings yet
Descriptive Statistics
Document4 pages
Descriptive Statistics
Raghad Al Qweefl
No ratings yet
Kemeng Reviewer
Document8 pages
Kemeng Reviewer
Vinabie Puno
No ratings yet
Statistics For Data Analysis
Document13 pages
Statistics For Data Analysis
عبد الحق
No ratings yet
Data Scaling and Normalization
From Everand
Data Scaling and Normalization
Chuck Sherman
No ratings yet
Admission and Examination System
Document87 pages
Admission and Examination System
MUHAMMAD SALEEM RAZA
No ratings yet
Spoiler-Here Comes The Lady Chef
Document2 pages
Spoiler-Here Comes The Lady Chef
eulea larkaro
No ratings yet
Course Name (Course Code) Student Name (Matric No) : Task
Document1 page
Course Name (Course Code) Student Name (Matric No) : Task
Nur
No ratings yet
4 As
Document4 pages
4 As
Salvador Patosa
100% (1)
CHAPTER IX - Concepts in Hostage Categories of Hostage
Document4 pages
CHAPTER IX - Concepts in Hostage Categories of Hostage
Erica Marie Bagon
No ratings yet
Your Randomly Generated Identity: Maria J. Odom
Document2 pages
Your Randomly Generated Identity: Maria J. Odom
hiwduioqghwdop
No ratings yet
Tulipa Gesneriana and T. Hybrids
Document19 pages
Tulipa Gesneriana and T. Hybrids
Dragan ItakoTo
No ratings yet
Korean Seven Knight Hero Tier List 6
Document9 pages
Korean Seven Knight Hero Tier List 6
GhifarAnshary
No ratings yet
BSI ISO 50001 Case Study Sheffield Hallam University UK EN PDF
Document2 pages
BSI ISO 50001 Case Study Sheffield Hallam University UK EN PDF
facundo
No ratings yet
A Report On Reconstructionism
Document23 pages
A Report On Reconstructionism
Niel Nisperos
100% (1)
(English) The Danger of AI Is Weirder Than You Think - Janelle Shane (DownSub - Com)
Document8 pages
(English) The Danger of AI Is Weirder Than You Think - Janelle Shane (DownSub - Com)
Devansh
No ratings yet
Magis Registration Form: Application Form For Grade 6 & 9
Document3 pages
Magis Registration Form: Application Form For Grade 6 & 9
Khan Aagha
No ratings yet
Proposal Media Partner Geoweek 2021
Document20 pages
Proposal Media Partner Geoweek 2021
Wahyuni
No ratings yet
7 Tariffs and Customs
Document28 pages
7 Tariffs and Customs
HaRry Peregrino
No ratings yet
Pages From 2020-European - Semester - Country-Report-Romania - En-3
Document20 pages
Pages From 2020-European - Semester - Country-Report-Romania - En-3
M
No ratings yet
Inside Glen Eyrie Castle
Document9 pages
Inside Glen Eyrie Castle
Maureen O'Brien
No ratings yet
Science and Health: (Philippine Elementary Learning Competencies) Basic Education Curriculum
Document31 pages
Science and Health: (Philippine Elementary Learning Competencies) Basic Education Curriculum
irish-tuazon-4112
94% (17)
Radio in The UK.
Document26 pages
Radio in The UK.
Mohammed Miah
No ratings yet
Compiler Design Question Bank-UNIT 1
Document12 pages
Compiler Design Question Bank-UNIT 1
Jesudass I
No ratings yet