Welcome to Scribd!

Skip carousel

Introduction To Data Science and Machine Learning

Uploaded by

gb_oprescu

0% found this document useful (0 votes)

15 views21 pages

Very short introduction to Data Science

Original Title

ML Concepts

Copyright

Available Formats

PPTX, PDF, TXT or read online from Scribd

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Report this Document

Very short introduction to Data Science

Copyright:

Available Formats

Download as PPTX, PDF, TXT or read online from Scribd

Flag for inappropriate content

0% found this document useful (0 votes)

15 views21 pages

Introduction To Data Science and Machine Learning

Uploaded by

gb_oprescu

Very short introduction to Data Science

Copyright:

Available Formats

Download as PPTX, PDF, TXT or read online from Scribd

Flag for inappropriate content

Jump to Page

You are on page 1of 21

Search inside document

Introduction to Data Science

and Machine Learning

What is Data Science

A multi disciplinary field of

research that uses scientific
methods, processes and
algorithms to extract information
from data.

Internal Use - Confidential

What do you need to know in DS?

• Algebra (data is represented as a matrix)

Math • Calculus (for optimization algorithms)

• Moments, distributions, correlations

Statistics • Hypothesis testing

• Programming languages (R/Python). SQL

Computer Science • Programming algorithms.

• Good understanding of the problem

Domain expertise • Discuss with experienced people

Internal Use - Confidential

What types of data do you know?

Internal Use - Confidential

Unstructured data

Internal Use - Confidential

Structured data

Internal Use - Confidential

What is Machine Learning?

Machine learning (ML) is

the study of computer
algorithms that improve
automatically through
experience and by the use
of data.

Internal Use - Confidential

Types of learning

Internal Use - Confidential

ML Algorithms
- Linear Regression
- Logistic Regresion
Regression - Decision Trees
- SVM
Supervised - Random Forest
Classification - Neural Networks
Machine
- Discriminate Analysis
Learning - KNN
- Hierarchical
Clustering
Unsupervised
- K – Means

Dimension - PCA
Reduction - SVD
- Embeddings

Internal Use - Confidential

Problems you can you solve with ML

Internal Use - Confidential

Neural networks (NN)

• Electrical charge comes through

dendrites.
• Once a certain electrical potential is
reached, the electrical signal propagates
through the axon.
• Axon terminal are connected with other
dendrites.
• The process goes on.

Internal Use - Confidential

NN mathematical analogy

• , … are the input feature (columns, variables)

• are weights for each feature.
• is the linear combination between inputs and
weights.
• ʃ is the activation function.

Internal Use - Confidential

NN – forward propagation

• The model outputs probabilities and we

need to decide if the ‘signal’ will pass
forward.
• The threshold will be useful to decide if
an application is fraud.
• IF the output is larger than 0.5, then it is
a fraud.

X1 X2 X3 X4 Y_hat Y
Num_Empl Capital Financed Assets W Sumator Sigmoid Threshold=0.5 Real fraud
Company 1 2 0 3 0 1 Company 1 3.2 0.039166 0 0
Company 2 5 2 1 2 * -0.9 = Company 2 1.2 -> 0.231475 -> 0 -> 1
Company 3 1 1 2 2 0.4 Company 3 -1.5 0.817574 1 0
-1.2

Internal Use - Confidential

NN - loss

Loss function:
• Depends on weights vector
• It is generally convex, if not on the full domain, at least on
intervals.
• Generates the optimization problem: reduce the loss with
respect to weights.
• Finding the global minimum means finding the best model.

𝑚
1
𝐽 (𝑤)= ∗∑ ¿ ¿ ¿
𝑚 𝑖 =1
Y_hat Y
Threshold=0.5 Real fraud Loss
0 0 0.666667
0 -> 1 ->
1 0
Internal Use - Confidential
NN – forward and backward

Internal Use - Confidential

NN – what to optimize?
Neurons

Hidden layers

Learning Rate

Loss Function
Hyper Parameters
Optimizer

Metric

Dropout

Early stopping

Parameters Weights

Internal Use - Confidential

NN - intuition

Neural networks can learn to approximate any function (with a certain cost).

The essence of supervised learning:

- Get training examples
- Find a function that approximates the real function
- Minimize the loss

Why do we not know the real function?

- Because we do not have the whole data input for a
certain phenomenon.
- Because we are unable to grasp the complexity of a
phenomenon.

Internal Use - Confidential

NN - intuition

Using hidden layers, NN have the possibility to

represent objects in a space with more dimensions
than the original space (with a certain cost).

Internal Use - Confidential

NN – bias and variance

Internal Use - Confidential

Fraud Detection Project

UDB Training data

UDE
29683 36 268
Databases
applications variables frauds
XML

model with 1 2433 AUC

Fraud Tracker
hidden layer parameters 0.9236

Internal Use - Confidential

Challenges
1. Use the same information as people do for deciding upon a fraud.

2. Select most suited variables for the model (tables with hundred of columns).

3. Internal databases were not providing quality data, so use data parsed from XML files.

4. Clean / transform data (quite messy and without explanation).

5. Highly imbalanced data. Fraud prevalence on training of 0.9%.

6. Find a model able to generalize and perform well in real conditions.

Internal Use - Confidential

Intro To Deep Learning
Document39 pages
Intro To Deep Learning
hiperboreoatlantec
No ratings yet
Analytical Problem Solving
Document7 pages
Analytical Problem Solving
Ramón G. Pacheco
50% (2)
CII4Q3 VISI KOMPUTER - Deep Learning - CNN
Document106 pages
CII4Q3 VISI KOMPUTER - Deep Learning - CNN
Zee Ingame
No ratings yet
CII4Q3 - Computer Vision-EAR - Week-11-Intro To Deep Learning v1.0
Document50 pages
CII4Q3 - Computer Vision-EAR - Week-11-Intro To Deep Learning v1.0
Zee Ingame
No ratings yet
2 DNN-CNN-RNN
Document87 pages
2 DNN-CNN-RNN
Salma Hamzaoui
No ratings yet
Machine Learning & Data Mining
Document108 pages
Machine Learning & Data Mining
M A Dipto
No ratings yet
Fintech ML Using Azure
Document51 pages
Fintech ML Using Azure
Vikram Pandya
No ratings yet
C106363GC10 - PRODUCTION - Machine Learning On Autonomous Database A Practical Example
Document24 pages
C106363GC10 - PRODUCTION - Machine Learning On Autonomous Database A Practical Example
Tran Quoc Dung
No ratings yet
Machine Learning Shortnote
Document14 pages
Machine Learning Shortnote
lahiru
No ratings yet
Machine Learning: Mona Leeza Email: Monaleeza - Bukc@bahria - Edu.pk
Document60 pages
Machine Learning: Mona Leeza Email: Monaleeza - Bukc@bahria - Edu.pk
zombiee hook
No ratings yet
Deep Learning Hands On
Document18 pages
Deep Learning Hands On
Mohamed Alhadi
No ratings yet
Data Science Applications and Research Directions
Document38 pages
Data Science Applications and Research Directions
Robin Rohit
100% (1)
2 - Types of Machine Learning
Document26 pages
2 - Types of Machine Learning
Sanyam Gupta
No ratings yet
Neural Networks - Comprehensive Foundation (Introduction)
Document47 pages
Neural Networks - Comprehensive Foundation (Introduction)
Guillermo Orozco
No ratings yet
Convolutional Neural Networks (1) : Geena Kim
Document28 pages
Convolutional Neural Networks (1) : Geena Kim
Huston LAM
No ratings yet
Pengantar Spatial Machine Learning
Document83 pages
Pengantar Spatial Machine Learning
Dimas
No ratings yet
Predicting The Performance of Mechanical Systems Using Machine Learning
Document54 pages
Predicting The Performance of Mechanical Systems Using Machine Learning
Ankur Vishal
No ratings yet
Clustering Techniques: Welcome To
Document52 pages
Clustering Techniques: Welcome To
Aasmi
No ratings yet
Neural - Networks
Document47 pages
Neural - Networks
howgibaa
No ratings yet
3 b41658c776 Artificial Intelligence Unit 1
Document81 pages
3 b41658c776 Artificial Intelligence Unit 1
Vishesh negi
No ratings yet
Untitled
Document128 pages
Untitled
P.V.S. VEERANJANEYULU
No ratings yet
Neural Networks
Document40 pages
Neural Networks
salemamr1010
No ratings yet
Abhi Presentation
Document20 pages
Abhi Presentation
Chella venkannababu
No ratings yet
Foundations of Machine Learning: Module 6: Neural Network
Document22 pages
Foundations of Machine Learning: Module 6: Neural Network
Nishant Tiwari
No ratings yet
Non-Linear Classifiers
Document19 pages
Non-Linear Classifiers
Pooja Patwari
No ratings yet
Bee4333 Intelligent Control: Artificial Neural Network (ANN)
Document76 pages
Bee4333 Intelligent Control: Artificial Neural Network (ANN)
WanM.Syamim
No ratings yet
Bird or Drug
Document42 pages
Bird or Drug
성은문
No ratings yet
Advanced Algorithm Design and Analysis Techniques: Dynamic Programming
Document24 pages
Advanced Algorithm Design and Analysis Techniques: Dynamic Programming
Abraham Alemseged
No ratings yet
House Dzone Refcard 383 Neural Network Essentials
Document5 pages
House Dzone Refcard 383 Neural Network Essentials
Fernando
No ratings yet
l15 16 Autoencoders
Document26 pages
l15 16 Autoencoders
Rajakumar Awaradi
No ratings yet
Deep Learning State of The Art: Amulya Viswambharan ID 202090007 Kehkshan Fatima ID
Document17 pages
Deep Learning State of The Art: Amulya Viswambharan ID 202090007 Kehkshan Fatima ID
Amulya
No ratings yet
IntroductionToSLvsUSL v1.0
Document28 pages
IntroductionToSLvsUSL v1.0
Sukeshan R
No ratings yet
DL Full Merged
Document454 pages
DL Full Merged
Mann chhikara
No ratings yet
Machine Learning Re Defining Semiconductor Industry 1598272842
Document33 pages
Machine Learning Re Defining Semiconductor Industry 1598272842
「瞳」你分享
No ratings yet
Intro
Document38 pages
Intro
Tran Kim Toai
No ratings yet
Introduction To Deep Learning
Document17 pages
Introduction To Deep Learning
Anjaney
No ratings yet
ML1 Foundations
Document39 pages
ML1 Foundations
Gonzalo Contreras
No ratings yet
Lecture 2
Document13 pages
Lecture 2
Aaa aaa
No ratings yet
Day 2
Document58 pages
Day 2
ganareddys
No ratings yet
Introduction To Neurons and Neural Networks by Dr. Maitreyee Dutta Professor, CSE Department
Document43 pages
Introduction To Neurons and Neural Networks by Dr. Maitreyee Dutta Professor, CSE Department
Veeravasantharao Battula
No ratings yet
Day 2 - Sesi 1 - Pengantar ML
Document28 pages
Day 2 - Sesi 1 - Pengantar ML
Dheeka Hani Soeroso
No ratings yet
Neural Networks
Document17 pages
Neural Networks
Varun Bhayana
No ratings yet
Neural Networks and Deep Learning
Document19 pages
Neural Networks and Deep Learning
Nitesh Yadav
No ratings yet
Auto Encoder
Document39 pages
Auto Encoder
Sreetam Ganguly
No ratings yet
Pengantar Datamining: Anto Satriyo Nugroho, DR - Eng
Document33 pages
Pengantar Datamining: Anto Satriyo Nugroho, DR - Eng
Rendy Dwi Anugrah Putra
No ratings yet
Deep Learning
Document80 pages
Deep Learning
extra junk
No ratings yet
Introductory Data Science and ML
Document25 pages
Introductory Data Science and ML
Pareekshith Katti
No ratings yet
3 Deep Learning Overview v3.5
Document85 pages
3 Deep Learning Overview v3.5
Dany Sanchez
No ratings yet
Introduction To Soft Computing
Document18 pages
Introduction To Soft Computing
Sitanath Biswas
No ratings yet
Exploring The Possibilities of Analog Neuromorphic Computing With BrainScaleS
Document21 pages
Exploring The Possibilities of Analog Neuromorphic Computing With BrainScaleS
bareya.eztu
No ratings yet
Lec19 - GANs
Document47 pages
Lec19 - GANs
Yattin Gaur
No ratings yet
Chapter 3
Document33 pages
Chapter 3
Adriano Vianna
No ratings yet
Autoencoder
Document39 pages
Autoencoder
Rivujit Das
No ratings yet
ML Training
Document6 pages
ML Training
shrestha3902
No ratings yet
Deep Learning: Seungsang Oh
Document279 pages
Deep Learning: Seungsang Oh
KaAI Kookmin
No ratings yet
Machine Learning in Medical Health Care
Document47 pages
Machine Learning in Medical Health Care
John Doe
100% (1)
A145286344 23681 24 2018 Tensorflow
Document15 pages
A145286344 23681 24 2018 Tensorflow
Rohit Kolli
No ratings yet
Deeplearning Ai
Document71 pages
Deeplearning Ai
Jian Quan
No ratings yet
6-Neural NT
Document44 pages
6-Neural NT
SAMRIDDHI JAISWAL
No ratings yet
Final
Document30 pages
Final
ayushkukreja30
No ratings yet
Mastering Mathematica®: Programming Methods and Applications
From Everand
Mastering Mathematica®: Programming Methods and Applications
John W. Gray
Rating: 5 out of 5 stars
5/5 (1)
Forrester TEI Study 2021 Final
Document28 pages
Forrester TEI Study 2021 Final
gb_oprescu
No ratings yet
Abhishek Nandy, Manisha Biswas (Auth.) - Reinforcement Learning - With Open AI, TensorFlow and Keras Using Python-Apress (2018)
Document174 pages
Abhishek Nandy, Manisha Biswas (Auth.) - Reinforcement Learning - With Open AI, TensorFlow and Keras Using Python-Apress (2018)
DineshKumarAzad
No ratings yet
A Guide To Starting A Career: in Cyber Security
Document16 pages
A Guide To Starting A Career: in Cyber Security
Ovidiu Costea
100% (2)
Deep Learning Tutorial Release 0.1
Document173 pages
Deep Learning Tutorial Release 0.1
lerhlerh
No ratings yet
Toward State-of-the-Art Deep Learning in R: Darch 1.0
Document142 pages
Toward State-of-the-Art Deep Learning in R: Darch 1.0
gb_oprescu
No ratings yet
Data Driven Strategies For Writing Effective Titles and Headlines
Document28 pages
Data Driven Strategies For Writing Effective Titles and Headlines
Simona Bartic
No ratings yet
Kinetica User Manual
Document768 pages
Kinetica User Manual
thientinbui
No ratings yet
Lungimea Zilei
Document2 pages
Lungimea Zilei
gb_oprescu
No ratings yet
Visual Inquiry Lesson: Pontiac's Rebellion
Document3 pages
Visual Inquiry Lesson: Pontiac's Rebellion
Nicole McIntyre
No ratings yet
Bedtime Procrastination Introducing A New Area of Procrastination
Document8 pages
Bedtime Procrastination Introducing A New Area of Procrastination
Fercho Med
No ratings yet
Human Language
Document14 pages
Human Language
emanuelanrom
100% (1)
Article Review 1
Document3 pages
Article Review 1
Jnb Chioco
No ratings yet
Opportunities For Ict in Stage 1 Cambridge Primary Science Guide p.110
Document1 page
Opportunities For Ict in Stage 1 Cambridge Primary Science Guide p.110
YAS Learning Center School
No ratings yet
Eliminating Differences in Perception
Document1 page
Eliminating Differences in Perception
Sourabh Raorane
No ratings yet
Daily Lesson Plan Subject: English Language WEEK: 27 Date Class Chapter/Unit/ Topic/ Theme/ Subtopic Learning Objective (S)
Document11 pages
Daily Lesson Plan Subject: English Language WEEK: 27 Date Class Chapter/Unit/ Topic/ Theme/ Subtopic Learning Objective (S)
power rangerdi l Ren
No ratings yet
Chapter 12 - Principle of Management
Document40 pages
Chapter 12 - Principle of Management
Amsavalli Sella
100% (1)
2nd Quarter MAPEH 7-Week5
Document3 pages
2nd Quarter MAPEH 7-Week5
Gilbert Obing
No ratings yet
Kornell Bjork - 2009 PDF
Document20 pages
Kornell Bjork - 2009 PDF
Rizki Fadhilah
No ratings yet
Examining Metacognitive Performance Between Skilled and Unskilled Writers in An Integrated EFL Writing Class
Document16 pages
Examining Metacognitive Performance Between Skilled and Unskilled Writers in An Integrated EFL Writing Class
devy
No ratings yet
Sanisa Latifah - Thesis Analysis RPW
Document3 pages
Sanisa Latifah - Thesis Analysis RPW
vn ryn
No ratings yet
Collegial Educational Management Model - The Face of Improved Management Practice
Document16 pages
Collegial Educational Management Model - The Face of Improved Management Practice
JAKE YAO
No ratings yet
Lecture Notes 3
Document5 pages
Lecture Notes 3
fgsfgs
No ratings yet
Conceptual Blockbusting - A Guide To Better Ideas
Document16 pages
Conceptual Blockbusting - A Guide To Better Ideas
Cristi Cosac
75% (4)
Reaction Paper
Document5 pages
Reaction Paper
yvonne
No ratings yet
Impact of On-The-Job Training To Hotel and Restaurant Services Students of Higher School of The University of Makati
Document35 pages
Impact of On-The-Job Training To Hotel and Restaurant Services Students of Higher School of The University of Makati
Jed Valdez
No ratings yet
English Olympiad Prefix and Suffix Grammar Questions 5th Grade
Document2 pages
English Olympiad Prefix and Suffix Grammar Questions 5th Grade
SuvashreePradhan
No ratings yet
The Effect of Novel Attributes On Product Evaluation: Ashesh Mukherjee Wayne D. Hoyer
Document11 pages
The Effect of Novel Attributes On Product Evaluation: Ashesh Mukherjee Wayne D. Hoyer
Manoj Kumar
No ratings yet
The Physics of Art and The Art of Physics
Document1 page
The Physics of Art and The Art of Physics
gcschmit
No ratings yet
Project REACH - 2022
Document4 pages
Project REACH - 2022
Elyka Sata
No ratings yet
Edu 220 Cooperative Learning Lesson Plan
Document6 pages
Edu 220 Cooperative Learning Lesson Plan
api-533889645
No ratings yet
Group 2
Document8 pages
Group 2
Borela Monique
No ratings yet
Art As A Thinking Process - C.V. - A Technical Efflorescence
Document12 pages
Art As A Thinking Process - C.V. - A Technical Efflorescence
big_jah
No ratings yet
Pec 7 Week 2
Document6 pages
Pec 7 Week 2
Jason Binondo
0% (1)
Bernabe - Easy Audiovisual Content For All Draft
Document172 pages
Bernabe - Easy Audiovisual Content For All Draft
Daniela Souza
No ratings yet
Good Shepherd Convent Case Study
Document7 pages
Good Shepherd Convent Case Study
api-270714689
No ratings yet
Micromanagement Assignment
Document3 pages
Micromanagement Assignment
rue
No ratings yet
Unit 2 Role of Guidance and Counselling Personnel
Document5 pages
Unit 2 Role of Guidance and Counselling Personnel
Ahmad Shah
No ratings yet