Professional Documents
Culture Documents
• Machine Learning
▪ Why Machines Learn?
▪ Machine Learning Model
▪ Challenges & Applications
▪ Types
• Key Mathematical/ Statistical Concepts
▪ Scale of Measurements
• Python
• Datasets
What will be the Output of this Lecture? 3
1. Feature Engineering ……
Machine Learning 5
A Pattern Exists
We don’t know it
• Designing algorithms that ingest data and learn a (hypothesized) model of the data
• The learned model can be used to Detect patterns/structures/themes/trends etc. in the data
• ML needed for:
▪ Data-Driven Decision Making
▪ Efficiency and Scale
• Quality of data
• Use of low-quality data leads to the problems related to data preprocessing and feature extraction.
• Time-Consuming task
• Consumption of time especially for data acquisition, feature extraction and retrieval.
• Curse of dimensionality
• Too many features of data points. This can be a real hindrance.
• Difficulty in deployment
• Complexity of the ML model makes it difficult to be deployed in real life.
Applications In Security 9
• Responding to Ransomware
• Combining Application Development and Cybersecurity
• Using Deep Learning to Detect DGA-Generated Domains
• Detecting Non-Malware Threats
• Adaptive Honeypots and Honeytokens
• Deep Reinforcement Learning
• Protecting the IoT
• Predicting the Future
• Linear Regression
• Random Forest
• K-Means Algorithm
• DBSCAN
• Apriori Algorithm
Clustering
Anomaly Detection
Example- Unsupervised Learning 19
Exploration
Exploitation
Examples: Reinforcement Learning… 22
• Shelf Management
ML Versus DL 23
• Data Dependency
• Hardware Dependency
• Feature Engineering
• Problem Solving Approach
• Execution Time
• Interpretability
24
Data is the fact and figures collected together for reference and analysis.
s
Data in Machine Learning 26
Two types of variables based on the type of values that it can take.
Qualitative
Variables can take only particular values: retail store location area, state, city are examples for discrete
variables as it can take only one particular value for a store (here store is our object).
Quantitative
Variables can take any positive or negative numerical value between a large range.
Retail sales amount, insurance claims amounts are examples for continuous variables that can take any number
within large ranges.
• Nominal Scale
▪ Color, Gender, etc.
• Ordinal Scale
▪ Military rank, clothing size, etc.
• Interval Scale
▪ Temperature, IQ rating, etc.
• Ratio Scale
▪ Age, Weight, Height, etc.
Data Handling 28
• Collection
• Analysis
• Interpretation
• Presentation
• Visualization
Sampling Techniques 29
Random Sampling 30
• Measure of Variability/Spread
Measure of Central Tendency 34
• Mean
• Median
• Mode
Data Pre-Processing 35
About Python
Notebook Pandas
Qtconsole Scipy
Orange Matplotlib
Vscode Sklearn
PyCharm Numpy
Python Exercises to solve this week 39
• https://pynative.com/python-exercises-with-solutions/
• https://www.w3resource.com/machine-learning/scikit-learn/iris/index.php
• https://www.practicepython.org/
Next Lecture: Supervised Learning Algorithms 40
• Linear Regression
• Logistic Regression
• Decision Tree
• Random Forest
• Naïve Bayes
Reading Task for this week 41