
Artificial Intelligence

k - Nearest Neighbors

Pham Viet Cuong


Dept. Control Engineering & Automation, FEEE
Ho Chi Minh City University of Technology
k - Nearest Neighbors
ü A type of supervised ML algorithm
ü Can be used for both classification and regression
ü Lazy learning algorithm

k - Nearest Neighbors
ü Uses ‘feature similarity’ to predict the values of new data points
ü The new data point will be assigned a value based on how closely it
matches the points in the training set

k - Nearest Neighbors
ü The kNN algorithm requires
v An integer k
v A set of labeled examples (training data)
v A metric to measure “closeness”
ü Example 1: Classification
v 2D
v 2 classes
v k=3
v Euclidean distance
v 2 sea bass, 1 salmon → majority vote: sea bass (worked example below)
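A minimal worked version of this example, assuming hypothetical (length, lightness) measurements, since the actual values behind the slide's figure are not available:

```python
import numpy as np
from collections import Counter

# Hypothetical (length, lightness) measurements; these numbers are for
# illustration only and do not come from the slide's figure.
X_train = np.array([[4.0, 2.0], [4.5, 2.2], [5.0, 2.5],    # sea bass
                    [2.0, 6.0], [2.5, 6.5], [3.0, 5.8]])   # salmon
y_train = ['sea bass', 'sea bass', 'sea bass', 'salmon', 'salmon', 'salmon']

x_new = np.array([3.5, 4.0])                   # unlabeled fish
d = np.linalg.norm(X_train - x_new, axis=1)    # Euclidean distance to each point
nearest = np.argsort(d)[:3]                    # k = 3 nearest neighbors
votes = Counter(y_train[i] for i in nearest)   # here: 2 sea bass, 1 salmon
print(votes.most_common(1)[0][0])              # majority vote -> 'sea bass'
```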

k - Nearest Neighbors
ü Example 2: Classification
v 2D
v Three classes
v k=5
v Euclidean distance

k - Nearest Neighbors
ü Example 3: Classification
v Three-class 2D problem
v Non-linearly separable
v k=5
v Euclidean distance

k - Nearest Neighbors
ü Example 4: Classification
v Three-class 2D problem
v Non-linearly separable
v k=5
v Euclidean distance

k - Nearest Neighbors
ü Algorithm
v Step 1: Load training data and test data
v Step 2: Choose k
v Step 3:
Ø Calculate the distance between the test point and every training point
Ø Identify the k nearest neighbors
Ø Use the class labels of the nearest neighbors to determine the class label of the test data (e.g., by majority vote; see the sketch below)
v Step 4: End
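A minimal NumPy sketch of Steps 1–4 (brute-force search, Euclidean distance, majority vote); the function and toy data below are illustrative, not taken from the slides:

```python
import numpy as np
from collections import Counter

def knn_predict(X_train, y_train, x_test, k=3):
    """Classify one test point by majority vote among its k nearest neighbors."""
    # Step 3: distance from the test point to every training point
    distances = np.linalg.norm(X_train - x_test, axis=1)
    # Step 3: indices of the k nearest neighbors
    neighbor_idx = np.argsort(distances)[:k]
    # Step 3: majority vote over the neighbors' class labels
    votes = Counter(y_train[i] for i in neighbor_idx)
    return votes.most_common(1)[0][0]

# Steps 1-2: load (toy) training data and choose k
X_train = np.array([[1.0, 1.0], [1.2, 0.8], [6.0, 6.2], [5.8, 6.0]])
y_train = ['A', 'A', 'B', 'B']
print(knn_predict(X_train, y_train, np.array([1.1, 0.9]), k=3))  # -> 'A'
```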

k - Nearest Neighbors
ü How to choose k?
v If an infinite number of samples were available, the larger k is, the better
v In practice, the number of samples is finite
v Rule of thumb: k = sqrt(n), where n is the number of examples (example below)
v k = 1 is efficient, but can be sensitive to “noise”
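A small illustration of the rule of thumb; rounding to an odd k (to reduce ties in two-class voting) is a common convention, not something stated on the slide:

```python
import math

n = 100                    # number of training examples
k = round(math.sqrt(n))    # rule of thumb: k = sqrt(n) -> 10
if k % 2 == 0:
    k += 1                 # prefer an odd k to avoid ties with two classes
print(k)                   # -> 11
```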

k - Nearest Neighbors
ü How to choose k?
v Larger k may improve performance, but too large k destroys locality
v Smaller k: higher variance (less stable)
v Larger k: higher bias (less precise)

k - Nearest Neighbors
ü How well does kNN work?
v If we have lots of samples, kNN works well

k - Nearest Neighbors
ü Minkowski distance (formulas below)

ü Manhattan distance

ü Euclidean distance
ü Chebyshev distance
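The formulas themselves did not survive extraction from the slides; the standard definitions, with Manhattan, Euclidean, and Chebyshev as the p = 1, 2, and ∞ cases of the Minkowski distance, are:

```latex
\begin{aligned}
\text{Minkowski:}\quad & d_p(\mathbf{x},\mathbf{y}) = \Big(\textstyle\sum_{i=1}^{n} |x_i - y_i|^{p}\Big)^{1/p} \\
\text{Manhattan } (p=1):\quad & d_1(\mathbf{x},\mathbf{y}) = \textstyle\sum_{i=1}^{n} |x_i - y_i| \\
\text{Euclidean } (p=2):\quad & d_2(\mathbf{x},\mathbf{y}) = \sqrt{\textstyle\sum_{i=1}^{n} (x_i - y_i)^2} \\
\text{Chebyshev } (p\to\infty):\quad & d_\infty(\mathbf{x},\mathbf{y}) = \max_i |x_i - y_i|
\end{aligned}
```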

k - Nearest Neighbors
ü Best distance?

k - Nearest Neighbors
ü Euclidean distance

v Euclidean distance treats each feature as equally important


v However, some features (dimensions) may be much more discriminative than others

k - Nearest Neighbors
ü Euclidean distance

k - Nearest Neighbors
ü Feature normalization
v Linearly scale each feature to zero mean and unit variance (sketch below)
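A minimal sketch of this z-score scaling, with the mean and variance estimated on the training set and applied to any data used for kNN distances (names are illustrative):

```python
import numpy as np

def zscore_fit(X_train):
    """Per-feature mean and standard deviation estimated from the training set."""
    mu = X_train.mean(axis=0)
    sigma = X_train.std(axis=0)
    sigma[sigma == 0] = 1.0            # guard against constant features
    return mu, sigma

def zscore_apply(X, mu, sigma):
    """Linearly scale features to zero mean and unit variance."""
    return (X - mu) / sigma

X_train = np.array([[150.0, 0.2], [180.0, 0.4], [120.0, 0.9]])
mu, sigma = zscore_fit(X_train)
X_train_scaled = zscore_apply(X_train, mu, sigma)   # use these for kNN distances
```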

k - Nearest Neighbors
ü Feature weighting
v Scale each feature by a weight wi reflecting its importance for classification (see the weighted-distance sketch below)
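One way to realize this is a weighted Euclidean distance, where each squared feature difference is scaled by its weight wi; the weights below are hypothetical and would in practice come from feature-importance estimates or validation:

```python
import numpy as np

def weighted_euclidean(x, y, w):
    """Euclidean distance with non-negative per-feature weights w_i."""
    diff = x - y
    return np.sqrt(np.sum(w * diff ** 2))

w = np.array([3.0, 1.0, 0.1])          # hypothetical importance weights
x = np.array([1.0, 2.0, 3.0])
y = np.array([0.0, 2.5, 7.0])
print(weighted_euclidean(x, y, w))     # ~ 2.20
```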

k - Nearest Neighbors
ü Computational complexity
v The basic kNN algorithm stores all training examples
v Classification becomes very expensive for a large number of samples

k - Nearest Neighbors
ü kNN - a lazy learning algorithm
v Defers data processing until it receives a request to classify unlabeled
data
v Replies to a request for information by combining its stored training
data
v Discards the constructed answer and any intermediate results
v Lazy algorithms have lower computational cost than eager algorithms during training, but greater storage requirements and higher computational cost at recall

k - Nearest Neighbors
ü Advantages
v Can be applied to data from any distribution
v Very simple and intuitive
v Good classification if the number of samples is large enough
v Uses local information, which can yield highly adaptive behavior
v Very easy to parallelize
ü Disadvantages
v Choosing k may be tricky
v Test stage is computationally expensive
v Needs a large number of samples for good accuracy
v Large storage requirements
v Highly susceptible to the curse of dimensionality
k - Nearest Neighbors
ü Sources:
v https://www.csd.uwo.ca/courses/CS4442b/L3-ML-knn.pdf
v http://research.cs.tamu.edu/prism/lectures/pr/pr_l8.pdf
v http://web.iitd.ac.in/~bspanda/KNN%20presentation.pdf
v V. B. Surya Prasath et al., “Effects of Distance Measure Choice on KNN Classifier Performance: A Review,” Big Data, vol. 7, doi: 10.1089/big.2018.0175

