
K-Nearest Neighbor

Carla P. Gomes
CS4700
1-Nearest Neighbor

One of the simplest of all machine learning classifiers


Simple idea: label a new point the same as the closest known point

(Figure: the new point is labeled red, the label of its closest known point.)

Distance Metrics
Different metrics can change the decision surface

Dist(a,b) = (a1 – b1)^2 + (a2 – b2)^2          vs.          Dist(a,b) = (a1 – b1)^2 + (3a2 – 3b2)^2

Standard Euclidean distance metric:


– Two-dimensional: Dist(a,b) = sqrt((a1 – b1)^2 + (a2 – b2)^2)
– Multivariate: Dist(a,b) = sqrt(∑i (ai – bi)^2)

Adapted from “Instance-Based Learning” lecture slides by Andrew Moore, CMU.
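To make the metric concrete, here is a minimal Python sketch of the multivariate Euclidean distance (the function name euclidean_dist is illustrative; the slides give no code):

import math

def euclidean_dist(a, b):
    """Standard Euclidean distance between two equal-length numeric vectors."""
    return math.sqrt(sum((ai - bi) ** 2 for ai, bi in zip(a, b)))

# Example: the distance between (1, 2) and (4, 6) is 5.0
print(euclidean_dist((1.0, 2.0), (4.0, 6.0)))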

Carla P. Gomes
CS4700
1-NN’s Aspects as an Instance-Based Learner:

A distance metric
– Euclidean
– When different units are used for each dimension,
  normalize each dimension by its standard deviation
– For discrete data, can use Hamming distance:
  D(x1, x2) = number of features on which x1 and x2 differ
– Others (e.g., normal, cosine)

How many nearby neighbors to look at?
– One

How to fit with the local points?
– Just predict the same output as the nearest neighbor (see the sketch below).

Adapted from “Instance-Based Learning” lecture slides by Andrew Moore, CMU.
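A minimal 1-NN sketch along these lines, assuming numeric features that have already been normalized (nn_classify and the toy data are illustrative, not from the slides):

import math

def nn_classify(query, train):
    """1-NN: return the label of the training example closest to the query.
    train is a list of (feature_vector, label) pairs."""
    def dist(a, b):
        return math.sqrt(sum((ai - bi) ** 2 for ai, bi in zip(a, b)))
    _, label = min(train, key=lambda ex: dist(query, ex[0]))
    return label

# The query point is closest to (1.0, 1.1), so it gets that example's label
data = [((1.0, 1.1), "red"), ((5.0, 5.2), "blue")]
print(nn_classify((1.2, 0.9), data))  # red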
k-Nearest Neighbor

Generalizes 1-NN to smooth away noise in the labels


A new point is now assigned the most frequent label of its k nearest
neighbors

(Figure: the same new point is labeled red when k = 3, but blue when k = 7.)
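A minimal sketch of the k-NN rule, extending the 1-NN code above with a majority vote (knn_classify is an illustrative name; the slides give no code):

import math
from collections import Counter

def knn_classify(query, train, k=3):
    """k-NN: majority label among the k training examples closest to the query."""
    def dist(a, b):
        return math.sqrt(sum((ai - bi) ** 2 for ai, bi in zip(a, b)))
    neighbors = sorted(train, key=lambda ex: dist(query, ex[0]))[:k]
    return Counter(label for _, label in neighbors).most_common(1)[0][0]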


KNN Example
    Food      Chat  Fast  Price   Bar   BigTip
    (3)       (2)   (2)   (3)     (2)
1   great     yes   yes   normal  no    yes
2   great     no    yes   normal  no    yes
3   mediocre  yes   no    high    no    no
4   great     yes   yes   normal  yes   yes

Similarity metric: Number of matching attributes (k=2)


New examples:
– Example 1 (great, no, no, normal, no) → Yes
  Most similar: number 2 (1 mismatch, 4 match) → yes
  Second most similar: number 1 (2 mismatch, 3 match) → yes
– Example 2 (mediocre, yes, no, normal, no) → Yes/No
  Most similar: number 3 (1 mismatch, 4 match) → no
  Second most similar: number 1 (2 mismatch, 3 match) → yes
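A sketch of this worked example, using the matching-attributes similarity and a k = 2 vote (the function and variable names are mine, not from the slides):

from collections import Counter

# (Food, Chat, Fast, Price, Bar) -> BigTip, rows 1–4 of the table above
examples = [
    (("great", "yes", "yes", "normal", "no"), "yes"),
    (("great", "no", "yes", "normal", "no"), "yes"),
    (("mediocre", "yes", "no", "high", "no"), "no"),
    (("great", "yes", "yes", "normal", "yes"), "yes"),
]

def matches(a, b):
    """Similarity = number of attributes on which a and b agree."""
    return sum(ai == bi for ai, bi in zip(a, b))

def knn_vote(query, data, k=2):
    neighbors = sorted(data, key=lambda ex: matches(query, ex[0]), reverse=True)[:k]
    return Counter(label for _, label in neighbors).most_common()

print(knn_vote(("great", "no", "no", "normal", "no"), examples))      # [('yes', 2)] -> Yes
print(knn_vote(("mediocre", "yes", "no", "normal", "no"), examples))  # [('no', 1), ('yes', 1)] -> tie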


Selecting the Number of Neighbors

Increase k:
– Makes KNN less sensitive to noise

Decrease k:
– Allows capturing finer structure of space

Pick k not too large, but not too small (depends on the data); see the sketch below for one way to choose it.
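One common way to pick k, not spelled out on the slide, is to compare accuracy on a held-out validation set; a minimal sketch that reuses knn_classify from the earlier code (choose_k and candidate_ks are my own names):

def choose_k(train, validation, candidate_ks=(1, 3, 5, 7, 9)):
    """Return the candidate k whose k-NN predictions score best on the validation set.
    Reuses knn_classify defined in the earlier sketch."""
    def accuracy(k):
        return sum(knn_classify(x, train, k) == y for x, y in validation) / len(validation)
    return max(candidate_ks, key=accuracy)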

Curse of Dimensionality

Prediction accuracy can quickly degrade when the number of attributes grows.
– Irrelevant attributes easily “swamp” information from relevant attributes
– When there are many irrelevant attributes, the similarity/distance measure becomes less reliable

Remedies
– Try to remove irrelevant attributes in a pre-processing step
– Weight attributes differently (see the sketch below)
– Increase k (but not too much)
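As an illustration of the attribute-weighting remedy (weighted_dist and the example weights are hypothetical, not from the slides):

import math

def weighted_dist(a, b, weights):
    """Euclidean distance with a per-attribute weight; a weight of 0 ignores that attribute."""
    return math.sqrt(sum(w * (ai - bi) ** 2 for ai, bi, w in zip(a, b, weights)))

# Down-weight the third attribute, presumed irrelevant
print(weighted_dist((1.0, 2.0, 9.0), (1.5, 2.5, 0.0), weights=(1.0, 1.0, 0.0)))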

Advantages and Disadvantages of KNN

Need a distance/similarity measure and attributes that “match” the target function.

For large training sets, each classification requires a pass through the entire dataset, which can be prohibitive.

Prediction accuracy can quickly degrade when the number of attributes grows.

Simple to implement algorithm;
Requires little tuning;
Often performs quite well!
(Try it first on a new learning problem.)
