
K Nearest Neighbors

Enhancements
Agha Ali Raza

CS535/EE514 – Machine Learning


Sources
• Nearest Neighbor Methods, Victor Lavrenko, University of Edinburgh,
  https://www.youtube.com/playlist?list=PLBv09BD7ez_48heon5Az-TsyoXVYOJtDZ
• Machine Learning for Intelligent Systems, Kilian Weinberger, Cornell, Lecture 2,
  https://www.cs.cornell.edu/courses/cs4780/2018fa/lectures/lecturenote02_kNN.html
• K-Nearest Neighbors Algorithm, Wikipedia,
  https://en.wikipedia.org/wiki/K-nearest_neighbors_algorithm
• Effects of Distance Measure Choice on KNN Classifier Performance: A Review,
  V. B. Surya Prasath et al., https://arxiv.org/pdf/1708.04321.pdf
• A Comparative Analysis of Similarity Measures to find Coherent Documents,
  Mausumi Goswami et al., http://www.ijamtes.org/gallery/101.%20nov%20ijmte%20-%20as.pdf
• A Comparison Study on Similarity and Dissimilarity Measures in Clustering Continuous Data,
  Ali Seyed Shirkhorshidi et al.,
  https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0144059&type=printable
• Cover, Thomas and Hart, Peter. Nearest neighbor pattern classification.
  IEEE Transactions on Information Theory, 1967, 13(1): 21-27.
Parzen Windows and Kernels
3-NN vs. Parzen Window

[Figure: the same test point classified by 3-NN and by a Parzen window of radius R; nearby training points are labeled $y_i = -1$ or $y_i = +1$.]

Parzen Windows
• Classify $x$ by summing the labels of all training points within a ball of radius $R$ around it:

$$f(x) = \operatorname{sgn}\Bigl(\sum_{i:\, x_i \in R(x)} y_i\Bigr) = \operatorname{sgn}\Bigl(\sum_i y_i \cdot \mathbf{1}\bigl[\lVert x_i - x \rVert \le R\bigr]\Bigr)$$

• The indicator $\mathbf{1}[\lVert x_i - x \rVert \le R]$ is a kernel that converts distances from $x$ to numbers.

Ref: Victor Lavrenko, University of Edinburgh
Parzen Windows and Kernels
• Generalize the hard window by substituting an arbitrary kernel $K$:

$$f(x) = \operatorname{sgn}\Bigl(\sum_{i:\, x_i \in R(x)} y_i\Bigr) \quad \text{vs.} \quad f(x) = \operatorname{sgn}\Bigl(\sum_i y_i \, K(x_i, x)\Bigr)$$

• Optionally weight each training point with a coefficient $\alpha_i$:

$$f(x) = \operatorname{sgn}\Bigl(\sum_i \alpha_i \, y_i \, K(x_i, x)\Bigr)$$

Ref: Victor Lavrenko, University of Edinburgh
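To make the formulas concrete, here is a minimal NumPy sketch of both classifiers: the hard Parzen window and a kernel-weighted version. The function names, the Gaussian (RBF) kernel choice, and the toy data are illustrative assumptions, not part of the lecture.

```python
import numpy as np

def parzen_window_predict(X_train, y_train, x, R):
    """Hard window: f(x) = sgn(sum_i y_i * 1[||x_i - x|| <= R])."""
    dists = np.linalg.norm(X_train - x, axis=1)  # distance from x to every x_i
    return np.sign(np.sum(y_train[dists <= R]))  # sum the labels inside the ball

def kernel_predict(X_train, y_train, x, K):
    """Soft version: f(x) = sgn(sum_i y_i * K(x_i, x))."""
    return np.sign(sum(yi * K(xi, x) for xi, yi in zip(X_train, y_train)))

# One possible kernel (an assumption; the slide leaves K abstract):
# a Gaussian that turns the distance ||x_i - x|| into a weight in (0, 1].
def rbf(xi, x, h=1.0):
    return np.exp(-np.linalg.norm(xi - x) ** 2 / (2 * h ** 2))

X = np.array([[0.0, 0.0], [1.0, 0.0], [5.0, 5.0], [6.0, 5.0]])
y = np.array([-1, -1, +1, +1])
print(parzen_window_predict(X, y, np.array([0.5, 0.2]), R=2.0))  # -1.0
print(kernel_predict(X, y, np.array([5.5, 5.1]), rbf))           # 1.0
```

Note one difference from k-NN: the hard window returns 0 when no training point falls inside the ball, while the kernel version avoids this by giving every training point some (possibly tiny) weight.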
Performance of KNN Algorithm
• Time complexity: O(nd)
• Reduce d: dimensionality reduction
• Reduce n: compare to a subset of examples
  o Identify m ≪ n potential near neighbors to compare against: O(md)
• K-D trees: low-dimensional, real-valued data
  o O(d log₂ n); only works when d ≪ n; inexact: can miss neighbors
• Inverted lists: high-dimensional, discrete (sparse) data
  o O(n′d′), where d′ ≪ d and n′ ≪ n; only for sparse data (e.g. text); exact
• Locality-sensitive hashing: high-dimensional, real or discrete data
  o O(n′d), n′ ≪ n; inexact: can miss neighbors
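To see the brute-force vs. tree trade-off above in code, the following sketch uses scikit-learn (an assumption; the slide names no library) to answer the same query both ways. Note that library k-d trees backtrack during search and return exact answers; the "can miss neighbors" caveat applies to the simplified region-only search described on the next slide.

```python
import numpy as np
from sklearn.neighbors import NearestNeighbors

rng = np.random.default_rng(0)
X = rng.random((10_000, 3))      # low-dimensional data, where k-d trees shine

brute = NearestNeighbors(n_neighbors=3, algorithm="brute").fit(X)   # O(nd) scan
tree = NearestNeighbors(n_neighbors=3, algorithm="kd_tree").fit(X)  # pruned search

q = rng.random((1, 3))
print(brute.kneighbors(q)[1])    # indices of the 3 nearest neighbors
print(tree.kneighbors(q)[1])     # same indices, far fewer distance computations
```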
K-D Trees
• Pick a random dimension, find the median, split the data, repeat:
  {(1,9), (2,3), (4,1), (3,7), (5,4), (6,8), (7,2), (8,8), (7,9), (9,6)}
• Query cost: O(d log₂ n)
• E.g. for test point (7,4), compare with all the points in its region
• Can easily miss nearest neighbors

Example ref: Victor Lavrenko, University of Edinburgh, https://www.youtube.com/playlist?list=PLBv09BD7ez_48heon5Az-TsyoXVYOJtDZ
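A minimal sketch of the construction step above, on the slide's ten points. One assumption: instead of picking a random dimension, this sketch cycles through dimensions by depth, a common variant; all names are illustrative.

```python
def build_kdtree(points, depth=0):
    """Split the data at the median of one dimension, then recurse."""
    if not points:
        return None
    axis = depth % len(points[0])                 # cycle through dimensions
    points = sorted(points, key=lambda p: p[axis])
    mid = len(points) // 2                        # median along this axis
    return {
        "point": points[mid],
        "axis": axis,
        "left": build_kdtree(points[:mid], depth + 1),
        "right": build_kdtree(points[mid + 1:], depth + 1),
    }

pts = [(1, 9), (2, 3), (4, 1), (3, 7), (5, 4), (6, 8), (7, 2), (8, 8), (7, 9), (9, 6)]
tree = build_kdtree(pts)
print(tree["point"])  # (6, 8): the median of the first dimension splits the root
```

A query such as (7,4) descends to its leaf region and compares only against the points there, which is why this simple variant can miss a true nearest neighbor lying just across a splitting plane.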
Locality-Sensitive Hashing
• Draw random hyperplanes h₁, …, h_k
• The space is sliced into 2^k regions
  o Polytopes, mutually exclusive
• Compare x only to training points in its region
• Complexity: O(d log n) if k ≈ log n
• Inexact: can miss neighbors
  o Repeat with different sets of hyperplanes
• Why do we need this? With K-D trees in high dimensions, a point can be a close neighbor in d-1 dimensions but still be very far away along the d-th dimension.

Example ref: Victor Lavrenko, University of Edinburgh, https://www.youtube.com/playlist?list=PLBv09BD7ez_48heon5Az-TsyoXVYOJtDZ
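The following sketch shows the hashing step in NumPy; the data, hyperplane count, and function names are illustrative assumptions. Each point gets a k-bit signature recording which side of each hyperplane it falls on, and a query is compared only against points sharing its signature.

```python
import numpy as np
from collections import defaultdict

def signature(x, H):
    """One bit per hyperplane: which side of h_j does x fall on?"""
    return tuple(bool(b) for b in (H @ x >= 0))

rng = np.random.default_rng(0)
n, d, k = 1000, 5, 10                  # k ~ log2(n) random hyperplanes
H = rng.normal(size=(k, d))            # normals of hyperplanes h_1, ..., h_k
X = rng.normal(size=(n, d))

buckets = defaultdict(list)            # region (signature) -> training points
for i, xi in enumerate(X):
    buckets[signature(xi, H)].append(i)

q = rng.normal(size=d)
candidates = buckets[signature(q, H)]  # compare q only to its own region
print(f"{len(candidates)} candidates out of {n}")
```

Because a true neighbor can land just across one hyperplane, the search is repeated with several independent sets of hyperplanes and the candidate sets are combined.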
Inverted Lists
• High-dimensional, sparse data
• New email "account review": look up only the inverted lists for "account" and "review"
• O(d√n), where d is the number of non-zero attributes in the query and √n is the average length of an inverted list
• Exact: does not miss neighbors

Example ref: Victor Lavrenko, University of Edinburgh, https://www.youtube.com/playlist?list=PLBv09BD7ez_48heon5Az-TsyoXVYOJtDZ
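A minimal sketch of the idea, with made-up documents: each word maps to the list of documents containing it, so a query touches only the lists for its own few non-zero attributes.

```python
from collections import defaultdict

docs = {
    0: {"account", "review", "meeting"},
    1: {"lottery", "winner", "account"},
    2: {"project", "review", "deadline"},
    3: {"holiday", "photos"},
}

# Build the inverted lists: word -> set of documents containing it.
inverted = defaultdict(set)
for doc_id, words in docs.items():
    for w in words:
        inverted[w].add(doc_id)

# New email "account review": union two short lists, skip everything else.
query = {"account", "review"}
candidates = set().union(*(inverted[w] for w in query))
print(candidates)  # {0, 1, 2}; document 3 is never touched
```

Only documents sharing at least one word with the query can have non-zero similarity, which is why this method is exact for sparse data.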
For more details, please visit http://aghaaliraza.com

Thank you!
