Professional Documents
Culture Documents
Calculating Dissimilarities PDF
Calculating Dissimilarities PDF
Calculating Dissimilarities PDF
SAMPLE: You need to measure the dissimilarity between 3 students based on 4 different grades
Name
HS Final
Grade
95
95
95
Anna
Mark
Philip
Standardized
Aptitude Test Grade
84
87
88
Manhattan Distance:
d (Anna, Mark) = |95-95| + |84-87| + |90-95| + |88-90|
=0+3+5+2
= 10
d (Anna, Philip) = |95-95| + |84-88| + |90-90| + |88-90|
=0+4+0+2
=6
d (Mark, Philip) = |95-95| + |87-88| + |95-90| + |90-90|
=0+1+5+0
=6
Dissimilarity Matrix for Manhattan Distance:
Anna
Mark
Philip
Anna
0
10
6
Mark
Philip
0
6
Euclidean Distance:
d (Anna, Mark) = sqrt ( (95-95)2 + (84-87)2 + (90-95)2 + (88-90)2 )
= sqrt ( 0 + 9 + 25 + 4 )
= 6.164
d (Anna, Philip) = ???
= ???
= ???
d (Mark, Philip) = sqrt( (95-95)2 + (87-88)2 + (95-90)2 + (90-90)2 )
= sqrt( 0 + 1 + 25 + 0 )
= 5.099
Dissimilarity Matrix for Euclidean Distance:
Anna
Mark
Philip
Anna
0
6.164
???
Mark
Philip
0
5.099
Math Proficiency
Grade
88
90
90
Anna
Mark
Philip
Anna
0
5
4
Mark
Philip
0
???
d (Martin, Adrian)
d (Martin, Julie)
Mary
Adrian
Julie
0
???
???
0
???
Philip
Normalize the numeric table to reflect values as a transformed value within [0.0, 1.0] (The easiest way
to normalize is to get the value as a percentage of the maximum. There are however, other methods to
normalize values)
Anna
Mark
Philip
Anna
0
Mark
10/10 = 1.0
0
Philip 6/10 = 0.6
6/10 = 0.6 0
Numeric (weight = 1)
Anna Mark
Anna
0
Mark
5
0
Philip 1
2
Philip
Normalize the numeric table to reflect values as a transformed value within [0.0, 1.0] (The easiest way
to normalize is to get the value as a percentage of the maximum. There are however, other methods to
normalize values)
Anna
Mark
Philip
Anna
0
5/5 = 1.0
1/5 = 0.2
Ordinal (weight = 1)
Anna Mark
Anna
0
Mark
0.5
0
Philip 0.5
1
Mark
Philip
0
2/5 = 0.4
Philip