Professional Documents
Culture Documents
HMM
fi
s
fi
.
1c.
1d. Variance is nothing but the measurement of the difference between the
observed values and the average of predicted values where covariance is
the measurement of how two variables vary with respect to each other
In single variate regression, the model describes the relationship between
one independent variable and one dependent variable using a straight line
wherein the multivariate regression, the model describes the relationship
between more than one independent variable and more than one
dependent variable, which are linearly related
Problem 2
Problem 4
ANSWER
Distance from each point to other points is calculated using manhattan
distance given by abs(x2-x1)+abs(y2-y1
Let A = (2,2
B=(2,3
C=(3,3
D=(6,7
E=(9,10
F=(8,7
Distance from A to B = (2,2) to (2,3)=abs(2-2)+abs(2-3)=
Distance from A to C = (2,2) to (3,3) = abs(2-3)+abs(2-3)=
Distance from A to D = (2,2) to (6,7) = abs(2-6)+abs(2-7)=
Distance from A to E = (2,2) to (9,10) = abs(2-9)+abs(2-10)=1
Distance from A to F = (2,2) to (8,7) = abs(2-8)+abs(2-7)=1
Distance from B to C = (2,3) to (3,3)= abs(2-3)+abs(3-3)=
Distance from B to D = (2,3) to (6,7)=abs(2-6)+abs(3-7)=
Distance from B to E = (2,3) to (9,10)=abs(2-9)+abs(3-10)=1
Distance from B to F= (2,3) to (8,7)=abs(2-8)+abs(3-7)=1
Distance from C to D = (3,3) to (6,7)=abs(3-6)+abs(3-7)=
Distance from C to E = (3,3) to (9,10)=abs(3-9)+abs(3-10)=1
Distance from C to F =(3,3) to (8,7)=abs(3-8)+abs(3-7)=
Distance from D to E=(6,7) to (9,10)=abs(6-9)+abs(7-10)=
Distance from D to F=(6,7) to (8,7)=abs(6-8)+abs(7-7)=
Distance from E to F= (9,10) to (8,7)=abs(9-8)+abs(10-7)=
There is 2 set of points with shortest distance (2,2) and (2,3) and (2,3)
to (3,3). We choose randomly one. Let it be (2,2) and (2,3)
So the resulting Clusters are AB, C, D, E,
The centroid of AB is calculated as (2,2.5
Distance from AB to C = (2,2.5) to (3,3) = abs(2-3)+abs(2.5-3)=1.
Distance from AB to D = (2,2.5) to (6,7) = abs(2-6)+abs(2.5-7)=8.
Distance from AB to E = (2,2.5) to (9,10) = abs(2-9)+abs(2.5-10)=14.
Distance from AB to F = (2,2.5) to (8,7) = abs(2-8)+abs(2.5-7)=10.
Distance from C to D = (3,3) to (6,7)=abs(3-6)+abs(3-7)=
Distance from C to E = (3,3) to (9,10)=abs(3-9)+abs(3-10)=1
Distance from C to F =(3,3) to (8,7)=abs(3-8)+abs(3-7)=
)
The dendrogram for this clustering is plotted as shown below. The cut-
off line is marke when there is a sudden jump in the distance from 2 to
5. The resultant clusters are below the cut-off line
.
Problem
5