Cluster Analysis
Where is Cluster Analysis Used?
• Understanding Buyer Behaviour:
– Identify homogeneous groups of buyers
• Identify new product opportunities:
– Competitive sets within a market can be determined
– Compare current offerings with those of competitors
• Selecting test markets:
– Grouping cities into homogeneous markets
• Reducing data:
– Create sub-groups of data
Cluster Analysis
• Unsupervised learning
• Does not predict anything in particular
• Not a classification technique: we do not know the classes
• Do sub-populations exist?
– How many?
– What are their sizes?
– Any common properties?
– Can they be split further?
Agglomerative
• Works “bottom up”
• The two most similar clusters are combined into nodes
• Iterated until the root cluster is reached
Divisive
• Works “top down”
• Each cluster is split into its two least similar sub-clusters
• Iterated until the leaf clusters are reached
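The bottom-up procedure can be sketched in a few lines of Python. This is a minimal illustration, not a reference implementation: the sample points are made up, and single linkage (the distance between the closest pair of members) is an assumed choice, since the slides do not name a linkage rule.

```python
from itertools import combinations
from math import dist


def single_link(points, a, b):
    """Distance between clusters a and b: their closest pair of members (assumed rule)."""
    return min(dist(points[i], points[j]) for i in a for j in b)


def agglomerative(points):
    """Repeatedly merge the two most similar clusters until one root cluster remains.

    Returns the merge history as (cluster_a, cluster_b, distance) tuples,
    where each cluster is a tuple of point indices.
    """
    clusters = [(i,) for i in range(len(points))]  # every point starts as a leaf
    merges = []
    while len(clusters) > 1:
        a, b = min(combinations(clusters, 2),
                   key=lambda pair: single_link(points, *pair))
        merges.append((a, b, single_link(points, a, b)))
        # Replace the two merged clusters with their union (a new node).
        clusters = [c for c in clusters if c not in (a, b)] + [tuple(sorted(a + b))]
    return merges


# Hypothetical two-dimensional observations.
points = [(0.0, 0.0), (0.0, 1.0), (5.0, 5.0), (5.0, 6.5), (10.0, 0.0)]
merges = agglomerative(points)
for a, b, d in merges:
    print(a, "+", b, "at distance", round(d, 2))
```

The merge history is what a dendrogram draws: early merges join near neighbours, and the last merge forms the root. A divisive method would produce a similar tree by starting from the root and splitting.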
Agglomerative Hierarchical Clustering
Source: http://infolab.stanford.edu/~ullman/mmds/ch7.pdf
Hierarchical Clustering
Cluster Analysis - Steps
• Formulate the problem
• Select a distance measure
• Select a clustering procedure
• Decide on the number of clusters
• Interpret and profile the clusters
• Assess the validity of clustering
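The steps above can be walked through on a toy data set. The sketch below is only one possible reading of them: the buyer data, the average-linkage rule, the within-cluster sum of squares used to choose the number of clusters, and the stability check under a second distance measure are all assumptions made for illustration, not choices stated in the slides.

```python
from itertools import combinations
from math import dist
from statistics import mean

# Formulate the problem: six hypothetical buyers, two variables on the same scale.
buyers = [(1, 2), (2, 1), (1, 1), (8, 9), (9, 8), (9, 9)]


# Select a distance measure: Euclidean (math.dist) by default, Manhattan as an alternative.
def manhattan(p, q):
    return sum(abs(x - y) for x, y in zip(p, q))


# Select a clustering procedure: agglomerative, average linkage (assumed).
def cluster(points, k, distance=dist):
    """Merge the two closest clusters until only k clusters remain."""
    def avg_link(pair):
        a, b = pair
        return mean(distance(points[i], points[j]) for i in a for j in b)

    clusters = [(i,) for i in range(len(points))]
    while len(clusters) > k:
        a, b = min(combinations(clusters, 2), key=avg_link)
        clusters = [c for c in clusters if c not in (a, b)] + [tuple(sorted(a + b))]
    return sorted(clusters)


def centroid(points, members):
    """Per-variable mean of the points in one cluster."""
    return tuple(mean(points[i][d] for i in members) for d in range(len(points[0])))


def within_ss(points, clusters):
    """Total squared Euclidean distance from each point to its cluster centroid."""
    return sum(dist(points[i], centroid(points, c)) ** 2 for c in clusters for i in c)


# Decide on the number of clusters: look for the k after which within_ss stops dropping sharply.
scores = {k: within_ss(buyers, cluster(buyers, k)) for k in (1, 2, 3)}

# Interpret and profile the clusters: size and centroid of each group.
solution = cluster(buyers, 2)
profiles = [{"size": len(c), "centroid": centroid(buyers, c)} for c in solution]

# Assess validity (one simple check): does another distance measure give the same groups?
stable = cluster(buyers, 2, manhattan) == solution

print(scores)
print(profiles)
print("stable under Manhattan distance:", stable)
```

On real data the variables would usually be standardised before distances are computed, so that no single variable dominates because of its units.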