M4 - Clustering
M4 - Clustering
M4 - Clustering
MCA301
Module 4 - Clustering Topics
1. Clustering Introduction
2. Similarity measures
3. Clustering criteria
4. Distance functions
5. k-Means Clustering
6. Hierarchical clustering
7. DBSCAN
8. Combining Multiple Learners
• Voting
• Bagging
• Boosting
Clustering in machine learning
The clusters are formed in such a way that any two data
objects within a cluster have a minimum distance
value and any two data objects across different clusters
have a maximum distance value
Similarity is an amount that reflects the strength of
relationship between two data items, it represents how
similar 2 data patterns are.
1. Partitional Clustering
2. Density based Clustering
3. Hierarchical clustering
1. Partitional Clustering
Partitional clustering is considered to be the most popular
category of clustering algorithm. Partition clustering
algorithm divides the data points into “k“ partitions, where
each partition represents a cluster.
Example of usage:
Manhattan Distance: