Dav Assignment 5
ASSIGNMENT 5
Q.1 Give differences between clustering and classification.
❖ Clustering:
Clustering is a technique for organizing a set of data objects into groups (clusters) such that
objects within the same cluster are highly similar to one another, while objects in different
clusters are dissimilar.
In hard (partitional) clustering the clusters are disjoint, so each object belongs to exactly
one cluster. The main goal of clustering is to divide the whole data set into multiple clusters.
Unlike classification, the class labels of the objects are not known in advance, which makes
clustering a form of unsupervised learning.
In clustering, the similarity between two objects is measured by a similarity function, which
is usually defined through the distance between the two objects: the shorter the distance, the
higher the similarity, and the longer the distance, the higher the dissimilarity.
Examples of clustering algorithms:
o Hierarchical clustering
o DBSCAN
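The distance-based notion of similarity described above can be illustrated with a minimal sketch. It assumes Euclidean distance and converts it to a similarity score with 1 / (1 + distance); both choices, the function names, and the sample points are illustrative assumptions, not something fixed by the definition of clustering.

```python
import math

def euclidean_distance(a, b):
    """Straight-line distance between two equal-length numeric points."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def similarity(a, b):
    """Map a distance in [0, inf) to a similarity in (0, 1]; 1 means identical points."""
    return 1.0 / (1.0 + euclidean_distance(a, b))

# Made-up sample points, for illustration only.
p = (1.0, 2.0)
q = (1.5, 2.5)  # close to p
r = (8.0, 9.0)  # far from p

# The closer pair should be the more similar pair.
print(similarity(p, q) > similarity(p, r))
```

Any other distance measure (for example Manhattan distance) could be substituted in `euclidean_distance` without changing the idea: smaller distance means higher similarity.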
❖ Classification:
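Classification is the process of categorization: objects are recognized, differentiated, and
assigned to one of a set of predefined classes. In machine learning, a classifier learns this
assignment from examples whose class labels are already known.
Examples of classification algorithms:
o Decision Tree
o Random Forest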
❖ Differences:
1. Classification is a form of supervised learning, whereas clustering is a form of
unsupervised learning.
2. Classification assigns input instances to classes based on their known class labels,
whereas clustering groups instances based on their similarity, without the help of class
labels.
3. Because classification uses labels, it needs separate training and testing data sets to
build and verify the model. Clustering has no labels to learn from or check against, so it
does not need a training/testing split.
4. Examples of classification algorithms are logistic regression, the naive Bayes classifier,
and support vector machines, whereas examples of clustering algorithms are k-means, fuzzy
c-means, and Gaussian mixture (EM) clustering.
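The contrast drawn in the points above can be sketched in a few lines of plain Python. This is only an illustration under stated assumptions: a nearest-centroid classifier stands in for "classification", a bare-bones k-means stands in for "clustering", the helper names (`fit_centroids`, `predict`, `kmeans`) are hypothetical, the k-means initialisation simply takes the first k points, and the data are made up.

```python
import math

def dist(a, b):
    """Euclidean distance between two points."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def mean(points):
    """Coordinate-wise mean of a non-empty list of points."""
    n = len(points)
    return tuple(sum(c) / n for c in zip(*points))

# Classification (supervised): class labels are used to build the model.
def fit_centroids(points, labels):
    groups = {}
    for p, lab in zip(points, labels):
        groups.setdefault(lab, []).append(p)
    return {lab: mean(ps) for lab, ps in groups.items()}

def predict(centroids, point):
    """Return the label whose centroid is nearest to the point."""
    return min(centroids, key=lambda lab: dist(centroids[lab], point))

# Clustering (unsupervised): only the points are used, never the labels.
def assign(points, centers):
    return [min(range(len(centers)), key=lambda i: dist(centers[i], p))
            for p in points]

def kmeans(points, k, iterations=10):
    centers = list(points[:k])  # naive initialisation (assumption)
    for _ in range(iterations):
        assignment = assign(points, centers)
        for i in range(k):
            members = [p for p, a in zip(points, assignment) if a == i]
            if members:  # keep the old center if a cluster is empty
                centers[i] = mean(members)
    return assign(points, centers)

# Made-up toy data: two well-separated groups.
points = [(1.0, 1.0), (5.0, 5.0), (1.2, 0.8), (5.2, 4.9), (0.9, 1.1), (5.1, 5.2)]
labels = ["small", "large", "small", "large", "small", "large"]

# Classification: train on the first four labeled points, test on the last two.
model = fit_centroids(points[:4], labels[:4])
predictions = [predict(model, p) for p in points[4:]]
accuracy = sum(p == t for p, t in zip(predictions, labels[4:])) / len(predictions)

# Clustering: group all six points without looking at the labels.
clusters = kmeans(points, k=2)

print(predictions, accuracy)
print(clusters)
```

Note that the classifier returns named classes and can be scored against held-out labels, while the clustering result is only a set of arbitrary group indices with no ground truth to compare against.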