0% found this document useful (0 votes)
67 views2 pages

Dav Assignment 5

Clustering groups unlabeled data based on similarity, dividing data into disjoint clusters where objects within a cluster are similar and different clusters are dissimilar. Classification categorizes predetermined classes of labeled data using supervised learning, learning from a training set and categorizing a test set. Key differences are that classification uses supervised learning while clustering uses unsupervised learning, classification predicts labels while clustering groups based on similarity, and classification requires training and testing data while clustering does not.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
67 views2 pages

Dav Assignment 5

Clustering groups unlabeled data based on similarity, dividing data into disjoint clusters where objects within a cluster are similar and different clusters are dissimilar. Classification categorizes predetermined classes of labeled data using supervised learning, learning from a training set and categorizing a test set. Key differences are that classification uses supervised learning while clustering uses unsupervised learning, classification predicts labels while clustering groups based on similarity, and classification requires training and testing data while clustering does not.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 2

ASSIGNMENT - 5 DAV

ASSIGNMENT 5
Q.1 Give differences between clustering and classification.

❖ Clustering:
It is a technique of organizing a group of data into classes and clusters where the objects with
high similarity reside inside a cluster and the objects of two clusters would be dissimilar to
each other.

Two clusters can be considered as disjoint. The main target of clustering is to divide the
whole data into multiple clusters. Unlike the classification process, here the class labels of
objects are not known before, and clustering is unsupervised learning.

In clustering, the similarity between two objects is measured by the similarity function where
the distance between those two objects is measured. Shorter the distance higher the
similarity, conversely longer the distance higher the dissimilarity.

● Some algorithms for clustering are:


o K-means

o Hierarchical Clustering.

o DBSCAN

❖ Classification:
It is a process related to categorization, the process in which ideas and objects are
recognized, differentiated and understood.

Classification is the process of learning a model that categorizes different predetermined


classes of data. It is a two-step process, comprised of a learning step and a classification step.
The learning step can be accomplished by using an already defined training set of data.

● Some algorithms for classification are:


o Logistic Regression

o Decision Tree

o K-nearest neighbors (KNN)

o Support Vector Machines (SVM)

o Random Forest

Enrolment No. 180500116032Page 1


ASSIGNMENT - 5 DAV

❖ Differences between Classification and Clust

1. Classification is used for supervised learning whereas clustering is used for unsupervised
learning.

2. The process of classifying the input instances based on their corresponding class labels is
known as classification whereas grouping the instances based on their similarity without
the help of class labels is known as clustering.

3. As Classification have labels so there is need of training and testing dataset for verifying
the model created but there is no need for training and testing dataset in clustering.

4. Classification is more complex as compared to clustering as there are many levels in


classification phase whereas only grouping is done in clustering.

5. Classification examples are Logistic regression, Naive Bayes classifier, Support vector
machines etc. Whereas clustering examples are k-means clustering algorithm, Fuzzy c-
means clustering algorithm, Gaussian (EM) clustering algorithm etc.

Enrolment No. 180500116032Page 2

You might also like

pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy