MLLAB2
Engineering
Air University, Islamabad, Pakistan
Experiment 2: Introduction to supervised and unsupervised machine learning and how to import a
dataset on Google Colab.
OBJECTIVES:
Unsupervised learning is the training of a machine using information that is neither classified nor labeled,
allowing the algorithm to act on that information without guidance. Here the task of the machine is to
group unsorted information according to similarities, patterns, and differences without any prior training on
labeled data.
Unlike supervised learning, no teacher is provided, which means the machine is never shown the correct answers.
The machine is therefore left to find the hidden structure in the unlabeled data by itself.
You can use unsupervised learning to examine animal data that has been gathered and to distinguish
between several groups according to the traits and actions of the animals. These groupings might
correspond to various animal species, allowing you to categorize the creatures without depending on
labels that already exist.
Imagine you have a large dataset of unlabeled images containing both dogs and cats. The model has never
been told which images show a dog and which show a cat, and it has no pre-existing labels or categories
for these animals. Your task is to use unsupervised learning to separate the dogs from the cats in new,
unseen images.
For instance, suppose the machine is given a set of pictures of dogs and cats that it has never seen.
Because the machine has no idea about the features of dogs and cats, it cannot label the pictures as 'dogs'
and 'cats'. But it can group them according to their similarities, patterns, and differences, i.e., it can
split the collection into two parts. The first part may contain all the pictures with dogs in them and the
second part may contain all the pictures with cats in them. The machine has not learned anything beforehand,
which means there is no training data and there are no labeled examples.
Unsupervised learning allows the model to work on its own to discover patterns and information that were
previously undetected. It mainly deals with unlabeled data.
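The grouping idea described above can be sketched with k-means clustering, one common unsupervised algorithm (the handout does not prescribe a specific algorithm, so this choice is an assumption). The animal measurements below are made-up illustrative values, and the function name kmeans is hypothetical. No labels are given to the algorithm; it only sees the numbers.

```python
import random

def kmeans(points, k, iterations=20, seed=0):
    """Group 2-D points into k clusters without using any labels."""
    rng = random.Random(seed)
    centroids = rng.sample(points, k)  # start from k distinct data points

    def nearest(p):
        # Index of the centroid closest to point p (squared Euclidean distance).
        dists = [(p[0] - c[0]) ** 2 + (p[1] - c[1]) ** 2 for c in centroids]
        return dists.index(min(dists))

    for _ in range(iterations):
        # Assignment step: each point joins its nearest centroid.
        clusters = [[] for _ in range(k)]
        for p in points:
            clusters[nearest(p)].append(p)
        # Update step: move each centroid to the mean of its members.
        for i, members in enumerate(clusters):
            if members:
                centroids[i] = (
                    sum(m[0] for m in members) / len(members),
                    sum(m[1] for m in members) / len(members),
                )
    # Final cluster index for every point, in input order.
    return [nearest(p) for p in points]

# Made-up (weight in kg, height in cm) values: four small and four large animals.
points = [(4.0, 25.0), (4.5, 23.0), (3.8, 24.0), (5.0, 26.0),
          (30.0, 60.0), (28.0, 58.0), (32.0, 62.0), (29.0, 61.0)]

labels = kmeans(points, k=2)
print(labels)  # two groups; which group gets index 0 or 1 is arbitrary
```

The algorithm separates the small animals from the large ones, but it cannot say which group is "cats" and which is "dogs". Naming the groups still requires a human or labeled data.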
Dataset
A dataset in machine learning is a collection of data points used for training, validating, or testing a machine
learning model. In supervised learning it typically consists of input-output pairs, where each input corresponds
to a desired output; in unsupervised learning only the inputs are available. Datasets vary widely in size and
complexity, and they are essential for developing and evaluating machine learning algorithms.
Classes
Features
Samples
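As a small illustration of these three terms, the sketch below builds a tiny table in which each row is one sample, each column is one feature, and the label list holds the class of each sample. The values and the feature names are hypothetical and serve only to show how the three counts are obtained.

```python
# Hypothetical toy dataset: each row of X is one sample (one animal).
feature_names = ["weight_kg", "height_cm", "tail_length_cm"]
X = [
    [4.0, 25.0, 28.0],
    [30.0, 60.0, 35.0],
    [4.5, 23.0, 27.0],
    [28.0, 58.0, 33.0],
    [5.0, 26.0, 30.0],
]
# One class label per sample, in the same order as the rows of X.
y = ["cat", "dog", "cat", "dog", "cat"]

n_samples = len(X)        # number of rows
n_features = len(X[0])    # number of columns
classes = sorted(set(y))  # distinct output labels

print("Samples:", n_samples)
print("Features:", n_features, feature_names)
print("Classes:", len(classes), classes)
```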
How to import a dataset on Colab?
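One possible way, sketched below, is to load a dataset that ships with scikit-learn, assuming that library is available in the Colab runtime. The Iris dataset is used here only as an example and is not specified by the handout. A CSV file of your own could instead be uploaded with google.colab.files.upload() and read with pandas.read_csv; that route works only inside Colab and is not shown.

```python
from sklearn.datasets import load_iris

# Load the built-in Iris dataset (no download or upload needed).
iris = load_iris()

X = iris.data    # features: array of shape (n_samples, n_features)
y = iris.target  # output classes: one integer label per sample

print("Feature names:", iris.feature_names)
print("Class names:", list(iris.target_names))
print("First five feature rows:")
print(X[:5])
print("First five class labels:", y[:5])

print("No. of samples:", X.shape[0])
print("No. of features:", X.shape[1])
print("No. of classes:", len(iris.target_names))
```

The same pattern (load, take the data and target attributes, inspect the shape) applies to the other loaders in sklearn.datasets.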
Lab Tasks
Write code to import three different datasets on Colab, separate the features and output classes, and print
them.
Also mention the number of samples, features, and classes for each dataset.