Mehakreport
Mehakreport
PROJECT REPORT
OF
MINOR PROJECT LAB
ON
IRIS FLOWER PREDICTION
OF
BACHELOR OF TECHNOLOGY
IN
COMPUTER SCIENCE ENGINEERING
Submitted By:
Name: Mehak
This machine learning tutorial gives you an introduction to machine learning along with the wide
range of machine learning techniques such as Supervised, Unsupervised, and Reinforcement
learning. You will learn about regression and classification models, clustering methods, hidden
Markov models, and various sequential models.
Machine learning enables a machine to automatically learn from data, improve performance
from experiences, and predict things without being explicitly programmed.
With the help of sample historical data, which is known as training data, machine learning
algorithms build a mathematical model that helps in making predictions or decisions without
being explicitly programmed. Machine learning brings computer science and statistics
together for creating predictive models. Machine learning constructs or uses the algorithms
that learn from historical data. The more we will provide the information, the higher will be
the performance.
A machine has the ability to learn if it can improve its performance by gaining more data.
It is a data-driven technology.
Machine learning is much similar to data mining as it also deals with the huge amount of
the data.
The importance of machine learning can be easily understood by its uses cases, Currently,
machine learning is used in self-driving cars, cyber fraud detection, face recognition, and
friend suggestion by Facebook, etc. Various top companies such as Netflix and Amazon have
build machine learning models that are using a vast amount of data to analyze the user interest
and recommend product accordingly.
Supervised learning
Unsupervised learning
Reinforcement learning
1) Supervised Learning
Supervised learning is a type of machine learning method in which we provide sample labeled
data to the machine learning system in order to train it, and on that basis, it predicts the output.
The system creates a model using labeled data to understand the datasets and learn about each
data, once the training and processing are done then we test the model by providing a sample
data to check whether it is predicting the exact output or not.
The goal of supervised learning is to map input data with the output data. The supervised
learning is based on supervision, and it is the same as when a student learns things in the
supervision of the teacher. The example of supervised learning is spam filtering.
Classification
Regression
2) Unsupervised Learning
Unsupervised learning is a learning method in which a machine learns without any supervision.
The training is provided to the machine with the set of data that has not been labeled, classified,
or categorized, and the algorithm needs to act on that data without any supervision. The goal of
unsupervised learning is to restructure the input data into new features or a group of objects with
similar patterns.
In unsupervised learning, we don't have a predetermined result. The machine tries to find
useful insights from the huge amount of data. It can be further classifieds into two categories of
algorithms:
Clustering
Association
3) Reinforcement Learning
Reinforcement learning is a feedback-based learning method, in which a learning agent gets a
reward for each right action and gets a penalty for each wrong action. The agent learns
automatically with these feedbacks and improves its performance. In reinforcement learning,
the agent interacts with the environment and explores it. The goal of an agent is to get the most
reward points, and hence, it improves its performance.
The robotic dog, which automatically learns the movement of his arms, is an example of
Reinforcement learning.
Note: We will learn about the above types of machine learning in detail in later chapters.
Before some years (about 40-50 years), machine learning was science fiction, but today it is the
part of our daily life. Machine learning is making our day to day life easy from self-driving cars
to Amazon virtual assistant "Alexa". However, the idea behind machine learning is so old and
has a long history. Below some milestones are given which have occurred in the history of
machine learning:
1834: In 1834, Charles Babbage, the father of the computer, conceived a device that could be
programmed with punch cards. However, the machine was never built, but all modern computers
rely on its logical structure.
1936: In 1936, Alan Turing gave a theory that how a machine can determine and execute a set of
instructions.
1950: In 1950, Alan Turing published a seminal paper, "Computer Machinery and Intelligence,"
on the topic of artificial intelligence. In his paper, he asked, "Can machines think?"
1952: Arthur Samuel, who was the pioneer of machine learning, created a program that helped
an IBM computer to play a checkers game. It performed better more it played.
1959: In 1959, the term "Machine Learning" was first coined by Arthur Samuel.
The duration of 1974 to 1980 was the tough time for AI and ML researchers, and this duration
was called as AI winter.
In this duration, failure of machine translation occurred, and people had reduced their interest
from AI, which led to reduced funding by the government to the researches.
Machine Learning from theory to reality
1959: In 1959, the first neural network was applied to a real-world problem to remove
echoes over phone lines using an adaptive filter.
1985: In 1985, Terry Sejnowski and Charles Rosenberg invented a neural network NETtalk,
which was able to teach itself how to correctly pronounce 20,000 words in one week.
1997: The IBM's Deep blue intelligent computer won the chess game against the chess expert
Garry Kasparov, and it became the first computer which had beaten a human chess expert.
2006: In the year 2006, computer scientist Geoffrey Hinton has given a new name to neural net
research as "deep learning," and nowadays, it has become one of the most trending technologies.
2012: In 2012, Google created a deep neural network which learned to recognize the image
of humans and cats in YouTube videos.
2014: In 2014, the Chabot "Eugen Goostman" cleared the Turing Test. It was the first Chabot
who convinced the 33% of human judges that it was not a machine.
2014: DeepFace was a deep neural network created by Facebook, and they claimed that it could
recognize a person with the same precision as a human can do.
2016: AlphaGo beat the world's number second player Lee sedol at Go game. In 2017 it beat the
number one player of this game Ke Jie.
2017: In 2017, the Alphabet's Jigsaw team built an intelligent system that was able to learn the
online trolling. It used to read millions of comments of different websites to learn to stop online
trolling.
Modern machine learning models can be used for making various predictions, including weather
prediction, disease prediction, stock market analysis, etc.
Prerequisites
Before learning machine learning, you must have the basic knowledge of followings so that you
can easily understand the concepts of machine learning:
Now that we do understand a bit about what libraries are and what Python is,
let us do a deep dive into some of the most commonly used libraries in Python:
1. Pandas
Figure 1.3
• Cross-Validation: There are several methods for checking the accuracy of
supervised models on unseen data with Scikit Learn for example
the train_test_split method, cross_val_score, etc.
Figure 1.6
7. SciPy: Scipy is a free, open-source Python library used for scientific computing,
data processing, and high-performance computing. The library contains a huge
number of user-friendly routines for quick computation. The package is based on the
NumPy extension, which allows for data processing and visualization as well as
high-level commands.
Figure 1.7
Training Work Undertaken
Project Name: IRIS-FLOWER CLASSIFICATION
The Iris flower classification project using logistic regression is a classic machine learning
example that demonstrates the application of supervised learning algorithms to predict the species
2.2 Dataset:
The Iris dataset is a well-known dataset in the machine learning community. It consists of 150
samples of Iris flowers, where each sample includes the following four features:
• Sepal length
• Sepal width
• Petal length
• Petal width
• Setosa
• Versicolor
• Virginica
Link: https://raw.githubusercontent.com/sarwansingh/Python/master/ClassExamples/data/iris.csv
2.3.3 cleaning the data: