Introduction To Machine Learning
Introduction To Machine Learning
Chapter-1
INTRODUCTION TO MACHINE LEARNING
(ML) is a discipline of artificial intelligence (AI) that provides
machines with the ability to automatically learn from data and
past experiences while identifying patterns to make predictions
with minimal human intervention.
Machine learning methods enable computers to operate
autonomously without explicit programming. ML applications
are fed with new data, and they can independently learn, grow,
develop, and adapt.
Machine learning derives insightful information from large
volumes of data by leveraging algorithms to identify patterns
and learn in an iterative process. ML algorithms use
computation methods to learn directly from data instead of
relying on any predetermined equation that may serve as a
model.
The performance of ML algorithms adaptively improves with an
increase in the number of available samples during the
‘learning’ processes. For example, deep learning is a sub-
domain of machine learning that trains computers to imitate
natural human traits like learning from examples. It offers better
performance parameters than conventional ML algorithms.
Today, with the rise of big data, IoT, and ubiquitous computing,
machine learning has become essential for solving problems
across numerous areas, such as
Computational finance (credit scoring, algorithmic
trading)
Computer vision (facial recognition, motion tracking,
object detection)
Computational biology (DNA sequencing, brain tumor
detection, drug discovery)
Automotive, aerospace, and manufacturing (predictive
maintenance)
Natural language processing (voice recognition)
Data plays a significant role in the machine learning process. One of the
significant issues that machine learning professionals face is the absence
of good quality data. Unclean and noisy data can make the whole
process extremely exhausting.
The process is transforming, and hence there are high chances of error
which makes the learning complex.
The most important task you need to do in the machine learning process
is to train the data to achieve an accurate output. Less amount training
data will produce inaccurate or too biased predictions.
Let us understand this with the help of an example. Consider a machine
learning algorithm similar to training a child.
One day you decided to explain to a child how to distinguish between an
apple and a watermelon. You will take an apple and a watermelon and
show him the difference between both based on their color, shape, and
taste. In this way, soon, he will attain perfection in differentiating
between the two.
But on the other hand, a machine-learning algorithm needs a lot of data
to distinguish. For complex problems, it may even require millions of
data to be trained. Therefore we need to ensure that Machine learning
algorithms are trained with sufficient amounts of data.
6. Slow Implementation
This is one of the common issues faced by machine learning
professionals. The machine learning models are highly efficient in
providing accurate results, but it takes a tremendous amount of time.
Slow programs, data overload, and excessive requirements usually take a
lot of time to provide accurate results. Further, it requires constant
monitoring and maintenance to deliver the best output.
So you have found quality data, trained it amazingly, and the predictions
are really concise and accurate.
Yay, you have learned how to create a machine learning algorithm!! But
wait, there is a twist; the model may become useless in the future as data
grows.
The best model of the present may become inaccurate in the coming
Future and require further rearrangement.
So you need regular monitoring and maintenance to keep the algorithm
working. This is one of the most exhausting issues faced by machine
learning professionals.
(b) Machine Learning can reduce costs, mitigate risks, and improve
quality of life by recommending products/services, detecting cyber
security breaches, and enabling self-driving cars. It is becoming more
common and will soon integrate into many facets of life.
Extracting useful
Introduce algorithm from data as
1. information from large
well as from past experience
amount of data
Exploratory and
11. Predictive and prescriptive
descriptive
Clustering, association
Regression, classification,
14. rule mining, outlier
clustering, deep learning
detection
Data cleaning,
Data cleaning, transformation,
15. transformation, and
and feature engineering
integration
Chapter-2
Understanding Data
We use data science to make it easier to work with data. Data science is
defined as a field that combines knowledge of
mathematics, programming skills, domain expertise, scientific methods,
algorithms, processes, and systems to extract actionable knowledge and
insights from both structured and unstructured data, then apply the
knowledge gleaned from that data to a wide range of uses and domains.
What is Information?
The term Big Data is used in the data definition to describe the data that
is in the petabyte range or higher. Big Data is also described as 5Vs:
variety, volume, value, veracity, and velocity. Nowadays, web-based
eCommerce has spread vastly, business models based on Big Data have
evolved, and they treat data as an asset itself. And there are many
benefits of Big Data as well, such as reduced costs, enhanced efficiency,
enhanced sales, etc.