Introduction to Machine Learning

Md. Abid Hasan Miazy
ID: 12008001
Department of Computer Science and Engineering
Comilla University
Cumilla, Bangladesh
abidsusma4288miazy@gmail

Md. Montasir Mamun Sagor
ID: 12008025
Department of Computer Science and Engineering
Comilla University
Cumilla, Bangladesh
montasirsagor0@gmail.com

Md. Mehedi Hasan Parvez
ID: 11908015
Department of Computer Science and Engineering
Comilla University
Cumilla, Bangladesh
mehediparvez11908015@stud.cou.ac.bd

Romjan Chowdhury
ID: 11908048
Department of Computer Science and Engineering
Comilla University
Cumilla, Bangladesh
mrchowdhury587@gmail.com

Abstract—Machine learning is one of the subfields of artificial intelligence which focuses on the development of systems that are capable of automatically learning from data and making predictions or decisions. In this work, we explain the main types of machine learning (supervised, unsupervised, and reinforcement learning), discussing their algorithms and applications in detail. Moreover, we cover essential topics such as underfitting and overfitting and empirical vs. expected risk, and we also describe the machine learning pipeline. This knowledge provides the groundwork for understanding how machine learning techniques can and should be optimized for practical applications.

Index Terms—Machine Learning, Supervised Learning, Unsupervised Learning, Reinforcement Learning, Overfitting, Underfitting, Empirical Risk, Expected Risk, ML Pipeline.

I. INTRODUCTION

Identifying patterns and designing algorithms to process increasingly large volumes of data are among the defining features of the modern world. Machine learning is a branch of artificial intelligence which allows computers to learn from data without explicit human intervention, making it increasingly crucial in contemporary society. This report presents the basic concepts of ML along with its main types, focusing on supervised, unsupervised, and reinforcement learning. Moreover, overfitting and underfitting will be covered with examples, and definitions of empirical and expected risk will be provided.

II. SUPERVISED, UNSUPERVISED, AND REINFORCEMENT LEARNING

A. Supervised Learning

Supervised learning is similar to learning with the aid of a teacher. The model is taught using a labeled dataset where every piece of data is matched with the relevant output. The main focus is to help the model learn how to map inputs to outputs so that it can later generate predictions for novel data.

Fig. 1. Workflow of Supervised Machine Learning

Key Characteristics:
• Labeled Data: Every training example has a corresponding label.
• Objective: Minimize the difference between the expected output and the actual output.
• Applications: Identifying spam, classifying images, and giving medical diagnoses.

Example: Consider the task of classifying emails as 'spam' or 'not spam'. The model is trained on a dataset of emails labeled 'spam' or 'not spam'. After training, the model understands the features of emails (like keywords or sender information) and is able to accurately classify future emails (see the code sketch at the end of this subsection).

Common Algorithms:
• Linear Regression
• Logistic Regression
• Support Vector Machines (SVM)
• Decision Trees
• Random Forests
• k-Nearest Neighbors (k-NN)
• Naïve Bayes

Applications:
• Healthcare: Predicting disease progression given patient health records.
• Finance: Estimating a credit score given a customer's financial history.
• Retail: Forecasting demand in order to estimate the stock needed.
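To make the spam example concrete, the following minimal sketch trains a logistic regression classifier (one of the algorithms listed above) with scikit-learn; the four emails and their labels are invented purely for illustration.

```python
# Minimal supervised-learning sketch of the spam example; the tiny
# dataset is synthetic and purely illustrative.
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.linear_model import LogisticRegression

emails = ["win a free prize now", "meeting notes attached",
          "claim your free money now", "lunch at noon tomorrow"]
labels = [1, 0, 1, 0]  # 1 = spam, 0 = not spam

vec = CountVectorizer()              # keyword-count features
X = vec.fit_transform(emails)
model = LogisticRegression().fit(X, labels)

# A previously unseen email is mapped into the same feature space.
print(model.predict(vec.transform(["free prize, claim now"])))  # likely [1], i.e. spam
```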
B. Unsupervised Learning

Unsupervised learning methods and models find groupings or structure in data that carries no labeled information.

Fig. 2. Workflow of Unsupervised Machine Learning

Key Characteristics:
• Raw Data: No predefined labels or marks.
• Objective: Uncover underlying structures and arrangements.

Example: Grouping customers by behavior. A machine can group people based on their spending habits without knowing the real grouping (a code sketch follows the lists below).
Common Algorithms:
• k-Means Clustering
• Hierarchical Clustering
• Density-Based Spatial Clustering of Applications with Noise (DBSCAN)
• Principal Component Analysis (PCA)
• t-Distributed Stochastic Neighbor Embedding (t-SNE)
• Auto-encoders

Applications:
• Marketing: Designing measured ad campaigns
• Healthcare: Marking abnormalities in imaging data
• Economics: Helping determine false and true claims
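As an illustration of the customer-grouping example above, the sketch below runs k-means (the first algorithm in the list) on a handful of made-up spending records; only the two-cluster structure of the numbers matters.

```python
# Unsupervised sketch: k-means groups unlabeled spending records.
# The data is made up purely for illustration.
import numpy as np
from sklearn.cluster import KMeans

# Each row: [monthly spend, purchases per month] for one customer.
spending = np.array([[120, 4], [115, 5], [130, 3],
                     [900, 22], [870, 25], [940, 20]])
kmeans = KMeans(n_clusters=2, n_init=10, random_state=0).fit(spending)
print(kmeans.labels_)  # two groups: low spenders vs. high spenders
```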
C. Reinforcement Learning

Reinforcement learning involves training an agent to make decisions by interacting with an environment and learning through rewards and penalties.

Fig. 3. Workflow of Reinforcement Machine Learning

Key Characteristics:
• Interaction: It includes an agent that acts and receives feedback.
• Objective: Maximize cumulative reward.
• Applications: Robotics, gaming, self-driving cars.

Example: A robot navigating a maze receives rewards for reaching the goal and penalties for hitting walls, learning the optimal path over time (see the Q-learning sketch at the end of this subsection).

Common Algorithms:
• Q-Learning
• Deep Q-Networks (DQN)
• Policy Gradient Methods
• Actor-Critic Methods
• Proximal Policy Optimization (PPO)

Applications:
• Gaming: Chess, Go
• Robotics: Object manipulation
• Finance: Adaptive trading strategies
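The maze example can be sketched with tabular Q-Learning (the first algorithm listed above). The five-state corridor, rewards, and hyperparameters below are invented for illustration; the agent gradually learns that moving right maximizes cumulative reward.

```python
# Toy tabular Q-learning: a 5-state corridor with the goal at state 4.
# Layout, rewards, and hyperparameters are invented for illustration.
import random

n_states, actions = 5, (-1, +1)        # actions: move left / move right
Q = {(s, a): 0.0 for s in range(n_states) for a in actions}
alpha, gamma, epsilon = 0.5, 0.9, 0.2  # learning rate, discount, exploration

for _ in range(300):                   # episodes
    s = 0
    while s != n_states - 1:
        # Epsilon-greedy choice: explore sometimes, otherwise exploit.
        a = random.choice(actions) if random.random() < epsilon \
            else max(actions, key=lambda act: Q[(s, act)])
        s_next = min(max(s + a, 0), n_states - 1)
        r = 1.0 if s_next == n_states - 1 else -0.1  # reward goal, penalize steps
        # Q-learning update: move Q(s, a) toward the reward plus the
        # discounted value of the best follow-up action.
        Q[(s, a)] += alpha * (r + gamma * max(Q[(s_next, b)] for b in actions)
                              - Q[(s, a)])
        s = s_next

print(max(actions, key=lambda act: Q[(0, act)]))  # learned first move: 1 (right)
```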
III. UNDERFITTING AND OVERFITTING

A. Underfitting

When the model does not capture the underlying complexities of the data and is overly simplistic, it leads to underfitting. Both the training data and the test data yield disappointing results.

Characteristics:
• High Bias: The model lacks nuance, holding a consistently unrealistic view of the data.
• Low Variance: The structure which guides the model is uniformly inaccurate.
• The model underperforms on both the training and test datasets.

Example: Trying to estimate a property's value using only its square footage, while ignoring its location or number of rooms, is an example of underfitting.

Prevention Techniques:
• Capture clearly non-linear patterns by adding layers (capacity) to the model.
• Include more relevant features to provide the model with additional information.

B. Overfitting

Overfitting is a phenomenon in machine learning where a model learns patterns too well, including random noise, which leads to excellent performance on the training set but poor outcomes on new test datasets because the model is unable to generalize.

Characteristics:
• Minimal Bias: The model does not show significant deviations from the actual values due to details captured in the training data.
• High Variance: The model produces different outputs for the same function when applied to different datasets.
• Great Results During Training: The model has undergone extensive training and shows trivial error on the training set.
• Poor Results on Testing: The model has become too exclusively tailored to the training data and will therefore underperform on any unfamiliar data.

Example: A highly sophisticated decision tree that memorizes every possible pattern for each training instance might achieve perfect accuracy during training. Such a tree is likely to underperform in real-world scenarios filled with slight deviations when presented with previously unseen data (illustrated in the sketch after this section).

Prevention Techniques:
• Cross-validation
• Regularization
• Pruning
• Ensemble methods
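Both failure modes can be observed directly by varying a decision tree's depth, echoing the decision-tree example above. The synthetic dataset and depth values below are arbitrary choices for illustration; what matters is the gap between training and test accuracy.

```python
# Sketch: tree depth controls under- vs. overfitting on noisy synthetic data.
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

X, y = make_classification(n_samples=400, n_features=10, flip_y=0.2,
                           random_state=0)  # flip_y injects label noise
X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=0)

for depth in (1, 4, None):  # too shallow, moderate, unconstrained
    tree = DecisionTreeClassifier(max_depth=depth, random_state=0)
    tree.fit(X_tr, y_tr)
    print(depth, round(tree.score(X_tr, y_tr), 2), round(tree.score(X_te, y_te), 2))
# depth=1 scores poorly on both sets (underfitting); depth=None reaches
# ~1.0 on training data but drops on the test set (overfitting).
```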
IV. EMPIRICAL RISK AND EXPECTED RISK

A. Empirical Risk

Empirical risk refers to the average loss of a model over a finite sample of training data. Given a dataset $\{(x_i, y_i)\}_{i=1}^{n}$ drawn i.i.d. from the underlying data distribution $D$, the empirical risk $\hat{R}_n(h)$ is defined as:

$$\hat{R}_n(h) = \frac{1}{n} \sum_{i=1}^{n} L(h(x_i), y_i)$$

Empirical risk serves as an approximation of the true (expected) risk based on observed data. The principle of Empirical Risk Minimization (ERM) in machine learning involves selecting a hypothesis $h \in \mathcal{H}$ that minimizes the empirical risk.
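The formula above translates directly into code. In this sketch the squared loss, the hypothesis h, and the three data points are all placeholder choices for illustration.

```python
# Empirical risk: average loss of hypothesis h over a finite sample.
def empirical_risk(h, xs, ys, loss=lambda pred, y: (pred - y) ** 2):
    return sum(loss(h(x), y) for x, y in zip(xs, ys)) / len(xs)

h = lambda x: 2 * x                      # a candidate hypothesis
xs, ys = [1.0, 2.0, 3.0], [2.1, 3.9, 6.2]
print(empirical_risk(h, xs, ys))         # average squared loss over the sample
```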
B. Expected Risk

The expected risk, also known as the true or real risk, is a theoretical measure of how well a model performs over the entire data distribution, including unseen cases. It represents the anticipated value of the loss function under the joint distribution of input-output pairs $(x, y)$.

For a hypothesis (model) $h$ and a loss function $L(h(x), y)$, the expected risk $R(h)$ is defined as:

$$R(h) = \mathbb{E}_{(x,y)\sim D}[L(h(x), y)] = \int L(h(x), y)\, dD(x, y)$$

Here, $D$ denotes the (unknown) true probability distribution of the data. Since $D$ is usually not known in practice, the expected risk cannot be computed exactly.
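Although the expected risk cannot be computed when $D$ is unknown, it can be approximated whenever we can sample from the distribution. The sketch below assumes a synthetic $D$ (y = 2x plus Gaussian noise, an invented example) and shows the empirical risk of a large i.i.d. sample approaching the expected risk.

```python
# Monte Carlo approximation of expected risk under an assumed, synthetic D.
import random

def sample():                          # stand-in for the unknown distribution D
    x = random.uniform(0.0, 1.0)
    return x, 2.0 * x + random.gauss(0.0, 0.1)   # y = 2x + noise

h = lambda x: 2.0 * x                  # hypothesis matching the true trend
n = 100_000
risk = sum((h(x) - y) ** 2 for x, y in (sample() for _ in range(n))) / n
print(risk)  # approaches the noise variance (0.01) as n grows
```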
V. MACHINE LEARNING PIPELINE

A machine learning pipeline is a series of interconnected data processing and modeling steps designed to automate, standardize, and streamline the process of building, training, evaluating, and deploying machine learning models.

A. Pipeline Stages

Data passes through several steps to produce a machine learning model capable of making accurate predictions. These steps include:

1) Data Collection: Data is collected from different sources such as databases, APIs, or files. This data is often raw and requires preprocessing.
2) Data Pre-processing: Involves handling missing values, removing duplicates, and correcting errors to improve the model's predictions. This step often requires domain knowledge and creativity.
3) Feature Engineering: Creating new features or selecting relevant features to enhance model performance.
4) Data Splitting: Dividing the data into training and testing sets to evaluate model performance.
5) Model Selection: Choosing an appropriate machine learning algorithm based on the problem type (e.g., classification, regression), data characteristics, and performance needs.
6) Model Training: Training the model on the training dataset using the selected algorithm.
7) Model Evaluation: Assessing the model's performance using a separate testing dataset or through cross-validation techniques.
8) Model Tuning: Adjusting hyperparameters to improve the model's performance.
9) Model Deployment: Integrating the trained model into a production environment where it can make predictions on unseen data. This may involve creating APIs and integrating with other systems.
10) Monitoring and Maintenance: Continuously monitoring the model in production and updating it as needed to maintain performance.

B. Benefits of Machine Learning Pipelines

• Modularization: Simplifies development by breaking the process into independent steps.
• Efficiency: Automates repetitive tasks, saving time and effort.
• Scalability: Handles large datasets effectively.
• Experimentation: Facilitates experimentation with different techniques and configurations.
• Deployment: Streamlines the transition from development to production.
• Collaboration: Enhances teamwork by structuring the workflow.
• Version Control and Documentation: Enables tracking of changes in code and configurations.
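Several of the stages above map directly onto scikit-learn's Pipeline abstraction. The sketch below is a minimal illustration on synthetic data; the chosen preprocessing step and model are arbitrary stand-ins for stages 1 through 7.

```python
# Minimal pipeline sketch: preprocessing, training, and evaluation chained
# together; the synthetic data and chosen steps are illustrative only.
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler

X, y = make_classification(n_samples=300, random_state=0)       # data collection
X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=0) # data splitting

pipe = Pipeline([
    ("scale", StandardScaler()),       # pre-processing / feature scaling
    ("model", LogisticRegression()),   # model selection
])
pipe.fit(X_tr, y_tr)                   # model training
print(pipe.score(X_te, y_te))          # model evaluation on held-out data
```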
VI. CONCLUSION

Understanding the core differences among machine learning algorithms is crucial for selecting the proper approach to a problem. Properly recognizing overfitting and underfitting, and resolving these issues, provides the insight needed to build suitable models.
ACKNOWLEDGMENT
The authors would like to express their sincere gratitude
to Dr. Mahmudul Hasan, Head of the Department of Computer
Science and Engineering at Comilla University, for
his invaluable support in the conception and design of this
study. His insightful guidance and encouragement significantly
contributed to the development and direction of this research.
