0% found this document useful (0 votes)

14 views52 pages

Chapter 1 - Machine Learning Fundamentals

The document provides an overview of machine learning fundamentals, including definitions, types, and applications. It discusses the differences between traditional programming and machine learning, as well as the challenges faced in the field. Key concepts such as supervised, unsupervised, semi-supervised, and reinforcement learning are also explored, along with the importance of data quality and algorithm selection.

Uploaded by

ea0949641515

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

14 views52 pages

Chapter 1 - Machine Learning Fundamentals

Uploaded by

ea0949641515

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 52

Fundamentals of Machine

Learning
Instructor: Melaku M.

Target Group: G3SEng

Arba Minch University-FCSE FML by Melaku M
Chapter 1: Machine Learning Fundamentals

FML by Melaku M
Quotes
❖“If you were a current computer science student, what area
would you start studying heavily?
• Answer: Machine Learning.
–Bill Gates, Reddit AMA
❖“Machine learning is today’s discontinuity”
–Jerry Yang, Co-founder, Yahoo
❖“AI is the new electricity! Electricity transformed countless
industries; AI will now do the same.” –Andrew Ng
FML by Melaku M
What is Machine Learning?

FML by Melaku M
Potential Definitions for Machine Learning
❖Machine learning (ML) is a field of study that focuses on creating systems that
can learn from data without being explicitly programmed. “Arthur Samuel (1959)”

❖ Essentially, it allows computers to:

➢Learn from data: Instead of relying on rigid, pre-written/hard-coded rules, ML models
train on large datasets to recognize patterns and relationships.

➢Improve performance over time: As they are exposed to more data, ML models refine
their understanding and become more accurate in their predictions or decisions.

➢Make predictions or decisions: Based on the patterns they've learned, ML algorithms can
predict future outcomes or make informed decisions.
FML by Melaku M
Potential Definitions for Machine Learning
Definition by Tom Mitchell (1998):

Machine Learning is the study of algorithms that:

• improve their performance P
• at some task T
• with experience E
A well−deﬁned learning task is given by <P, T, E>

FML by Melaku M
Defining the Learning Task
• Improve on task T, with respect to performance metric P, based on experience E
T: Recognizing and classifying hand-written words within images
P: Percentage of words correctly classified
E: Dataset of human-labeled images of handwritten words

T: Classify emails as legitimate or spam

P: Percentage of emails labeled correctly

E: Repository of emails, some with human-specified labels

Traditional Programming Vs Machine Learning

FML by Melaku M
When We Need Machine Learning

• Tasks requiring customization • Tasks involving big data

o Email filters o Genomics
o Personalized news o Internet search
o Personalized Tutoring o Anomaly detection
Tasks that adapt and customize themselves to Discover new knowledge from large databases
individual users.

•Tasks for which we don’t •Tasks for which it is challenging

have human expertise to specify our knowledge
o Space exploration o Facial recognition
o Undersea manipulation o Understanding speech
o Cellular robotics o Medical diagnosis
Too diﬃcult/expensive to construct manually
FML by Melaku M
Demystifying AI, ML, Deep Learning and Generative AI
Ability of a machine to imitate intelligent
human behavior. Broad (any intelligent
system)

Gives the computers "the ability to learn

without being explicitly programmed.

Uses neural networks with many layers

(hence "deep") to learn complex patterns
from large amounts of data.

Focused on creating content, such as text,

images, music, or code, by learning patterns
from existing data.

FML by Melaku M
Artificial intelligence (AI)-Broader concept

FML by Melaku M
DL: Automatically extracts features from raw data, reducing the need for manual feature engineering.
FML by Melaku M
State-of-the-Art Applications of Machine Learning

FML by Melaku M
Generative AI •ChatGPT: This LLMs has a foundation of GPT architecture
that generates text that resembles something a human
would produce. It's a helpful companion for research,
strategy, and content creation.
•DALL-E2: This model generates images from text prompts,
so creatives can create vibrant illustrations and concept art
that’s a useful accompaniment to content marketing.
•GitHub Copilot: This collaboration between GitHub and
OpenAI acts as a coding companion to help developers code
faster and more intuitively.
Figure: generative AI platforms
•Gemini: large language model chatbot, also known as a
conversational AI .
FML by Melaku M
Autonomous Cars

▪ Nevada made it legal for autonomous

cars to drive on roads in June 2011 .
▪ As of 2019, 37 states have enacted
legislation regarding autonomous cars.

Penn’s Autonomous Car à

FML by Melaku M
(Ben Franklin Racing Team)
Autonomous Cars

Path Planning

Adaptive Vision

FML by Melaku M
Deep Learning in the Headlines

15
FML by Melaku M
Deep Networks Learn Layered Representations
1980s-Era Neural Network Deep Neural Networks

FML by Melaku M
Image: https://www.pnas.org/content/116/4/1074
Object Recognition

FML by Melaku M
Image Translation- Sketch to Photo

FML by Melaku M
Image Synthesis – Image Inpainting

Image inpainting is essentially the art of filling in missing or damaged parts of an image.
E.g., Repair scratches, cracks, Take out unwanted objects, Fill in areas for artistic FML by Melaku M
purposes
NLP-Named Entity Recognition

FML by Melaku M
NLP: Text Generation

FML by Melaku M
NLP: Text Translation/Machine Translation

FML by Melaku M
Automatic Speech Recognition
A Typical Speech Recognition System

ML used to predict of phone states from the sound spectrogram

Deep learning has state-of-the-art results

# Hidden Layers 1 2 4 8 10 12

Word Error Rate % 16.0 12.8 11.4 10.9 11.0 11.1

Baseline GMM performance = 15.4%

[Zeiler et al. “On rectified linear units for speech
recognition” ICASSP 2013]
21 by Melaku M
FML
Recommendation Systems

FML by Melaku M
Machine learning is currently the preferred approach in
the following domains:

FML by Melaku M
Machine Learning pipeline

Figure: A standard Machine Learning pipeline FML by Melaku M

Types of Machine Learning

FML by Melaku M
FML by Melaku M
Supervised Learning
❖The model is trained on labeled data, meaning that the input
data is paired with the correct output.

❖Given (x1, y1), (x2, y2), ..., (xn, yn)

– Learn a function f(x) to predict y given x

– y is categorical == classification

– y is real-valued == regression

FML by Melaku M
Supervised Learning: Spam Detection
• This is a binary classification task:
Assign label (i.e. spam/not-spam) to the
input (an email message)

• Classification requires a model (a

classifier) to determine which label to
assign to input.

FML by Melaku M
Supervised Learning: Document Classification
• This is a multi-class classification task:
Assign label (i.e.Politics, Sports, Finance, Arts) to the input

Training Process Deployment

?
Machine
Learning Classifier Classifier
Algorithm predicted
model
label
labeled data new document

In this class, we study algorithms and techniques to learn such models from data
FML by Melaku M
Supervised Learning: Document Classification
• This idea generalizes to many types of data and applications

Data Labels
Documents Politics,Sports,Finance
Sentences Positive,Negative
Phrases Person,Location
Images Cat,Dog,Snake,Horse Re-
M edical records admit soon/Not
...

FML by Melaku M
Supervised Learning: Digit Recognition
What is a ‘2’? What is a ‘4’?

FML by Melaku M
Unsupervised Learning
❖ The model is trained on unlabeled dataset. The model aims to discover hidden
patterns, structures, or relationships within the data. Given x1, x2, ..., xn (without
labels)

❖ Output hidden structure behind the x’s

– E.g., clustering

FML by Melaku M
Genes
Unsupervised Learning

Individuals
Genomics application:
Social network analysis
group individuals by genetic similarity Finding image similarity

Image credit: NASA/JPL-Caltech/E. Churchwell (Univ. of Wisconsin, Madison)

Customer Segmentation Astronomical data analysis 33 FML by Melaku M

Unsupervised Learning
• Independent component analysis – separate a combined signal
into its original sources

FML by Melaku M
Image credit: statsoft.com Audio from http://www.ism.ac.jp/~shiro/research/blindsep.html
Unsupervised Learning
• Independent component analysis – separate a
combined signal into its original sources

FML by Melaku M
Image credit: statsoft.com Audio from http://www.ism.ac.jp/~shiro/research/blindsep.html
Semi-supervised Learning:
❖Definition: Combines a small amount of labeled data with large amounts
of unlabeled data.

❖This approach combines aspects of supervised and unsupervised learning.

✓ It uses both labeled and unlabeled data for training.
✓ This can be useful when labeling data is expensive/costly or time-consuming
(e.g., medical imaging, speech recognition)..

❖ Common Applications:
✓ Medical imaging (e.g., labeling diseases in X-rays with limited labeled data).
✓ Speech recognition (e.g., learning from a few transcribed audio clips).
FML by Melaku M
Reinforcement Learning
❖ Reinforcement learning (RL): involves learning through interaction with an
environment. An agent learns to take actions in an environment to maximize a
reward signal.

❖ The agent learns through trial and error, receiving rewards or penalties for its
actions. The goal is to maximize the cumulative reward.

FML by Melaku M
The Agent-Environment Interface

FML by Melaku M
FML by Melaku M
Main Challenges of Machine Learning
•In short, since your main task is to select a learning algorithm and
train it on some data, the two things that can go wrong are 1) "bad
data" and 2) "bad algorithm".

1. Dataset(training data)

FML by Melaku M
Main Challenges of Machine Learning - Dataset

1- Insufficient Quantity of Training Data :

•Machine Learning takes a lot of data for most Machine Learning

algorithms to work properly. Even for very simple problems you
typically need thousands of examples, and for complex problems
such as image or speech recognition you may need millions of
examples (unless you can reuse parts of an existing model).

FML by Melaku M
Main Challenges of Machine Learning - Dataset

2) Non-representative Training Data:

•In order to generalize well, it is crucial that your training data be

representative of the new cases you want to generalize to. This is
true whether you use instance based learning or model-based
learning.

FML by Melaku M
Main Challenges of Machine Learning - Dataset

3) Poor-Quality Data:

•If your training data is full of errors, outliers, and noise (e.g., due
to poor quality measurements), it will make it harder for the system
to detect the underlying patterns, so your system is less likely to
perform well. It is often well worth the effort to spend time cleaning
up your training data. The truth is, most data scientists spend a
significant part of their time doing just that.

FML by Melaku M
Main Challenges of Machine Learning - Dataset
4 Irrelevant Features:

•Your system will only be capable of learning if the training data

contains enough relevant features and not too many irrelevant ones. A
critical part of the success of a Machine Learning project is coming up
with a good set of features to train on.

• ..,

FML by Melaku M
Main Challenges of Machine Learning - Algorithm
1) Overfitting the Training Data:
❖ Overfitting happens when a model learns the detail and noise in the training
data to the extent that it negatively impacts the performance of the model
on new data.

❖ The model performs well on the training data, but it does not generalize
well.

FML by Melaku M
Main Challenges of Machine Learning - Algorithm
2) Underfitting the Training Data:
✓ Underfitting is the opposite of overfitting: it occurs when your
model is too simple to learn the underlying structure of the data.

Other ML challenges include:

❖ Interpretability: Some models (e.g., deep learning) are often considered "black boxes," making
their decisions difficult to explain.

❖ Computational Resources: Training complex models can require significant time and resources.

❖Bias and Fairness: Models can inherit biases present in the data.
FML by Melaku M
ML in Practice
Designing a Learning System
• Understand domain, prior knowledge, and goals
• Choose the training experience and what is to be
learned
Loop – i.e. the target function
• Data integration, selection, cleaning, pre-processing, etc.
• Learn models
• Choose a learning algorithm to infer the target
function from the experience
• Interpret results
• Consolidate and deploy discovered knowledge FML by Melaku M
FML by Melaku M

Unit 1: Shobana T S Assistant Professor Dept. of ISE, BMSCE
No ratings yet
Unit 1: Shobana T S Assistant Professor Dept. of ISE, BMSCE
114 pages
Machine Learning PPT For Students
70% (10)
Machine Learning PPT For Students
18 pages
ML Merged
No ratings yet
ML Merged
433 pages
21AI63 Module 1
No ratings yet
21AI63 Module 1
38 pages
Introduction To Machine Learning: WWW - Seas.upenn - Edu/ Cis519
100% (1)
Introduction To Machine Learning: WWW - Seas.upenn - Edu/ Cis519
51 pages
ML - Unit I - Final
No ratings yet
ML - Unit I - Final
132 pages
ML - Unit - 1 (24-25)
No ratings yet
ML - Unit - 1 (24-25)
43 pages
ML m1-m5 NOTES
No ratings yet
ML m1-m5 NOTES
160 pages
Introduction To Machine Learning
No ratings yet
Introduction To Machine Learning
25 pages
Norvig Google ESTF2019
No ratings yet
Norvig Google ESTF2019
71 pages
Deep Reinforcement Learning
100% (1)
Deep Reinforcement Learning
410 pages
Lecture Compiled
No ratings yet
Lecture Compiled
224 pages
MLUnit 1
No ratings yet
MLUnit 1
131 pages
Lecture 1
No ratings yet
Lecture 1
43 pages
AML All Merged PDF Class 1 To 8
No ratings yet
AML All Merged PDF Class 1 To 8
423 pages
U1 ML Intro and Applications
No ratings yet
U1 ML Intro and Applications
123 pages
BE02000041 Funda of AI Unit 3 Basics of ML
No ratings yet
BE02000041 Funda of AI Unit 3 Basics of ML
86 pages
Intro To ML - 1
No ratings yet
Intro To ML - 1
29 pages
Elements of Machine Learning
No ratings yet
Elements of Machine Learning
116 pages
IDS Unit 1 Notes
No ratings yet
IDS Unit 1 Notes
24 pages
Machine Learning
No ratings yet
Machine Learning
42 pages
Unit 1&2
No ratings yet
Unit 1&2
270 pages
IntroToMachineLearning - 25-07-2019
No ratings yet
IntroToMachineLearning - 25-07-2019
37 pages
Lec01 Introduction
No ratings yet
Lec01 Introduction
29 pages
1 - ML Intro 24
No ratings yet
1 - ML Intro 24
26 pages
Unit 3
No ratings yet
Unit 3
80 pages
01 Introduction ML
No ratings yet
01 Introduction ML
48 pages
L1 Overview
No ratings yet
L1 Overview
28 pages
ENG6500 1 IntroductionToMLDL Part1
No ratings yet
ENG6500 1 IntroductionToMLDL Part1
74 pages
MLT Uint1
No ratings yet
MLT Uint1
26 pages
Overview of Machine Learning
No ratings yet
Overview of Machine Learning
60 pages
Unit 1
No ratings yet
Unit 1
62 pages
Chap 1
No ratings yet
Chap 1
56 pages
AI321: Theoretical Foundations of Machine Learning: Dr. Motaz El-Saban
No ratings yet
AI321: Theoretical Foundations of Machine Learning: Dr. Motaz El-Saban
44 pages
First Cut Draft LS1.1
No ratings yet
First Cut Draft LS1.1
12 pages
Introduction To Machine Learning
100% (1)
Introduction To Machine Learning
11 pages
ML Lecture#1
No ratings yet
ML Lecture#1
52 pages
Machine Learning: Professional CORE (CET3006B) T. Y. B.Tech CSE
No ratings yet
Machine Learning: Professional CORE (CET3006B) T. Y. B.Tech CSE
106 pages
ENG6500 1 IntroductionToMLDL Part1
No ratings yet
ENG6500 1 IntroductionToMLDL Part1
63 pages
I MSC DS ML Notes
No ratings yet
I MSC DS ML Notes
109 pages
Chapter 1
No ratings yet
Chapter 1
62 pages
Lecture 1 - Introduction
No ratings yet
Lecture 1 - Introduction
49 pages
Lecture 01 - Introduction To AML-Jan24
No ratings yet
Lecture 01 - Introduction To AML-Jan24
66 pages
Unit-1 MLT
No ratings yet
Unit-1 MLT
51 pages
01 Introduction
No ratings yet
01 Introduction
43 pages
CE469 - Introduction To Machine Learning: Lecturer Contact
No ratings yet
CE469 - Introduction To Machine Learning: Lecturer Contact
33 pages
Machine Learning: Introducing
No ratings yet
Machine Learning: Introducing
18 pages
1 Introduction
No ratings yet
1 Introduction
24 pages
Module 1
No ratings yet
Module 1
34 pages
Introduction
No ratings yet
Introduction
18 pages
Introduction To Machine Learning
No ratings yet
Introduction To Machine Learning
24 pages
Chapter 1 - Introduction
No ratings yet
Chapter 1 - Introduction
28 pages
Lec 1,2
No ratings yet
Lec 1,2
69 pages
Lecture1 PDF
No ratings yet
Lecture1 PDF
37 pages
Introduction To Machine Learning
No ratings yet
Introduction To Machine Learning
16 pages
Machine Learning With Python Programming: - Presentation by Uplatz - Contact Us: - Email: - Phone
No ratings yet
Machine Learning With Python Programming: - Presentation by Uplatz - Contact Us: - Email: - Phone
22 pages
Machine Learning and Soft Computing: CSCC53 Mca V Sem 2020
No ratings yet
Machine Learning and Soft Computing: CSCC53 Mca V Sem 2020
33 pages
Introduction To AI and ML - Day 1: Gururajan Narasimhan Erode
No ratings yet
Introduction To AI and ML - Day 1: Gururajan Narasimhan Erode
39 pages
De Florio PHD Thesis
No ratings yet
De Florio PHD Thesis
142 pages
L21 Intro ML
No ratings yet
L21 Intro ML
30 pages
Big Data - SRM University PDF
No ratings yet
Big Data - SRM University PDF
29 pages
Computer Science & Engineering: Apex Institute of Technology
No ratings yet
Computer Science & Engineering: Apex Institute of Technology
13 pages
ML Practical File
100% (2)
ML Practical File
43 pages
Parkinsons Disease Pase 1
No ratings yet
Parkinsons Disease Pase 1
17 pages
Chatbot For Mental Well-Being
100% (1)
Chatbot For Mental Well-Being
5 pages
Dissertation Means in Bengali
100% (2)
Dissertation Means in Bengali
7 pages
Campagnucci, F. (2025) - Artificial Intelligence For Participation Brazil. Policy Brief
No ratings yet
Campagnucci, F. (2025) - Artificial Intelligence For Participation Brazil. Policy Brief
26 pages
Practical Aspects of Deep Learning PI
No ratings yet
Practical Aspects of Deep Learning PI
46 pages
2023 Scopus Enhanced Road Damage Detection
No ratings yet
2023 Scopus Enhanced Road Damage Detection
11 pages
CSC 422 522 001
No ratings yet
CSC 422 522 001
8 pages
AI Question Bank KCS - 071
No ratings yet
AI Question Bank KCS - 071
86 pages
Awh 23163strategic Redo
No ratings yet
Awh 23163strategic Redo
23 pages
NOTICE - 2026 Batch Kapture CX B.Tech (CSE - AIML & AIML) Register by 4
No ratings yet
NOTICE - 2026 Batch Kapture CX B.Tech (CSE - AIML & AIML) Register by 4
2 pages
Ker As Tutorial
No ratings yet
Ker As Tutorial
33 pages
A Deep Learning-Based Experiment On Forest Wildfire Detection in Machine Vision Course
No ratings yet
A Deep Learning-Based Experiment On Forest Wildfire Detection in Machine Vision Course
11 pages
JD - MTS (Data Science)
No ratings yet
JD - MTS (Data Science)
2 pages
Indoor Plants
No ratings yet
Indoor Plants
8 pages
From Pulse To Prescription: Exploring The Rise of AI in Medicine and Its Implications
No ratings yet
From Pulse To Prescription: Exploring The Rise of AI in Medicine and Its Implications
18 pages
A Deep Neuro-Fuzzy Network For Image Classification
No ratings yet
A Deep Neuro-Fuzzy Network For Image Classification
10 pages
Interdisciplinary Project Using Federated Learning For Synthetic Data Generation in The Medical Domain Iva Pezo
No ratings yet
Interdisciplinary Project Using Federated Learning For Synthetic Data Generation in The Medical Domain Iva Pezo
11 pages
Predicting Gold Prices: Megan Potoski
No ratings yet
Predicting Gold Prices: Megan Potoski
5 pages
COMP3010 Machine Learning Trimester 1 2025 Dubai Intern'l Academic City INT
No ratings yet
COMP3010 Machine Learning Trimester 1 2025 Dubai Intern'l Academic City INT
13 pages
DSA210 2025spring Syllabus-3
No ratings yet
DSA210 2025spring Syllabus-3
4 pages
Efficient Machine Learning On Edge Computing Through Data Compression Techniques
No ratings yet
Efficient Machine Learning On Edge Computing Through Data Compression Techniques
10 pages
AIML Projectsynopsis Format 2024-25
No ratings yet
AIML Projectsynopsis Format 2024-25
4 pages
Ai Paper 5
No ratings yet
Ai Paper 5
8 pages
Ay 2023 - 24
No ratings yet
Ay 2023 - 24
5 pages
Mask CTC: Non-Autoregressive End-to-End ASR With CTC and Mask Predict
No ratings yet
Mask CTC: Non-Autoregressive End-to-End ASR With CTC and Mask Predict
6 pages
Beyond Silicon
From Everand
Beyond Silicon
Piyush yadav
5/5 (1)
Machine Learning: Adaptive Behaviour Through Experience: Thinking Machines
From Everand
Machine Learning: Adaptive Behaviour Through Experience: Thinking Machines
alasdair gilchrist
4.5/5 (5)

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

Chapter 1 - Machine Learning Fundamentals

Uploaded by

Chapter 1 - Machine Learning Fundamentals

Uploaded by

Fundamentals of Machine

Target Group: G3SEng

❖ Essentially, it allows computers to:

Machine Learning is the study of algorithms that:

T: Classify emails as legitimate or spam

P: Percentage of emails labeled correctly

E: Repository of emails, some with human-specified labels

• Tasks requiring customization • Tasks involving big data

•Tasks for which we don’t •Tasks for which it is challenging

Gives the computers "the ability to learn

Uses neural networks with many layers

Focused on creating content, such as text,

▪ Nevada made it legal for autonomous

Penn’s Autonomous Car à

ML used to predict of phone states from the sound spectrogram

Deep learning has state-of-the-art results

Word Error Rate % 16.0 12.8 11.4 10.9 11.0 11.1

Baseline GMM performance = 15.4%

Figure: A standard Machine Learning pipeline FML by Melaku M

❖Given (x1, y1), (x2, y2), ..., (xn, yn)

– Learn a function f(x) to predict y given x

• Classification requires a model (a

Training Process Deployment

❖ Output hidden structure behind the x’s

Image credit: NASA/JPL-Caltech/E. Churchwell (Univ. of Wisconsin, Madison)

Customer Segmentation Astronomical data analysis 33 FML by Melaku M

❖This approach combines aspects of supervised and unsupervised learning.

1- Insufficient Quantity of Training Data :

•Machine Learning takes a lot of data for most Machine Learning

2) Non-representative Training Data:

•In order to generalize well, it is crucial that your training data be

•Your system will only be capable of learning if the training data

Other ML challenges include:

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.