EE2211 Introduction To Machine Learning: Semester 1 2020/2021
EE2211 Introduction To Machine Learning: Semester 1 2020/2021
Learning
Lecture 1
Semester 1
2020/2021
Li Haizhou (haizhou.li@nus.edu.sg)
Acknowledgement:
EE2211 development team
(Thomas, Kar-Ann, Chen Khong, Helen, Robby and Haizhou)
• Lectures, tutorials, quiz (mid-term) and final exam are conducted online
• Important dates
– A1 released in Week 4 on on 31 Aug (Monday) and submitted in 3 weeks
– A2 released in Week 6 on 14 Sept (Monday) and submitted in 4 weeks
– A3 released in Week 10 on 19 Oct (Monday) and submitted in 4 weeks
– Quiz (mid-term) will be on 25 Sept (Friday)
References
• [Book1] Andriy Burkov, “The Hundred-Page Machine Learning Book”,
2019. (read first, buy later: http://themlbook.com/wiki/doku.php)
• [Book2] Andreas C. Muller and Sarah Guido, “Introduction to Machine
Learning with Python: A Guide for Data Scientists”, O’Reilly Media, Inc.,
2017.
• [Book3] Jeff Leek, “The Elements of Data Analytic Style: A guide for
people who want to analyze data”, Lean Publishing, 2015.
5
© Copyright EE, NUS. All Rights Reserved.
2001: A Space Odyssey
HAL listens, talks, sings, reads lips, plays chess, and solves problems !
https://www.bbc.com/news/technology-35785875
https://www.businessinsider.com/googles-alphago-made-
artifical-intelligence-history-2016-3
8
© Copyright EE, NUS. All Rights Reserved.
What is Machine Learning?
❑ Machine learning
➢ is a subfield of computer science that is concerned with
building algorithms which, to be useful, rely on a
collection of examples of some phenomenon. - Andriy
Burkov
✓These examples can come from nature, be handcrafted by
humans or generated by another algorithm.
Discrete
• Reinforcement Learning Classification Clustering
discrete
Continuous
x Classification y Dimensionality
Regression
continuous Reduction
x Regression y
Training
Apple
Orange
Model Testing
This is an orange
𝑥1
𝑇
𝐱 𝑖 = ⁞ or 𝐱 𝑖 = 𝑥1 , … , 𝑥𝑗 , … , 𝑥𝐷 , i = 1, . . . , N
𝑖
𝑥𝐷 𝑖
– Each element 𝐱 𝑖 among N is called a feature vector.
• A feature vector is a vector in which each dimension j = 1, . .
. , D contains a value that describes the example somehow.
y 1-Dimensional Case
“0” “1”
0 x
(e.g., repeated word count)
(1D view)
Decision line
(threshold)
(age)
x 2 2-Dimensional Case
Malignant (harmful)
Benign (not harmful)
x 1 (tumor size)
(price)
y
x
(size in meter square)
Training Test
𝑀 𝑀 𝑁 𝑁
𝐱𝑖 𝑖=1 𝑦𝑖 𝑖=1 𝐱𝑘 𝑘=1 𝑦𝑘 𝑘=1
Model Model
Data Known Data Predicted
Parameters to label Learned label
learn parameters
Goal: to learn the model’s Goal: to predict the label of
parameters from the given novel data 𝐱 𝑘 𝑁
𝑘=1 using the
data and labels 𝐱 𝑖 , 𝑦𝑖 𝑀
𝑖=1 learned parameters
source: SUTD
Training
I found two
types of fruits!
(age)
x2 2-Dimensional Case
Malignant (harmful)
Benign (not harmful)
x1 (tumor size)
(age)
x2 2-Dimensional Case
x1 (tumor size)
x1 (tumor size)
x1 (tumor size)
https://en.wikipedia.org/wiki/Social_network_analysis#/media/File:Kencf0618FacebookNetwork.jpg
Labelled +
Unlabelled data
Labelled data Unlabelled data
+
Typically plenty of
unlabelled data
Learning Model
action
Environment
Agent
S1 S2
reward
Type of inferences
Example
Inductive Deductive
• To reach probable conclusions. • To reach logical conclusions
• All needed information is unavailable or deterministically: all information that can
unknown, causing uncertainty in the conclusions lead to the correct conclusion is available