0% found this document useful (0 votes)

13 views

01 Intro

This document provides an introduction to deep learning and artificial intelligence. It discusses how deep learning aims to learn from experience and understand the world in terms of hierarchies of concepts built upon each other. Previous approaches to AI like knowledge bases and machine learning focused on formal rules or extracting patterns from raw data, but deep learning learns representations of the data. The performance of machine learning depends greatly on the representation, and deep learning aims to learn representations from data as well. The document provides a brief history of artificial intelligence and an overview of the organization of the book.

Uploaded by

Niranjan Pandey

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

13 views

01 Intro

Uploaded by

Niranjan Pandey

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 45

Introduction

Lecture slides for Chapter 1 of Deep Learning

www.deeplearningbook.org
Ian Goodfellow

Adapted by: m.n. for CMPS 392

Introduction
• Inventors have long dreamed of creating machines that think

• Today, artificial intelligence(AI) is a thriving field with many

practical applications and active research topics.

• The field rapidly tackled and solved problems that are intellectually
difficult for human beings but relatively straightforward for
computers
q Mathematical rules

• The true challenge to artificial intelligence proved to be solving the

tasks that are easy for people to perform but hard for people to
describe formally
q Recognizing spoken words or faces in images.

(Goodfellow 2016)
Deep learning
• Learn from experience
• understand the world in terms of a hierarchy of
concepts
q each concept defined in terms of its relation to
simpler concepts.
• If we draw a graph showing how these concepts are
built on top of each other, the graph is deep, with
many layers.
q For this reason, we call this approach to AI deep
learning.

(Goodfellow 2016)
Computer vs. Human
• IBM’s Deep Blue chess-playing system
defeated world champion Garry Kasparov in
1997
q Chess is of course a very simple world!

• A person’s everyday life requires an

immense amount of knowledge about the
world.
q Much of this knowledge is subjective and
intuitive, and therefore difficult to
articulate in a formal way.

• Computers need to capture this same

knowledge in order to behave in an
intelligent way.

(Goodfellow 2016)
Previous AI appraches
• Knowledge base: A computer can reason about
statements in a formal language automatically using
logical inference rules.
• Cyc failed to understand a story about a person
named Fred shaving in the morning
q people do not have electrical parts,
q but Fred was holding an electric razor
q “FredWhileShaving” contained electrical parts.
q Cyc asked whether Fred was still a person while
he was shaving!

(Goodfellow 2016)
Machine learning
• Extracting patterns from raw data.
q A simple machine learning algorithm called
logistic regression can determine whether to
recommend cesarean delivery
q A simple machine learning algorithm called naive
Bayes can separate legitimate e-mail from spam
e-mail.

• The performance of these simple machine learning

algorithms depends heavily on the representation of
the data they are given.
(Goodfellow 2016)
Features
• Each piece of information included in the
representation of the patient is known as a feature.

• Logistic regression learns how each of these

features of the patient correlates with various
outcomes

• the choice of representation has an enormous effect

on the performance of machine learning algorithms

(Goodfellow 2016)
Representations Matter

Figure 1.1 (Goodfellow 2016)

Representation Learning
• However, for many tasks, it is difficult to know what
features should be extracted.
q For example, suppose that we would like to write
a program to detect cars in photographs.
q We know that cars have wheels,
q But how to describe exactly what a wheel looks
like in terms of pixel values?
• We need to discover not only the mapping from
representation to output
q but also the representation itself.

(Goodfellow 2016)
Autoencoders
• An autoencoder is the combination of an encoder
function that converts the input data into a different
representation,

• and a decoder function that converts the new

representation back into the original format.

• Autoencoders are trained to preserve as much

information

• but are also trained to make the new representation

have various nice properties.

(Goodfellow 2016)
Factors of variation
• When analyzing an image of a car, the factors of
variation include the position of the car, its color,
and the angle and brightness of the sun.

• When analyzing a speech recording, the factors of

variation include the speaker’s age, their sex, their
accent and the words that they are speaking

• How to disentangle the factors of variation and

discard the ones that we do not care about?

(Goodfellow 2016)
Depth: Repeated Composition

Figure 1.2 (Goodfellow 2016)

Multilayer perceptron (MLP)
• A multilayer perceptron is just a mathematical
function mapping some set of input values to output
values.

• The function is formed by composing many simpler

functions.

• We can think of each application of a different

mathematical function as providing a new
representation of the input.

(Goodfellow 2016)
Multi-step computer program
• Another perspective is that depth allows the
computer to learn a multi-step computer program.
• Each layer of the representation can be thought of
as the state of the computer’s memory after
executing another set of instructions in parallel
• Networks with greater depth can execute more
instructions in sequence.
• Sequential instructions offer great power because
later instructions can refer back to the results of
earlier instructions.

(Goodfellow 2016)
Computational Graphs
Logistic regression: p y = 1 x ; 𝜽) =σ(𝜽T x).

Figure 1.3 (Goodfellow 2016)

Notion of depth
• Depth is the length of the longest path from input
to output but depends on the definition of what
constitutes a possible computational step.
q If we use addition, multiplication and logistic
sigmoids as the elements of our computer
language, then this model has depth three.
q If we view logistic regression as an element itself,
then this model has depth one.

(Goodfellow 2016)
Deep learning vs. machine
learning
• Deep learning is:
q An approach to AI
q A type of machine learning
q a technique that allows computer systems to
improve with experience and data
q can safely be regarded as the study of models
that either involve a greater amount of
composition of learned functions or learned
concepts than traditional machine learning does.

(Goodfellow 2016)
Machine Learning and AI
Machine learning can
operate in
complicated, real-
world environments

Deep learning is a
particular kind of
machine learning
that achieves great
power and flexibility

Figure 1.4 (Goodfellow 2016)

Learning Multiple Components
Figure 1.5

(Goodfellow 2016)
Organization of the Book
Figure 1.6

(Goodfellow 2016)
Who should take this
course?
• University students (undergraduate or graduate)
q If you want to begin a career in deep learning and
artificial intelligence research
q If you want to work as software engineer and want to
rapidly acquire machine learning background and
begin using deep learning in your product or platform.

• Applications:
q computer vision, speech and audio processing,
natural language processing, robotics, bioinformatics
and chemistry, video games, search engines, online
advertising and finance.

(Goodfellow 2016)
Prerequisities
• We do assume that all readers come from a
computer science background.

• We assume familiarity with

q programming,
q a basic understanding of computational
performance issues, complexity theory,
q introductory level calculus
q and some of the terminology of graph theory.

(Goodfellow 2016)
Deep learning history
• DL has had a long and rich history, but has gone by
many names reflecting different philosophical
viewpoints, and has waxed and waned in popularity.
• DL has become more useful as the amount of
available training data has increased.
• DL models have grown in size over time as
computer infrastructure (both hardware and
software).
• DL has solved increasingly complicated applications
with increasing accuracy over time.

(Goodfellow 2016)
History
• Three waves of development of deep learning:
q Cybernetics in the 1940s–1960s
q Connectionism in the 1980s–1990s
q Deep learning starting 2006

• Artificial neural networks (ANNs): engineered systems

inspired by the biological brain
q the brain provides a proof by example that intelligent
behavior is possible
q ANNs can help understanding the brain and the principles
that underlie human intelligence
• Current deep learning frameworks are not necessarily
neurally inspired

(Goodfellow 2016)
Historical Waves

Figure 1.7 (Goodfellow 2016)

Perceptron (Rosenblatt,
1958, 1962)
• These models were designed to take a set of n input
values x1, . . . , xn and associate them with an output y.

• These models would learn a set of weights w1, … , wn

and compute their output

𝑓 𝑥, 𝑤 = 𝑥! 𝑤! + ⋯ + 𝑥" 𝑤"

Class is sign (f(x,w))

• The adaptive linear element (ADALINE) simply returned

the value of f (x) itself to predict a real number (Widrow
and Hoff, 1960)

(Goodfellow 2016)
Linear models
(e.g. Perceptron, Adaline)
• The training algorithm used to adapt the weights of
the ADALINE was a special case of an algorithm
called stochastic gradient descent.
• Linear models have many limitations. Most
famously, they cannot learn the XOR function,
where 𝑓 ([0, 1], 𝑤) = 1 and 𝑓([1, 0], 𝑤) =
1 but 𝑓 ([1, 1], 𝑤) = 0 and 𝑓 ([0, 0], 𝑤) = 0.
• Critics who observed these flaws in linear models
caused a backlash against biologically inspired
learning in general (Minsky and Papert, 1969).

(Goodfellow 2016)
Neuroscience
• Neuroscience has given us a reason to hope that a
single deep learning algorithm can solve many
different tasks.

• Neuroscientists have found that ferrets can learn to

“see” with the auditory processing region of their
brain if their brains are rewired to send visual
signals to that area (Von Melchner et al., 2000).

• Today, we simply do not have enough information

about the brain to use it as a guide.

(Goodfellow 2016)
Connectionism
• Distributed representation (Hinton et al., 1986)
q Each input to a system should be represented by
many features,
q and each feature should be involved in the
representation of many possible inputs.
q Example: shape vs. color

• Backpropagation: (Rumelhart et al., 1986; LeCun,

1987).
q currently the dominant approach to training deep
models.

(Goodfellow 2016)
Second winter
• Ambitious claims while seeking investments.

• other fields of machine learning made advances.

Kernel machines (Boser et al., 1992; Cortes and
Vapnik, 1995; Schölkopf et al., 1999) and graphical
models (Jordan, 1998)
q These two factors led to a decline in the
popularity of neural networks that lasted until
2006-2007.

(Goodfellow 2016)
Third wave
• Researchers showed that they were able to train deeper
neural networks than had been possible before, and
focused attention on the theoretical importance of depth

• We have the computational resources to run much

larger models today.

• As of 2016, a rough rule of thumb is that a supervised

deep learning algorithm will generally achieve
acceptable performance with around 5,000 labeled
examples per category, and will match or exceed
human performance when trained with a dataset
containing at least 10 million labeled examples.

(Goodfellow 2016)
Historical Trends: Growing
Datasets

Figure 1.8 (Goodfellow 2016)

Historical Trends: Increasing
model sizes
• faster CPUs,

• the advent of general purpose GPUs,

• faster network connectivity,

• better software infrastructure for distributed

computing.

(Goodfellow 2016)
Connections per Neuron

Figure 1.10 (Goodfellow 2016)

Number of Neurons

Figure 1.11 (Goodfellow 2016)

The MNIST Dataset
the
drosophila
of machine
learning

Figure 1.9 (Goodfellow 2016)

Increasing Accuracy, and
Real-World Impact
• A dramatic moment in the meteoric rise of deep
learning came when a convolutional network won
ILSVRC challenge for the first time and by a wide
margin, bringing down the state-of-the-art top-5
error rate from 26.1% to 15.3% (Krizhevsky et al.,
2012),
q Since then, these competitions are consistently
won by deep convolutional nets
• The introduction of deep learning to speech
recognition resulted in a sudden drop of error rates,
with some error rates cut in half.

(Goodfellow 2016)
Solving Object Recognition

Figure 1.12 (Goodfellow 2016)

Increasing complexity
• Deep networks have also had spectacular
successes for pedestrian detection and image
segmentation
q and yielded superhuman performance in traffic
sign classification

• neural networks could learn to output an entire

sequence of characters transcribed from an image,
rather than just identifying a single object.

(Goodfellow 2016)
Other applications
• Recurrent neural networks, such as the LSTM sequence
model are now used to model relationships between
sequences and other sequences rather than just fixed
inputs.
• In the context of reinforcement learning, an autonomous
agent must learn to perform a task by trial and error,
without any guidance from the human operator.
q DeepMind demonstrated that a deep reinforcement
learning system is capable of learning to play Atari
video games, reaching human-level performance
q Deep learning has also significantly improved the
performance of reinforcement learning for robotics

(Goodfellow 2016)
Companies and tools
• Google, Microsoft, Facebook, IBM, Baidu,
Apple, Adobe, Netflix, NVIDIA and NEC.

• Competition and Convergence of Deep

Learning Libraries:
q TensorFlow 2.0
q PyTorch 1.3

Python 2 support ended on Jan 1, 2020.

>>> print “Goodbye World”
(Goodfellow 2016)
Turing award
• Yann LeCun

• Geoffrey Hinton

• Yoshua Bengio

Turing Award given for:

“The conceptual and engineering

breakthroughs that have made
deep neural networks a critical
component of computing.”
(Goodfellow 2016)
Online courses
• Fast.ai: Practical Deep Learning for Coders
q Jeremy Howard et al.

• Stanford CS231n: Convolutional Neural Networks for Visual

Recognition

• Stanford CS224n: Natural Language Processing with Deep

Learning
• Deeplearning.ai (Coursera): Deep Learning
q Andrew Ng

• Reinforcement Learning
q David Silver: Introduction to Reinforcement Learning
q OpenAI: Spinning Up in Deep RL

(Goodfellow 2016)
Summary
• Deep learning is an approach to machine learning that has
drawn heavily on our knowledge of the human brain,
statistics and applied math as it developed over the past
several decades.
• In recent years, it has seen tremendous growth in its
popularity and usefulness, due in large part to more
q powerful computers,
q larger datasets and
q techniques to train deeper networks.

• The years ahead are full of challenges and opportunities to

improve deep learning even further and bring it to new
frontiers.

(Goodfellow 2016)
Watch
• https://www.youtube.com/watch?v=vi7lACKOUao

• https://www.youtube.com/watch?v=0VH1Lim8gL8

(Goodfellow 2016)

Trackpad Pro Ver. 5.0 Class 6: WINDOWS 11 & MS OFFICE 2021
From Everand
Trackpad Pro Ver. 5.0 Class 6: WINDOWS 11 & MS OFFICE 2021
Nidhi Arora
No ratings yet
Computational Beauty of Nature
0% (2)
Computational Beauty of Nature
7 pages
Unit - 1 Deep Learning 3-2
No ratings yet
Unit - 1 Deep Learning 3-2
15 pages
Unit I - Fundamentals of DL
No ratings yet
Unit I - Fundamentals of DL
41 pages
Fundamental_Deep learning
No ratings yet
Fundamental_Deep learning
69 pages
DEEP LEARNING
No ratings yet
DEEP LEARNING
22 pages
ITR Roll No.20
No ratings yet
ITR Roll No.20
3 pages
Module1_ Deep Learning
No ratings yet
Module1_ Deep Learning
26 pages
Deep Learning Midsem Merged Previous Batch
No ratings yet
Deep Learning Midsem Merged Previous Batch
423 pages
Unit - 1 Deep Learning Techniques
No ratings yet
Unit - 1 Deep Learning Techniques
18 pages
JNTUK R20 B.Tech CSE 4-1 Deep Learning Techniques Unit 1 Notes
No ratings yet
JNTUK R20 B.Tech CSE 4-1 Deep Learning Techniques Unit 1 Notes
15 pages
Jntuk r20 Unit-I Deep Learning Techniques (WWW - Jntumaterials.co - In)
No ratings yet
Jntuk r20 Unit-I Deep Learning Techniques (WWW - Jntumaterials.co - In)
23 pages
DL Unit 1
No ratings yet
DL Unit 1
200 pages
Deep Learning Introduction
No ratings yet
Deep Learning Introduction
5 pages
DNN Merged Sugata
No ratings yet
DNN Merged Sugata
243 pages
Introduction to Deep Learning
No ratings yet
Introduction to Deep Learning
37 pages
Lec 01 Introduction
No ratings yet
Lec 01 Introduction
98 pages
Deep Learning Algorithms and Architectures
No ratings yet
Deep Learning Algorithms and Architectures
26 pages
AD3501-DL-UNIT 1 NOTES
No ratings yet
AD3501-DL-UNIT 1 NOTES
43 pages
Deep Learning Unit-II
No ratings yet
Deep Learning Unit-II
19 pages
Neural Networks1
No ratings yet
Neural Networks1
164 pages
Deep Learning
No ratings yet
Deep Learning
100 pages
Lecture 12 - Deep Learning
No ratings yet
Lecture 12 - Deep Learning
25 pages
AD3501-DL-Unit 1 Notes
No ratings yet
AD3501-DL-Unit 1 Notes
43 pages
Lec 1
No ratings yet
Lec 1
30 pages
Lecun2015
No ratings yet
Lecun2015
9 pages
AA12_Deep_Learning_2024 (1)
No ratings yet
AA12_Deep_Learning_2024 (1)
30 pages
unit-3 NNDL
No ratings yet
unit-3 NNDL
22 pages
chapter 1
No ratings yet
chapter 1
6 pages
DL Module 1 - CS-1 Fundamentals of Neural Network
No ratings yet
DL Module 1 - CS-1 Fundamentals of Neural Network
81 pages
Lect 4-Introduction to Deep Learning
No ratings yet
Lect 4-Introduction to Deep Learning
33 pages
BMG5109 Winter 2025 Data Science for Enginners Lecture Note 4 ML and DL ANN I
No ratings yet
BMG5109 Winter 2025 Data Science for Enginners Lecture Note 4 ML and DL ANN I
63 pages
Session 2 ANN 2024
No ratings yet
Session 2 ANN 2024
29 pages
Dl All Units Materials
No ratings yet
Dl All Units Materials
138 pages
Unit -1 Deep Learning
No ratings yet
Unit -1 Deep Learning
26 pages
Lecture 1,2,3 - Module 1 - ML Vs DL
No ratings yet
Lecture 1,2,3 - Module 1 - ML Vs DL
26 pages
MVDAFT Final
No ratings yet
MVDAFT Final
30 pages
Deep Learning
100% (3)
Deep Learning
32 pages
Insidedeeplearning Preview
No ratings yet
Insidedeeplearning Preview
5 pages
Unit-3 Notes
No ratings yet
Unit-3 Notes
16 pages
Lecture Slides For Chapter 1 of Deep Learning Ian Goodfellow 2016-09-26
No ratings yet
Lecture Slides For Chapter 1 of Deep Learning Ian Goodfellow 2016-09-26
13 pages
DL Sessional 1
No ratings yet
DL Sessional 1
301 pages
Deep Learning
100% (1)
Deep Learning
21 pages
Deep Learning Unit1
No ratings yet
Deep Learning Unit1
126 pages
Unit-3
No ratings yet
Unit-3
16 pages
Unit I
No ratings yet
Unit I
10 pages
DNN - 1 - M1 - Fundamentals of Neural Network
No ratings yet
DNN - 1 - M1 - Fundamentals of Neural Network
95 pages
Deep Learnig
No ratings yet
Deep Learnig
16 pages
unit-2
No ratings yet
unit-2
19 pages
Artificial neural network course slides
No ratings yet
Artificial neural network course slides
61 pages
Deep Learning Introduction Class (1)
No ratings yet
Deep Learning Introduction Class (1)
46 pages
Nature14539 PDF
No ratings yet
Nature14539 PDF
9 pages
ML_FINAL Reference (1)
No ratings yet
ML_FINAL Reference (1)
89 pages
Machine Learning and Deep Neural Networks
No ratings yet
Machine Learning and Deep Neural Networks
8 pages
Lecture_1
No ratings yet
Lecture_1
10 pages
2019 6S191 L6 PDF
No ratings yet
2019 6S191 L6 PDF
61 pages
Unit 1a - Fundamentals of Deep Learning
No ratings yet
Unit 1a - Fundamentals of Deep Learning
54 pages
Lecture 1a - Introduction
No ratings yet
Lecture 1a - Introduction
38 pages
Deep Learning With Python Illustrated Guide For Beginners & Intermediates: The Future Is Here!: The Future Is Here!, #2
From Everand
Deep Learning With Python Illustrated Guide For Beginners & Intermediates: The Future Is Here!: The Future Is Here!, #2
William Sullivan
1/5 (1)
Math for Deep Learning: What You Need to Know to Understand Neural Networks
From Everand
Math for Deep Learning: What You Need to Know to Understand Neural Networks
Ronald T. Kneusel
No ratings yet
Deep Learning
From Everand
Deep Learning
Manish Soni
No ratings yet
SVM & CNN
No ratings yet
SVM & CNN
62 pages
Unit 6 Application of AI
No ratings yet
Unit 6 Application of AI
91 pages
Ritesh Mangla ML PracticalFile
No ratings yet
Ritesh Mangla ML PracticalFile
55 pages
NNFLC Question
No ratings yet
NNFLC Question
1 page
Implementation of Neural Network Back Propagation Training Algorithm On FPGA
No ratings yet
Implementation of Neural Network Back Propagation Training Algorithm On FPGA
19 pages
A Survey of Deep Learning Techniques Applied To Trading: Limit Order Book Modeling
No ratings yet
A Survey of Deep Learning Techniques Applied To Trading: Limit Order Book Modeling
10 pages
3D Human Pose Estimation in Video With Temporal Convolutions and Semi-Supervised Training
No ratings yet
3D Human Pose Estimation in Video With Temporal Convolutions and Semi-Supervised Training
13 pages
(Advances in Intelligent Systems and Computing 836) Oleg Chertov, Tymofiy Mylovanov, Yuriy Kondratenko, Janusz Kacprzyk, Vladik Kreinovich, Vadim Stefanuk-Recent Developments in Data Science and Intel.pdf
No ratings yet
(Advances in Intelligent Systems and Computing 836) Oleg Chertov, Tymofiy Mylovanov, Yuriy Kondratenko, Janusz Kacprzyk, Vladik Kreinovich, Vadim Stefanuk-Recent Developments in Data Science and Intel.pdf
391 pages
Module 3-1 PDF
No ratings yet
Module 3-1 PDF
43 pages
Leveraging Web Scraping To Develop A Fake News Detection Model For Philippine News Using RNN-LSTM
No ratings yet
Leveraging Web Scraping To Develop A Fake News Detection Model For Philippine News Using RNN-LSTM
7 pages
BL-COMP-6103-LEC-1933T CURRENT Trends and Issues
No ratings yet
BL-COMP-6103-LEC-1933T CURRENT Trends and Issues
26 pages
Generative AI-Driven Storytelling: A New Era For Marketing: Marko Vidrih
No ratings yet
Generative AI-Driven Storytelling: A New Era For Marketing: Marko Vidrih
17 pages
Neural Networks:: Basics Using MATLAB
No ratings yet
Neural Networks:: Basics Using MATLAB
54 pages
Deep Learning - AD3501 - Important Question and 2 Marks With Answers - Unit 1
No ratings yet
Deep Learning - AD3501 - Important Question and 2 Marks With Answers - Unit 1
13 pages
Computer Science
No ratings yet
Computer Science
54 pages
Ai in Communication Electronics
No ratings yet
Ai in Communication Electronics
16 pages
Detection of Turkish Fake News From Tweets With BERT Models
No ratings yet
Detection of Turkish Fake News From Tweets With BERT Models
14 pages
Anomaly Detection With CNN Autoencoders For Cloud-Based AI Systems
No ratings yet
Anomaly Detection With CNN Autoencoders For Cloud-Based AI Systems
28 pages
Minor
No ratings yet
Minor
48 pages
Computer - Science - and - Engineering - 2023-NITW Syllabus
No ratings yet
Computer - Science - and - Engineering - 2023-NITW Syllabus
69 pages
On The Challenges of Learning With Inference Networks On Sparse, High-Dimensional Data
No ratings yet
On The Challenges of Learning With Inference Networks On Sparse, High-Dimensional Data
14 pages
Download Full The Industrial Internet of Things (IIoT): Intelligent Analytics for Predictive Maintenance 1st Edition R. Anandan PDF All Chapters
100% (2)
Download Full The Industrial Internet of Things (IIoT): Intelligent Analytics for Predictive Maintenance 1st Edition R. Anandan PDF All Chapters
50 pages
FPGA Based Implementation of Binarized Neural Network For Sign Language Application
No ratings yet
FPGA Based Implementation of Binarized Neural Network For Sign Language Application
4 pages
Build Neural Network With MS Excel Sample
No ratings yet
Build Neural Network With MS Excel Sample
104 pages
Education Research International-Musso Et Al.
No ratings yet
Education Research International-Musso Et Al.
13 pages
Application of Neural Network Models For Mathematical Programming Problems - A State of The Art Review
No ratings yet
Application of Neural Network Models For Mathematical Programming Problems - A State of The Art Review
12 pages
Reinforcement Learning Toolbox™ Release Notes
No ratings yet
Reinforcement Learning Toolbox™ Release Notes
48 pages
Cheatsheet1
No ratings yet
Cheatsheet1
2 pages
mcq_dlei
No ratings yet
mcq_dlei
16 pages

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

01 Intro

Uploaded by

01 Intro

Uploaded by

Introduction

Lecture slides for Chapter 1 of Deep Learning

Adapted by: m.n. for CMPS 392

• Today, artificial intelligence(AI) is a thriving field with many

• The true challenge to artificial intelligence proved to be solving the

• A person’s everyday life requires an

• Computers need to capture this same

• The performance of these simple machine learning

• Logistic regression learns how each of these

• the choice of representation has an enormous effect

Figure 1.1 (Goodfellow 2016)

• and a decoder function that converts the new

• Autoencoders are trained to preserve as much

• but are also trained to make the new representation

• When analyzing a speech recording, the factors of

• How to disentangle the factors of variation and

Figure 1.2 (Goodfellow 2016)

• The function is formed by composing many simpler

• We can think of each application of a different

Figure 1.3 (Goodfellow 2016)

Figure 1.4 (Goodfellow 2016)

• We assume familiarity with

• Artificial neural networks (ANNs): engineered systems

Figure 1.7 (Goodfellow 2016)

• These models would learn a set of weights w1, … , wn

Class is sign (f(x,w))

• The adaptive linear element (ADALINE) simply returned

• Neuroscientists have found that ferrets can learn to

• Today, we simply do not have enough information

• Backpropagation: (Rumelhart et al., 1986; LeCun,

• other fields of machine learning made advances.

• We have the computational resources to run much

• As of 2016, a rough rule of thumb is that a supervised

Figure 1.8 (Goodfellow 2016)

• the advent of general purpose GPUs,

• faster network connectivity,

• better software infrastructure for distributed

Figure 1.10 (Goodfellow 2016)

Figure 1.11 (Goodfellow 2016)

Figure 1.9 (Goodfellow 2016)

Figure 1.12 (Goodfellow 2016)

• neural networks could learn to output an entire

• Competition and Convergence of Deep

Python 2 support ended on Jan 1, 2020.

Turing Award given for:

“The conceptual and engineering

• Stanford CS231n: Convolutional Neural Networks for Visual

• Stanford CS224n: Natural Language Processing with Deep

• The years ahead are full of challenges and opportunities to

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.