0% found this document useful (0 votes)

12 views28 pages

Lecture Reinforcement Learning

Uploaded by

A Rajagopal am18d301

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

12 views28 pages

Lecture Reinforcement Learning

Uploaded by

A Rajagopal am18d301

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 28

Edge AI and Robotics Teaching Kit

Lecture 5.1
Reinforcement Learning
The Edge AI and Robotics Teaching Kit is licensed by NVIDIA and UMBC under the
Creative Commons Attribution-NonCommercial 4.0 International License.

2
Topics

• Describe concept of Reinforcement Learning

• Reinforcement Learning Algorithms and Approaches
• Deep Learning
• States, Actions, Rewards
• Lab and Example Environments

3
Learning Objectives - Reinforcement Learning

Explain concepts of Reinforcement Learning

Explain different reinforcement learning approaches
Describe DQN and how Q-Learning is leveraged
Gain hands-on experience training agents using sample environments
in Openai Gym

4
Reinforcement Learning Concepts

5
Concepts

• Environment- attributes
• Agents
• State/Actions
• Learning – policies, functions,
models
• Objective
• Rewards

6
© D . Poole and A. Mackworth 2019 Artificial Intelligence: Foundations of Computational Agents
Reinforcement Learning

Agent Environment

7
Reinforcement Learning

What should an agent do given:

• Prior knowledge – possible states, baseline, possible actions
• Observations – current state, immediate reward
• Goal – optimal set of actions that maximizes the mean cumulative discounted reward
We can train this agent approximating its environment

8
© D . Poole and A. Mackworth 2019 Artificial Intelligence: Foundations of Computational Agents
Reinforcement Learning Loop

Figure 1.2 The reinforcement learning control

loop

From Foundations of Deep Reinforcement Learning: Theory and Practice in Python by Laura Graesser and Wah Loon Keng (ISBN-13:
9
9780135172384)
Copyright © 2020 Pearson Education, Inc. All rights reserved.
Rewards and Values

Figure 1.4 Rewards r and values V(s) for each state s in a simple grid-world
environment. The value of a state is calculated from the rewards using
Equation 1.10 with  = 0.9 while using a policy  that always takes the
shortest path to the goal state with r = +1.

From Foundations of Deep Reinforcement Learning: Theory and Practice in Python by Laura Graesser and Wah Loon Keng (ISBN-13: 9780135172384)
10 Copyright © 2020 Pearson Education, Inc. All rights reserved.
Approaches

11
Reinforcement Learning Approaches

Figure 1.5 Deep reinforcement learning

algorithm families

From Foundations of Deep Reinforcement Learning: Theory and Practice in Python by Laura Graesser and Wah Loon Keng (ISBN-13: 9780135172384)
Copyright © 2020 Pearson Education, Inc. All rights reserved.
12
Neural Networks Leveraged for RL

Figure 12.4 Neural network families

14
Simple Environment

Figure 3.1 Simple environment: five states, two actions per state

From Foundations of Deep Reinforcement Learning: Theory and Practice in Python by Laura Graesser and Wah Loon Keng (ISBN-13:
9780135172384)
15 Copyright © 2020 Pearson Education, Inc. All rights reserved.
Simple Environment

Figure 3.2 Optimal Q-values for

the simple environment from
Figure 3.1,  = 0.9

From Foundations of Deep Reinforcement Learning: Theory and Practice in Python by Laura Graesser and Wah Loon Keng
(ISBN-13: 9780135172384)
16 Copyright © 2020 Pearson Education, Inc. All rights reserved.
Simple Environment - Learning

Figure 3.3 Learning the Q*(s, a) for the simple environment from Figure 3.1

From Foundations of Deep Reinforcement Learning: Theory and Practice in Python by Laura Graesser and Wah Loon Keng (ISBN-13: 9780135172384)
17 Copyright © 2020 Pearson Education, Inc. All rights reserved.
Simple Environment – Optimal Values

Figure 3.4 Optimal Q-values for the simple environment from Figure 3.1,  = 0
(left),  = 1 (right)

From Foundations of Deep Reinforcement Learning: Theory and Practice in Python by Laura Graesser and Wah Loon Keng (ISBN-13:
9780135172384)
18 Copyright © 2020 Pearson Education, Inc. All rights reserved.
Processing of Data

Figure 14.2 Information flow from the world to an algorithm

From Foundations of Deep Reinforcement Learning: Theory and Practice in Python by Laura Graesser and Wah Loon Keng (ISBN-13:
19 9780135172384)
Reinforcement Learning - GPU

20
Environment

21
Gym Openai

Gym (openai.com)

For Jetson- Github Project: dusty-nv

/jetson-reinforcement: Deep reinforcement learning GPU libraries for NVIDIA Jetson TX1/TX2 with
PyTorch, OpenAI Gym, and Gazebo robotics simulator. (github.com)

Sample Notebook Tutorial using GPU: jetson-reinforcement/intro-DQN.ipynb at master · dusty-nv

/jetson-reinforcement (github.com)

With ROS and Gazebo

https://github.com/AcutronicRobotics/gym-gazebo2/blob/dashing/docker/README.md

22
OpenAI Gym - Cartpole

Figure 1.1 CartPole-v0 is a simple toy environment. The objective is to

balance a pole for 200 time steps by controlling the left-
right motion of a cart.

From Foundations of Deep Reinforcement Learning: Theory and Practice in Python by Laura Graesser and Wah Loon Keng (ISBN-13: 9780135172384)
23 Copyright © 2020 Pearson Education, Inc. All rights reserved.
OpenAI Gym - Cartpole

(a) t = 1 (b) t = 2 (c) t (d) t = 4

=3
Figure 14.7 Four consecutive frames of the
CartPole environment

Figure B.3 The LunarLander-v2 environment. The objective is to steer and

land the lander between the flags using minimal fuel,
without crashing.

From Foundations of Deep Reinforcement Learning: Theory and Practice in Python by Laura Graesser and Wah Loon Keng (ISBN-13: 9780135172384)
25 Copyright © 2020 Pearson Education, Inc. All rights reserved.
OpenAI Gym - Environments

CartPole Atari Breakout BipedalWalker

Figure 1.3 Three example environments with different states, actions, and
rewards. These environments are available in
OpenAI Gym.

From Foundations of Deep Reinforcement Learning: Theory and Practice in Python by Laura Graesser and Wah Loon Keng (ISBN-13:
9780135172384)
26 Copyright © 2020 Pearson Education, Inc. All rights reserved.
Additional Information

Foundations of Deep Reinforcement Learning

https://www.pearson.com/us/higher-education/program/Graesser-Foundations-of-Deep-Reinforceme
nt-Learning-Theory-and-Practice-in-Python/PGM2027228.html

NVIDIA Technical Blog – Deep Learning in a Nutshell: Reinforcement Learning

https://developer.nvidia.com/blog/deep-learning-nutshell-reinforcement-learning/

27
Thank You
Edge AI and Robotics Teaching Kit

CMPE257 - W10C13 - Reinforcement Learning
No ratings yet
CMPE257 - W10C13 - Reinforcement Learning
161 pages
RL Introduction
No ratings yet
RL Introduction
225 pages
Siri Jagatthu English 2021
No ratings yet
Siri Jagatthu English 2021
128 pages
Tory Ime: The Story of Ramayan
100% (1)
Tory Ime: The Story of Ramayan
3 pages
Deep Reinforcement Learning: Lecture Notes
No ratings yet
Deep Reinforcement Learning: Lecture Notes
60 pages
Deep Reinforcement Learning
100% (4)
Deep Reinforcement Learning
48 pages
Deep Reinforcement Learning
100% (1)
Deep Reinforcement Learning
410 pages
NPC v. Heirs of Casionan
100% (2)
NPC v. Heirs of Casionan
2 pages
Falcis V CIvil Registrar Case Digest
No ratings yet
Falcis V CIvil Registrar Case Digest
2 pages
BW Fullness of Christ SG INT Final
No ratings yet
BW Fullness of Christ SG INT Final
128 pages
Drow Elves R.C.C. Information
100% (1)
Drow Elves R.C.C. Information
25 pages
Functional Requirement Document
No ratings yet
Functional Requirement Document
12 pages
Case No 114 Philippine Tobacco Flu Curing and Redrying Corp Vs NLRC Dec 10, 1998
No ratings yet
Case No 114 Philippine Tobacco Flu Curing and Redrying Corp Vs NLRC Dec 10, 1998
4 pages
Questions and Solutions
No ratings yet
Questions and Solutions
47 pages
Deep Reinforcement Learning
No ratings yet
Deep Reinforcement Learning
406 pages
DRL Final Notes
No ratings yet
DRL Final Notes
281 pages
Daftar Pustaka
100% (3)
Daftar Pustaka
18 pages
Lesson Plan 1-Triumpfh of Surgery
No ratings yet
Lesson Plan 1-Triumpfh of Surgery
9 pages
(Addison-Wesley Data & Analytics Series) Laura Graesser - Wah Loon Keng - Foundations of Deep Reinforcement Learning - Theory and Practice in Python-Addison-Wesley Professional (2019) PDF
100% (1)
(Addison-Wesley Data & Analytics Series) Laura Graesser - Wah Loon Keng - Foundations of Deep Reinforcement Learning - Theory and Practice in Python-Addison-Wesley Professional (2019) PDF
656 pages
Beige Scrapbook Geography Presentation
No ratings yet
Beige Scrapbook Geography Presentation
60 pages
Discrete Maths
No ratings yet
Discrete Maths
6 pages
Responsible Use of Media and Information
No ratings yet
Responsible Use of Media and Information
14 pages
Cystotomy 37 L
No ratings yet
Cystotomy 37 L
19 pages
0500, Paper 2, Section B, Narrative Writing: by Ms Mehala Lesson 1 and 2 Monday 4/5/2020
100% (3)
0500, Paper 2, Section B, Narrative Writing: by Ms Mehala Lesson 1 and 2 Monday 4/5/2020
12 pages
VSIM Clinical Worksheet 07.16.2020
No ratings yet
VSIM Clinical Worksheet 07.16.2020
6 pages
The Power of The Mantram: by Eknath Easwaran
No ratings yet
The Power of The Mantram: by Eknath Easwaran
16 pages
DCRG8
No ratings yet
DCRG8
16 pages
Presentation by Francois Mercer
No ratings yet
Presentation by Francois Mercer
14 pages
CIVL 4750 Numerical Solutions To Geotechnical Problems: I: TA: T V: Tuesday/ C O
No ratings yet
CIVL 4750 Numerical Solutions To Geotechnical Problems: I: TA: T V: Tuesday/ C O
3 pages
11 Effective Note Taking Strategies - PDF - Safe
No ratings yet
11 Effective Note Taking Strategies - PDF - Safe
2 pages
Lecture 1: Introduction: Reinforcement Learning With Tensorflow&Openai Gym
No ratings yet
Lecture 1: Introduction: Reinforcement Learning With Tensorflow&Openai Gym
18 pages
3.03 Who Has The Power?: Name
No ratings yet
3.03 Who Has The Power?: Name
2 pages
13-RL DRL
No ratings yet
13-RL DRL
102 pages
Reign of Ashurbanipal and Akhenaten
No ratings yet
Reign of Ashurbanipal and Akhenaten
2 pages
Classroom 1 Class Notes For Article
No ratings yet
Classroom 1 Class Notes For Article
2 pages
Reinforcement Learning (RL) : Big Data Mining
No ratings yet
Reinforcement Learning (RL) : Big Data Mining
86 pages
18 Feng Shui Secrets
100% (10)
18 Feng Shui Secrets
14 pages
Profile of Acute Lower Respiratory Tract Infection in Children Under Fourteen Years of Age at Nepal Medical College Teaching Hospital (NMCTH)
No ratings yet
Profile of Acute Lower Respiratory Tract Infection in Children Under Fourteen Years of Age at Nepal Medical College Teaching Hospital (NMCTH)
4 pages
Report ML Aat g1 Final
No ratings yet
Report ML Aat g1 Final
8 pages
Stockhammer TCP 2019
No ratings yet
Stockhammer TCP 2019
37 pages
w7 - Reinforcement Learning
No ratings yet
w7 - Reinforcement Learning
5 pages
Introductin Ibra
No ratings yet
Introductin Ibra
2 pages
History of Computers
No ratings yet
History of Computers
3 pages
Deep Reinforcement Learning
No ratings yet
Deep Reinforcement Learning
47 pages
RL Intro-2
No ratings yet
RL Intro-2
24 pages
An Invitation To Deep Reinforcement Learning: Bernhard Jaeger
No ratings yet
An Invitation To Deep Reinforcement Learning: Bernhard Jaeger
39 pages
Unit - 1
No ratings yet
Unit - 1
14 pages
Chapter 1
No ratings yet
Chapter 1
33 pages
Deep Reinforcement Learning
No ratings yet
Deep Reinforcement Learning
25 pages
Unit 4
No ratings yet
Unit 4
23 pages
Reinforcement Learning With Python
No ratings yet
Reinforcement Learning With Python
24 pages
Deep Reinforcement Learning
No ratings yet
Deep Reinforcement Learning
25 pages
ARTICLEONnlp
No ratings yet
ARTICLEONnlp
18 pages
RL PyTexas 2017 PDF
No ratings yet
RL PyTexas 2017 PDF
29 pages
Unit 5 ML
No ratings yet
Unit 5 ML
49 pages
Lec 23
No ratings yet
Lec 23
51 pages
Lec 4
No ratings yet
Lec 4
7 pages
A Crash Course On Reinforcement Learning
No ratings yet
A Crash Course On Reinforcement Learning
40 pages
Lec 1 Intro Course Overview
No ratings yet
Lec 1 Intro Course Overview
50 pages
6S191 MIT DeepLearning L5
No ratings yet
6S191 MIT DeepLearning L5
62 pages
Reinforcement Learning For IoT - Final
No ratings yet
Reinforcement Learning For IoT - Final
45 pages
L13 Reinforcement Learning
No ratings yet
L13 Reinforcement Learning
35 pages
cs224r L01 Intro
No ratings yet
cs224r L01 Intro
51 pages
Chapter 1 Introduction RL Report Kiran
No ratings yet
Chapter 1 Introduction RL Report Kiran
2 pages
SSRN 4768234
No ratings yet
SSRN 4768234
6 pages
Midterm Report Example3
No ratings yet
Midterm Report Example3
4 pages
Unit 5d - Deep Reinforcement Learning
No ratings yet
Unit 5d - Deep Reinforcement Learning
52 pages
Fundamentals of Reinforcement Learning
No ratings yet
Fundamentals of Reinforcement Learning
33 pages
Origins of Life Questions and Debates
No ratings yet
Origins of Life Questions and Debates
12 pages
Reinforcement Learning-1
No ratings yet
Reinforcement Learning-1
19 pages
Lecture Week12
No ratings yet
Lecture Week12
37 pages
Lec 01
No ratings yet
Lec 01
60 pages
Report On Reinforcement Learning
No ratings yet
Report On Reinforcement Learning
26 pages
An Introduction To Deep Reinforcement Learning PDF
No ratings yet
An Introduction To Deep Reinforcement Learning PDF
140 pages
Lecture Notes v1.0 687 F22
No ratings yet
Lecture Notes v1.0 687 F22
115 pages
03 04 Lessonarticle
No ratings yet
03 04 Lessonarticle
5 pages
1 Introduction To RL
No ratings yet
1 Introduction To RL
46 pages
Autonomous Car Racing in Simulation Environment Using Deep Reinforcement Learning
No ratings yet
Autonomous Car Racing in Simulation Environment Using Deep Reinforcement Learning
6 pages
Towards Adapting Reinforcement Learning Agents To New Tasks: Insights From Q-Values
No ratings yet
Towards Adapting Reinforcement Learning Agents To New Tasks: Insights From Q-Values
10 pages
Reinforcement Learning Notes ?
No ratings yet
Reinforcement Learning Notes ?
40 pages
Building Reinforcement Learning Environment
No ratings yet
Building Reinforcement Learning Environment
7 pages
Typical Examples of Cultural Differences
No ratings yet
Typical Examples of Cultural Differences
2 pages
Reinforcement Learning
No ratings yet
Reinforcement Learning
3 pages
AI-unit 3
No ratings yet
AI-unit 3
55 pages
A Beginners Guide To Deep Reinforcement Learning PDF
No ratings yet
A Beginners Guide To Deep Reinforcement Learning PDF
9 pages
Deep Reinforcement Learning Mohit Sewak
No ratings yet
Deep Reinforcement Learning Mohit Sewak
6 pages
Deep Reinforcement Learning - Guide To Deep Q-Learning
No ratings yet
Deep Reinforcement Learning - Guide To Deep Q-Learning
1 page
Lecture 1: Introduction To Reinforcement Learning: David Silver
No ratings yet
Lecture 1: Introduction To Reinforcement Learning: David Silver
46 pages
Practical Deep Reinforcement Learning with Python: Concise Implementation of Algorithms, Simplified Maths, and Effective Use of TensorFlow and PyTorch (English Edition)
From Everand
Practical Deep Reinforcement Learning with Python: Concise Implementation of Algorithms, Simplified Maths, and Effective Use of TensorFlow and PyTorch (English Edition)
Ivan Gridin
4/5 (1)
Math for Deep Learning: What You Need to Know to Understand Neural Networks
From Everand
Math for Deep Learning: What You Need to Know to Understand Neural Networks
Ronald T. Kneusel
No ratings yet

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

Lecture Reinforcement Learning

Uploaded by

Lecture Reinforcement Learning

Uploaded by

Edge AI and Robotics Teaching Kit

• Describe concept of Reinforcement Learning

Explain concepts of Reinforcement Learning

What should an agent do given:

Figure 1.2 The reinforcement learning control

Figure 1.5 Deep reinforcement learning

Figure 12.4 Neural network families

Figure 3.2 Optimal Q-values for

Figure 14.2 Information flow from the world to an algorithm

For Jetson- Github Project: dusty-nv

Sample Notebook Tutorial using GPU: jetson-reinforcement/intro-DQN.ipynb at master · dusty-nv

With ROS and Gazebo

Figure 1.1 CartPole-v0 is a simple toy environment. The objective is to

(a) t = 1 (b) t = 2 (c) t (d) t = 4

Figure B.3 The LunarLander-v2 environment. The objective is to steer and

CartPole Atari Breakout BipedalWalker

Foundations of Deep Reinforcement Learning

NVIDIA Technical Blog – Deep Learning in a Nutshell: Reinforcement Learning

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.