RL Vishnu Sankar

Uploaded by

Vishnu Vgrp1

Reinforcement Learning

By,
Vishnu Sankar
Roll no 51
Agenda
• What is Reinforcement Learning
• Why it is important
• Important terms used in RL
• How it works
• Types of Reinforcement Learning
• Reinforcement Learning Algorithms
• Learning Models of Reinforcement
• Reinforcement learning VS Deep learning
• Reinforcement learning VS Supervised Learning
• Characteristics
• Applications
• Challenges
• Key takeaways
What is Reinforcement Learning
• Reinforcement Learning (RL) is a Machine Learning method
• that is concerned with how software agents should take actions
in an environment.
• It is a branch of machine learning, not a part of deep learning,
although the two can be combined (deep reinforcement learning).
• The agent's goal is to maximize the cumulative reward it
collects over time.
• The agent uses trial and error to come up with a solution to
the problem.
• To get the machine to do what the programmer wants, the agent
receives either rewards or penalties for the actions it performs.
Why it is important
• Reinforcement learning delivers decisions.
• By creating a simulation of an entire business or system,
• it becomes possible for an intelligent system to test new actions or
approaches,
• changing course when failures happen (negative reinforcement)
• while building on successes (positive reinforcement).
Important terms used in RL
• Agent: An entity that performs actions in an environment to gain
some reward.
• Environment (e): The scenario that the agent has to face.
• Reward (R): An immediate return given to the agent when it performs
a specific action or task.
• State (s): The current situation returned by the environment.
• Policy (π): The strategy the agent applies to decide the next
action based on the current state.
• Value (V): The expected long-term return with discount, as
opposed to the short-term reward.
• Value function: Specifies the value of a state, that is, the total
amount of reward an agent can expect to accumulate starting
from that state.
• Model of the environment: Mimics the behaviour of the
environment. It lets you make inferences about how the
environment will behave.
• Model-based methods: Methods for solving reinforcement
learning problems that use a model of the environment.
• Q value or action value (Q): Similar to value, except that it
takes the current action as an additional parameter.
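The difference between an immediate reward and a discounted long-term return can be shown with a short calculation. This is an illustrative sketch that is not part of the original slides; the function name, the discount factor `gamma`, and the reward sequence are assumed values.

```python
def discounted_return(rewards, gamma=0.9):
    """Sum the rewards, weighting the reward at step t by gamma**t."""
    total = 0.0
    for t, r in enumerate(rewards):
        total += (gamma ** t) * r
    return total


# Hypothetical rewards received at steps 0, 1 and 2.
rewards = [1.0, 0.0, 2.0]
g = discounted_return(rewards, gamma=0.5)
print(g)  # 1.0 + 0.5 * 0.0 + 0.25 * 2.0 = 1.5
```

A smaller `gamma` makes the agent care more about rewards that arrive soon; a `gamma` close to 1 makes later rewards count almost as much as immediate ones.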
How Reinforcement Learning works
• It is about taking suitable actions to maximize reward.
• In reinforcement learning there is no answer key; the agent decides
what to do to perform the task.
• Suppose we have an agent and a reward with many hurdles in
between.
• The agent is supposed to find the best possible path to reach the
reward.
• Let's take an example.
• Your cat is an agent that is exposed to the environment (your house).
• The agent reacts by performing an action that moves it from one
"state" to another "state".
• An example of a state is your cat sitting, and you use a
specific word to tell the cat to walk.
• The cat then goes from sitting to walking.
• The reaction of the agent is an action, and the policy is a method of
selecting an action given a state in expectation of better
outcomes.
• After the transition, the agent may get a reward or a penalty in return.
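The cat example can be written as a small agent-environment loop. The sketch below is only an illustration under assumed rules: the state names, the action names, and the reward values are hypothetical and do not come from the slides.

```python
import random

STATES = ["sitting", "walking"]
ACTIONS = ["stay", "walk"]


def step(state, action):
    """Hypothetical environment: returns (next_state, reward)."""
    if state == "sitting" and action == "walk":
        return "walking", 1.0   # reward: the cat did what was asked
    if state == "sitting" and action == "stay":
        return "sitting", -1.0  # penalty: the cat ignored the command
    return "walking", 0.0       # already walking: nothing more to earn


def random_policy(state, rng):
    """Policy: maps a state to an action (here, chosen at random)."""
    return rng.choice(ACTIONS)


def run_episode(policy, rng, max_steps=5):
    """Let the agent act for a few steps and add up the rewards."""
    state, total = "sitting", 0.0
    for _ in range(max_steps):
        action = policy(state, rng)
        state, reward = step(state, action)
        total += reward
    return state, total


rng = random.Random(0)
final_state, total_reward = run_episode(random_policy, rng)
print(final_state, total_reward)
```

The policy here is random on purpose: it shows the loop of state, action, transition, and reward before any learning is added.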
Types of Reinforcement Learning
Positive:
• Defined as an event that occurs because of a specific behavior.
• It increases the strength and frequency of that behavior.
• In positive reinforcement, a favorable stimulus is added.
• Helps to maximize performance and sustain change for a longer
period.
• The stimulus acts as a reward for doing something.
• Strengthens or maintains the probability that the response
recurs.
Negative:
• Defined as the strengthening of a behavior that occurs because a
negative condition is stopped or avoided.
• Helps to define a minimum standard of performance.
• In negative reinforcement, an unfavorable stimulus is removed.
• The unfavorable stimulus acts like a penalty for not doing
something, and removing it rewards the desired behavior.
• The agent learns to act so that the unpleasant condition goes away.
Reinforcement Learning Algorithms
• There are three approaches to implementing a Reinforcement Learning
algorithm.
Value-based:
• In a value-based Reinforcement Learning method, you try to
maximize a value function V(s).
• In this method, the agent expects a long-term return from the
current state under policy π.
Policy-based:
• In a policy-based RL method, you try to come up with a policy such that
the action performed in every state helps you gain maximum reward
in the future.
Two types of policy-based methods are:
– Deterministic: the policy produces the same action for a given state.
– Stochastic: every action has a certain probability of being chosen in a
given state.
Model-based:
• In this method, you need to create a virtual model for each
environment. The agent learns to perform in that specific environment.
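The two kinds of policy named under policy-based methods can be contrasted in a few lines. This is a minimal sketch with made-up states, actions, and probabilities, intended only to show the difference in shape between the two.

```python
import random

# Deterministic policy: each state maps to exactly one action.
deterministic_policy = {"sitting": "walk", "walking": "stay"}

# Stochastic policy: each state maps to a probability distribution
# over actions (the probabilities are hypothetical).
stochastic_policy = {
    "sitting": {"walk": 0.8, "stay": 0.2},
    "walking": {"walk": 0.1, "stay": 0.9},
}


def act_deterministic(state):
    """Always returns the same action for the same state."""
    return deterministic_policy[state]


def act_stochastic(state, rng):
    """Samples an action according to the state's probabilities."""
    actions = list(stochastic_policy[state])
    weights = list(stochastic_policy[state].values())
    return rng.choices(actions, weights=weights, k=1)[0]


rng = random.Random(42)
print(act_deterministic("sitting"))
print(act_stochastic("sitting", rng))
```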
Learning Models of Reinforcement
There are 2 types of learning models for Reinforcement Learning
Markov Decision Process
The following parameters are used to get a solution:
– Set of actions: A
– Set of states: S
– Reward: R
– Policy: π
– Value: V
• In the problem, an agent is supposed to decide the best action to
select based on its current state.
• When this step is repeated, the problem is known as a Markov
Decision Process.
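The parameters listed above can be laid out as plain data. The sketch below assumes a tiny, deterministic two-state problem with invented transitions, rewards, and discount factor; it estimates V for one fixed policy by repeatedly applying a one-step backup, which is one simple way to obtain V and not necessarily the method the slides have in mind.

```python
# All names and numbers below are hypothetical illustration values.
S = ["sitting", "walking"]  # set of states
A = ["stay", "walk"]        # set of actions
GAMMA = 0.5                 # discount factor (assumed)

# Deterministic transitions and rewards:
# (state, action) -> (next_state, reward)
T = {
    ("sitting", "stay"): ("sitting", 0.0),
    ("sitting", "walk"): ("walking", 1.0),
    ("walking", "stay"): ("walking", 0.0),
    ("walking", "walk"): ("walking", 0.0),
}

# Policy pi: state -> action
policy = {"sitting": "walk", "walking": "stay"}


def evaluate(policy, sweeps=50):
    """Estimate V(s) under a fixed policy by repeated one-step backups."""
    V = {s: 0.0 for s in S}
    for _ in range(sweeps):
        for s in S:
            next_s, r = T[(s, policy[s])]
            V[s] = r + GAMMA * V[next_s]
    return V


V = evaluate(policy)
print(V)  # {'sitting': 1.0, 'walking': 0.0}
```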
Q-Learning
• Q-learning is a value-based learning algorithm.
• Value-based algorithms update the value function based on an
equation.
• Q-learning uses the Bellman equation to update the value function.
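A tabular version of the Q-learning update can be sketched as follows. The environment, the epsilon-greedy action choice, and the values of `ALPHA`, `GAMMA`, and `EPSILON` are assumptions made for illustration; only the update rule in `q_update` is the standard Q-learning form of the Bellman equation.

```python
import random
from collections import defaultdict

ACTIONS = ["stay", "walk"]
ALPHA, GAMMA, EPSILON = 0.5, 0.9, 0.2  # assumed hyperparameters


def step(state, action):
    """Hypothetical environment: returns (next_state, reward, done)."""
    if state == "sitting" and action == "walk":
        return "walking", 1.0, True
    return state, 0.0, False


def choose_action(Q, state, rng):
    """Epsilon-greedy: mostly exploit, sometimes explore."""
    if rng.random() < EPSILON:
        return rng.choice(ACTIONS)
    return max(ACTIONS, key=lambda a: Q[(state, a)])


def q_update(Q, s, a, r, s_next, done):
    """Q(s,a) <- Q(s,a) + alpha * (r + gamma * max_a' Q(s',a') - Q(s,a))"""
    best_next = 0.0 if done else max(Q[(s_next, b)] for b in ACTIONS)
    Q[(s, a)] += ALPHA * (r + GAMMA * best_next - Q[(s, a)])


def train(episodes=200, max_steps=10, seed=0):
    """Run episodes from the 'sitting' state and learn a Q table."""
    rng = random.Random(seed)
    Q = defaultdict(float)  # unseen (state, action) pairs start at 0.0
    for _ in range(episodes):
        state = "sitting"
        for _ in range(max_steps):
            action = choose_action(Q, state, rng)
            next_state, reward, done = step(state, action)
            q_update(Q, state, action, reward, next_state, done)
            state = next_state
            if done:
                break
    return Q


Q = train()
print(Q[("sitting", "walk")], Q[("sitting", "stay")])
```

After training, the Q value for walking from the sitting state should be higher than the Q value for staying, so a greedy agent would choose to walk.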
Reinforcement Learning VS Deep Learning
Reinforcement learning VS Supervised Learning
Characteristics
• There is no supervisor, only a real number or reward signal.
• Sequential decision making.
• Time plays a crucial role in Reinforcement problems.
• Feedback is often delayed, not instantaneous.
• Agent's actions determine the subsequent data it receives.
Applications
• Robotics for industrial automation.
• Business strategy planning.
• Machine learning and data processing.
• Aircraft control and robot motion control.
Challenges of Reinforcement Learning
• Feature and reward design can be very involved.
• Parameters may affect the speed of learning.
• Realistic environments can have partial observability.
• Too much reinforcement may lead to an overload of states,
which can diminish the results.
• Realistic environments can be non-stationary.
Key Takeaways
• Reinforcement Learning is a Machine Learning method.
• Helps you to discover which action yields the highest reward over the
long term.
• Two types of reinforcement learning are 1) Positive 2) Negative.
• Applications of reinforcement learning methods include robotics for
industrial automation and business strategy planning.
• You should not use this method when you have enough data to solve
the problem with a supervised learning method.
• The biggest challenge of this method is that parameters may affect the
speed of learning.
Thank You..
