Reinforcement Learning in A Id - 12008003
Reinforcement Learning in A Id - 12008003
Presented By
Abdullah Al Rashid Samir
Id:12008003
WHAT IS ARTIFICIAL INTELLIGENCE (AI)?
• 1. Supervised Learning
• 2. Unsupervised Learning
• 3. Reinforcement Learning
SUPERVISED LEARNING
• 1. Linear Regression
• 2. Decision Trees
• 3. Support Vector Machines (SVM)
ADVANTAGES OF SUPERVISED
LEARNING
• 1. Clustering
• 2. Association
CLUSTERING
• 1. K-means clustering
• 2. PCA
ADVANTAGES OF UNSUPERVISED
LEARNING
• According to IBM:
• Beyond the agent-environment-goal triumvirate, four principal
sub-elements characterize reinforcement learning problems.
• - Policy
• - Reward signal.
• - Value function.
• - Model
KEY CONCEPT OF REINFORCEMENT LEARNING
• In supervised learning, you define both the input and the expected associated
output. For instance, you can provide a set of images labeled dogs or cats, and
the algorithm is then expected to identify a new animal image as a dog or cat.
• Supervised learning algorithms learn patterns and relationships between the
input and output pairs. Then, they predict outcomes based on new input data.
It requires a supervisor, typically a human, to label each data record in a
training data set with an output.
• In contrast, RL has a well-defined end goal in the form of a desired result but
no supervisor to label associated data in advance. During training, instead of
trying to map inputs with known outputs, it maps inputs with possible
outcomes. By rewarding desired behaviors, you give weightage to the best
outcomes.
R E IN F OR C E ME N T L E A R N IN G V S .
U N S U P E R V IS E D L E A R N IN G
• 1. Data Collection
• 2. Data Preprocessing
• 3. Model Training
• 4. Model Evaluation
• 5. Model Deployment
HOW FEEDBACK IS INCORPORATED
INTO THE LEARNING PROCESS
• GAMING BOT
GENETIC ALGORITHM
• 1.Google’s Deepmind
• 2. Tesla’s Self Driving Tasks
• 3. Traveling Salesperson Problem (TSP)
SOURCES
• IBM
• https://www.ibm.com/topics/reinforcement-learning
• Amazon
• https://aws.amazon.com/what-is/reinforcement-learning/
• GeeksforGeeks
• https://www.geeksforgeeks.org/what-is-reinforcement-learning/
• Google cloud
• https://cloud.google.com/learn/what-is-artificial-intelligence
SOURCES
• Javapoint
• https://
www.javatpoint.com/genetic-algorithm-in-machine-learning
• IBM. (n.d.). Genetic Algorithms for Optimization. Retrieved from
IBM Research Blog.
• Tutorials point
• Géron, A. (2019). Hands-On Machine Learning with Scikit-
Learn, Keras, and TensorFlow. O'Reilly Media.