Unit 5 Learning
"Learning denotes changes in the system that are adaptive in the sense that they enable the
system to do the same task (or tasks drawn from the same population) more effectively the next
time." -- Herbert Simon
Types of Learning:
The strategies for learning can be classified according to the amount of inference the system has
to perform on its training data. In increasing order we have
1. Rote learning – the new knowledge is implanted directly with no inference at all, e.g. simple
memorization of past events, or a knowledge engineer’s direct programming of rules elicited
from a human expert into an expert system.
2. Supervised learning – the system is supplied with a set of training examples consisting of
inputs and corresponding outputs, and is required to discover the relation or mapping between
them, e.g. as a series of rules, or a neural network.
3. Unsupervised learning – the system is supplied with a set of training examples consisting
only of inputs and is required to discover for itself what appropriate outputs should be, e.g. a
Kohonen Network or Self Organizing Map.
Early expert systems relied on rote learning, but for modern AI systems we are generally
interested in the supervised learning of various levels of rules.
As with many other types of AI system, it is much more efficient to give the system enough
knowledge to get it started, and then leave it to learn the rest for itself. We may even end up with
a system that learns to be better than a human expert.
The general learning approach is to generate potential improvements, test them, and discard
those which do not work. Naturally, there are many ways we might generate the potential
improvements, and many ways we can test their usefulness. At one extreme, there are model
driven (top-down) generators of potential improvements, guided by an understanding of how the
problem domain works. At the other, there are data driven (bottom-up) generators, guided by
patterns in some set of training data.
As regards machines, we might say, very broadly, that a machine learns whenever it changes its
structure, program, or data (based on its inputs or in response to external information) in such a
manner that its expected future performance improves. Some of these changes, such as the
addition of a record to a database, fall comfortably within the province of other disciplines and
are not necessarily better understood for being called learning. But, for example, when the
performance of a speech-recognition machine improves after hearing several samples of a
person's speech, we feel quite justified in saying that the machine has learned.
Machine learning usually refers to the changes in systems that perform tasks associated
with artificial intelligence (AI). Such tasks involve recognition, diagnosis, planning, robot
control, prediction, etc. The changes might be either enhancements to already performing
systems or synthesis of new systems.
Supervised Learning
In supervised learning, you use input/output pairs (labeled data) to train the machine. You show
the algorithm both input variables (x) and an output variable (Y) and then have the algorithm
infer the mapping function from the input to the output.
The main goal here is getting the machine to produce a function that approximates the true
mapping well enough to predict outputs for new, unseen inputs.
Supervised learning problems can be further grouped into regression problems and classification
problems.
A regression problem is when the output is a real-valued, continuous quantity (e.g. a price or a
temperature), whereas a classification problem, as the name suggests, is when the outputs are
categories (e.g. spam or not spam).
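As a concrete illustration, here is a minimal sketch contrasting the two problem types; it
assumes scikit-learn is available, and the tiny dataset is invented purely for illustration:

# Minimal sketch contrasting regression and classification.
# Assumes scikit-learn is installed; the toy data is invented for illustration.
from sklearn.linear_model import LinearRegression, LogisticRegression

X = [[1], [2], [3], [4]]

# Regression: the target is a continuous quantity (e.g. a price).
y_continuous = [1.5, 3.1, 4.4, 6.2]
reg = LinearRegression().fit(X, y_continuous)
print(reg.predict([[5]]))   # a real-valued prediction

# Classification: the target is a category (e.g. spam / not spam).
y_categorical = [0, 0, 1, 1]
clf = LogisticRegression().fit(X, y_categorical)
print(clf.predict([[5]]))   # a class label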
Minimizing any function means finding the deepest valley in that function. Keep in mind that
the cost function is used to monitor the error in the predictions of an ML model, so minimizing
it basically means getting to the lowest error value possible, or equivalently increasing the
accuracy of the model. Gradient descent is best used when the parameters cannot be calculated
analytically (e.g. using linear algebra) and must be searched for by an optimization algorithm.
A random position on the surface of the bowl is the cost of the current values of the coefficients
(cost).
The bottom of the bowl is the cost of the best set of coefficients, the minimum of the function.
The goal is to continue to try different values for the coefficients, evaluate their cost and select
new coefficients that have a slightly better (lower) cost.
Repeating this process enough times will lead to the bottom of the bowl and you will know the
values of the coefficients that result in the minimum cost.
The procedure starts with an initial value for the coefficient, e.g. zero or a small random value:
coefficient = 0.0
The cost of the coefficients is evaluated by plugging them into the function and calculating the
cost:
cost = f(coefficient)
or
cost = evaluate(f(coefficient))
The derivative of the cost is calculated. The derivative is a concept from calculus and refers to
the slope of the function at a given point. We need to know the slope so that we know the
direction (sign) to move the coefficient values in order to get a lower cost on the next iteration.
delta = derivative(cost)
Now that we know from the derivative which direction is downhill, we can update the
coefficient values. A learning rate parameter (alpha) must be specified that controls how much
the coefficients can change on each update:
coefficient = coefficient - (alpha * delta)
This process is repeated until the cost of the coefficients (cost) is 0.0 or close enough to zero to
be good enough.
You can see how simple gradient descent is. It does require you to know the gradient of your
cost function or the function you are optimizing, but besides that, it’s very straightforward. Next
we will see how we can use this in machine learning algorithms.
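To make the procedure concrete, here is a minimal Python sketch of gradient descent on a
one-coefficient problem. The cost function f(x) = x^2, the starting point, and the learning rate
are illustrative choices, not part of the text above:

# Gradient descent on a single coefficient.
# The cost f(x) = x**2 (minimum at x = 0) and alpha are illustrative choices.

def cost(x):
    return x ** 2

def derivative(x):
    # Slope of the cost function at x
    return 2 * x

coefficient = 5.0   # arbitrary starting value
alpha = 0.1         # learning rate: how far to step on each update

for i in range(50):
    delta = derivative(coefficient)            # which way is downhill?
    coefficient = coefficient - alpha * delta  # step downhill
print(coefficient, cost(coefficient))          # both approach 0.0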
The LMS algorithm was introduced by Widrow and Hoff in 1959. It has several names,
including the Widrow-Hoff rule and the Delta rule. LMS is an example of a supervised learning
algorithm in neural networks, similar to the perceptron learning algorithm. The perceptron
learning algorithm trains the perceptron until it correctly classifies every example of the training
set, but LMS uses a different termination criterion: instead of training the perceptron until a
solution is found, training continues while the Mean-Square Error (MSE) is greater than a
certain value. This is the basis for the LMS algorithm.
In order to train the perceptron using LMS, we iterate over the training set, taking a set of
inputs, computing the output, and then using the error to adjust the weights. This can be done
either by selecting examples at random or by taking each example of the set in succession. The
learning rule of LMS is given as:
w_i = w_i + ρ * E * x_i, where E = (R − C)
The learning rule adjusts each weight based on the error E (R − C, or expected output minus
actual output). Once the error is calculated, the weights are adjusted by a small amount (the
learning rate ρ) in the direction of the input x_i. This has the effect of adjusting the weights to
reduce the output error.
The implementation of LMS is very simple. Initially, the weights vector is initialized with small
random weights. The main loop then randomly selects a training example, calculates the output
of the neuron, and then calculates the error. Using the error, the learning rule is applied to
each weight in the vector. The loop then checks the MSE to see if it has reached an acceptable
value, and if so, exits and emits the computed truth table for the neuron.
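As a rough sketch of that loop (the bipolar AND truth table, the learning rate rho, and the MSE
threshold are illustrative assumptions, not given in the text above):

# A minimal LMS (Widrow-Hoff) sketch for a single linear neuron.
# The bipolar AND data, rho, and the MSE threshold are illustrative choices.
import random

# Training set: (inputs including a bias input, expected output R)
tests = [([ 1,  1, 1],  1),
         ([ 1, -1, 1], -1),
         ([-1,  1, 1], -1),
         ([-1, -1, 1], -1)]

weights = [random.uniform(-0.5, 0.5) for _ in range(3)]
rho = 0.05  # learning rate

def output(x):
    # Linear combination of inputs and weights
    return sum(w * xi for w, xi in zip(weights, x))

while True:
    x, R = random.choice(tests)    # randomly select a training example
    C = output(x)                  # actual output
    error = R - C                  # expected minus actual
    for i in range(len(weights)):  # w_i = w_i + rho * (R - C) * x_i
        weights[i] += rho * error * x[i]
    mse = sum((R - output(x)) ** 2 for x, R in tests) / len(tests)
    if mse < 0.3:                  # a linear neuron cannot reach zero error
        break                      # on AND, so the threshold is tolerant

# Emit the computed truth table (sign of the linear output).
for x, R in tests:
    print(x[:2], "->", 1 if output(x) > 0 else -1, "(target", R, ")")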
The least mean square algorithm uses a technique called the "method of steepest descent" and
continuously estimates results by updating filter weights. Through the principle of algorithm
convergence, the least mean square algorithm provides particular learning curves useful in
machine learning theory and implementation. Many of these ideas are part of dedicated work on
refining machine learning models, matching inputs to outputs, making training and test processes
more effective, and generally pursuing "convergence", where the iterative learning process
resolves into a coherent (reasonable or logical) final result instead of getting off track.
As the algorithm's name implies, the errors (and therefore the learning) propagate backwards
from the output nodes to the inner nodes. So technically speaking, back propagation is used to
calculate the gradient of the error of the network with respect to the network's modifiable
weights. This gradient is almost always then used in a simple stochastic gradient descent
algorithm, a general optimization algorithm that is typically used to fit the parameters of a
machine learning model by finding weights that minimize the error. Often the term "back
propagation" is used in a more general sense, to refer to the entire procedure encompassing both
the calculation of the gradient and its use in stochastic gradient descent. Back propagation
usually allows quick convergence on satisfactory local minima for error in the kind of networks
to which it is suited.
Back propagation networks are necessarily multilayer perceptrons (usually with one input, one
hidden, and one output layer). In order for the hidden layer to serve any useful function,
multilayer networks must have non-linear activation functions for the multiple layers: a
multilayer network using only linear activation functions is equivalent to some single layer,
linear network.
The back propagation algorithm provides a computationally efficient method for training multi-
layer networks. In outline:
1. Present a training sample to the network.
2. Compare the network's output to the desired output for that sample, and calculate the error
in each output neuron.
3. For each output neuron, calculate how much higher or lower its output must be to match the
desired output. This is the local error.
4. Adjust the weights of each neuron to lower its local error.
5. Assign "blame" for the local error to neurons at the previous level, giving greater
responsibility to neurons connected by stronger weights.
6. Repeat from step 3 on the neurons at the previous level, using each one's "blame" as its error.
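To make these steps concrete, here is a minimal back propagation sketch: a small network with
one hidden layer learning XOR. The architecture, random seed, learning rate, and epoch count
are illustrative choices, and convergence depends on the random initialization:

# Back propagation on a 2-3-1 sigmoid network learning XOR.
# Architecture and hyperparameters are illustrative choices.
import numpy as np

rng = np.random.default_rng(1)

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

# XOR training data: inputs X, targets T
X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]], dtype=float)
T = np.array([[0], [1], [1], [0]], dtype=float)

W1 = rng.normal(0, 1, (2, 3)); b1 = np.zeros(3)   # input -> hidden
W2 = rng.normal(0, 1, (3, 1)); b2 = np.zeros(1)   # hidden -> output
alpha = 0.5                                        # learning rate

for epoch in range(10000):
    H = sigmoid(X @ W1 + b1)          # forward pass: hidden layer
    Y = sigmoid(H @ W2 + b2)          # forward pass: output layer
    dY = (Y - T) * Y * (1 - Y)        # output "blame": error * slope
    dH = (dY @ W2.T) * H * (1 - H)    # blame propagated backwards
    W2 -= alpha * H.T @ dY; b2 -= alpha * dY.sum(axis=0)
    W1 -= alpha * X.T @ dH; b1 -= alpha * dH.sum(axis=0)

print(np.round(Y.ravel(), 2))   # should approach [0, 1, 1, 0]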
Because one iteration of the gradient descent algorithm requires a prediction for each instance in
the training dataset, it can take a long time when you have many millions of instances.
In situations when you have large amounts of data, you can use a variation of gradient descent
called stochastic gradient descent.
In this variation, the gradient descent procedure described above is run but the update to the
coefficients is performed for each training instance, rather than at the end of the batch of
instances.
The first step of the procedure requires that the order of the training dataset is randomized. This
is to mix up the order in which updates are made to the coefficients. Because the coefficients are
updated after every training instance, the updates will be noisy, jumping all over the place, and so
will the corresponding cost function.
The update procedure for the coefficients is the same as that above, except the cost is not
summed over all training patterns, but instead calculated for one training pattern.
The learning can be much faster with stochastic gradient descent for very large training
datasets and often you only need a small number of passes through the dataset to reach a good or
good enough set of coefficients, e.g. 1-to-10 passes through the dataset.
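A minimal sketch of stochastic gradient descent for simple linear regression (the toy data and
hyperparameters are illustrative choices); note that the coefficients are updated after every single
instance, not at the end of the batch:

# Stochastic gradient descent for simple linear regression: y ≈ b0 + b1*x.
# The toy data and hyperparameters are illustrative choices.
import random

data = [(1, 1.2), (2, 1.9), (3, 3.2), (4, 3.9), (5, 5.1)]
b0, b1 = 0.0, 0.0
alpha = 0.01

for epoch in range(100):
    random.shuffle(data)        # randomize the order on each pass
    for x, y in data:
        pred = b0 + b1 * x
        error = pred - y        # update after EVERY instance,
        b0 -= alpha * error     # not at the end of the batch
        b1 -= alpha * error * x

print(b0, b1)   # should approach an intercept near 0 and a slope near 1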
Stochastic gradient descent is a popular algorithm for training a wide range of models in machine
learning, including (linear) support vector machines, logistic regression and graphical
models. When combined with the backpropagation algorithm, it is the de facto standard
algorithm for training artificial neural networks.
Unsupervised learning
In unsupervised learning, an AI system is presented with unlabeled, uncategorized data and the
system's algorithms act on the data without prior training. The output is dependent upon the
coded algorithms. Subjecting a system to unsupervised learning is one way of testing AI.
Unsupervised learning algorithms can perform more complex processing tasks than supervised
learning systems. However, unsupervised learning can be more unpredictable than the
supervised alternative.
Hebbian Learning:
The oldest and most famous of all learning rules is Hebb’s postulate of learning:
"When an axon of cell A is near enough to excite a cell B and repeatedly or persistently takes
part in firing it, some growth process or metabolic change takes place in one or both cells such
that A's efficiency, as one of the cells firing B, is increased."
From the point of view of artificial neurons and artificial neural networks, Hebb's principle can
be described as a method of determining how to alter the weights between model neurons. The
weight between two neurons increases if the two neurons activate simultaneously—and reduces
if they activate separately. Nodes that tend to be either both positive or both negative at the same
time have strong positive weights, while those that tend to be opposite have strong negative
weights.
Hebb’s Algorithm:
1. Initialize all weights and the bias to zero.
2. For each training pair (input x, target t), update each weight as w_i = w_i + x_i * t, and the
bias as b = b + t.
The algorithm can be illustrated on the bipolar AND function, whose training set is:

X1    X2    Bias    Target
 1     1     1        1
 1    -1     1       -1
-1     1     1       -1
-1    -1     1       -1
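A minimal sketch of Hebb training on this table (the sign activation used to check the result is
an illustrative choice):

# Hebb-rule training on the bipolar AND data above.
# One pass over the four patterns; sign() serves as the activation.

patterns = [([ 1,  1],  1),
            ([ 1, -1], -1),
            ([-1,  1], -1),
            ([-1, -1], -1)]

w = [0.0, 0.0]   # weights, initialized to zero
b = 0.0          # bias weight

for x, t in patterns:
    # Hebb rule: strengthen weights when input and target agree
    w[0] += x[0] * t
    w[1] += x[1] * t
    b += t

for x, t in patterns:
    net = w[0] * x[0] + w[1] * x[1] + b
    out = 1 if net > 0 else -1
    print(x, "->", out, "(target", t, ")")
print("weights:", w, "bias:", b)   # ends with w = [2, 2], b = -2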
Competitive learning
Competitive learning is a form of unsupervised learning in artificial Neural Networks. The nodes
compete for the right to respond to a subset of the input data. Competitive learning works by
increasing the specialization of each node in the network. It is well suited to finding clusters
within data.
There are three basic elements to a competitive learning rule:
A set of neurons that are all the same, except for some randomly distributed synaptic
weights, and which therefore respond differently to a given set of input patterns.
A limit imposed on the "strength" of each neuron.
A mechanism that permits the neurons to compete for the right to respond to a given
subset of inputs, such that only one output neuron (or only one neuron per group) is
active (i.e. "on") at a time. The neuron that wins the competition is called a "winner-
take-all" neuron.
By recoding sets of correlated inputs to one of a few output neurons, competitive networks
essentially remove the redundancy in the representation.
Competitive learning is usually implemented with neural networks that contain a hidden layer
commonly known as the "competitive layer".
For every input vector, the competitive neurons "compete" with each other to see which one of
them is the most similar to that particular input vector. The winner neuron m sets its output to
O_m = 1, while all the other competitive neurons set their outputs to 0.
Usually, in order to measure similarity, the inverse of the Euclidean distance between the input
vector and the neuron's weight vector is used.
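A minimal winner-take-all sketch of this process (the two-cluster toy data, the number of
neurons, and the learning rate eta are illustrative assumptions):

# Winner-take-all competitive learning: two neurons cluster 2-D points.
# The toy data and learning rate are illustrative choices.
import random
import math

points = [(0.1, 0.2), (0.2, 0.1), (0.15, 0.15),   # cluster A
          (0.9, 0.8), (0.8, 0.9), (0.85, 0.85)]   # cluster B

weights = [[random.random(), random.random()] for _ in range(2)]
eta = 0.1   # learning rate

for _ in range(100):
    x = random.choice(points)
    # Competition: the neuron closest to x wins (output 1, all others 0)
    m = min(range(len(weights)), key=lambda j: math.dist(weights[j], x))
    # Only the winner's weights move toward the input
    weights[m][0] += eta * (x[0] - weights[m][0])
    weights[m][1] += eta * (x[1] - weights[m][1])

print(weights)   # each weight vector should settle near one cluster centre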
Genetic Algorithm
Algorithm:
1. Initialize a population of candidate solutions at random.
2. Evaluate the fitness of each individual.
3. Select the fitter individuals as parents.
4. Produce offspring by crossover (recombining parents) and mutation (random changes).
5. Replace the population with the offspring and repeat from step 2 until a good enough
solution is found.
Advantages:
GA works well when the problem does not have any mathematical model for the solution.
Disadvantages:
GA is less efficient in terms of speed of convergence.
GA has a tendency to get stuck in local maxima rather than reaching the global maximum.
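As an illustration, here is a minimal GA sketch on the classic OneMax problem (maximize the
number of 1-bits in a string); the problem, population size, and rates are illustrative choices:

# A minimal genetic algorithm on OneMax. All parameters are illustrative.
import random

LENGTH, POP, GENS, MUT = 20, 30, 50, 0.02

def fitness(ind):
    return sum(ind)                 # number of 1-bits

def select(pop):
    # Tournament selection: pick two at random, keep the fitter one
    a, b = random.sample(pop, 2)
    return a if fitness(a) > fitness(b) else b

def crossover(a, b):
    point = random.randrange(1, LENGTH)   # single-point crossover
    return a[:point] + b[point:]

def mutate(ind):
    return [1 - g if random.random() < MUT else g for g in ind]

population = [[random.randint(0, 1) for _ in range(LENGTH)] for _ in range(POP)]

for gen in range(GENS):
    population = [mutate(crossover(select(population), select(population)))
                  for _ in range(POP)]

best = max(population, key=fitness)
print(fitness(best), best)   # fitness should approach LENGTH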
Reinforcement Learning
In reinforcement learning, an agent learns by interacting with its environment and receiving
rewards or punishments for its actions.
Example: We have an agent and a reward, with many hurdles in between. The agent is supposed
to find the best possible path to reach the reward. Consider a robot, a diamond, and fire: the goal
of the robot is to get the reward, the diamond, while avoiding the hurdles, the fire. The robot
learns by trying all the possible paths and then choosing the path which gives it the reward with
the fewest hurdles. Each right step gives the robot a reward and each wrong step subtracts from
the robot's reward. The total reward is calculated when it reaches the final reward, the diamond.
1. Input: The input should be an initial state from which the model will start.
2. Output: There are many possible outputs, as there are a variety of solutions to a particular
problem.
3. Training: The training is based upon the input; the model will return a state and the user
will decide whether to reward or punish the model based on its output.
4. The model continues to learn.
5. The best solution is decided based on the maximum reward.
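A minimal sketch of this idea using Q-learning, one common reinforcement learning algorithm;
the 1-D track of states, the rewards, and the hyperparameters are illustrative assumptions, not
part of the example above:

# Q-learning on a 1-D track: the agent learns the path to the "diamond".
# States, rewards, and hyperparameters are illustrative choices.
import random

N_STATES = 6          # state 5 holds the reward (the "diamond")
ACTIONS = [-1, +1]    # step left / step right
Q = {(s, a): 0.0 for s in range(N_STATES) for a in ACTIONS}
alpha, gamma, eps = 0.5, 0.9, 0.1

for episode in range(200):
    s = 0
    while s != N_STATES - 1:
        # epsilon-greedy: mostly exploit the best known action, sometimes explore
        if random.random() < eps:
            a = random.choice(ACTIONS)
        else:
            a = max(ACTIONS, key=lambda a: Q[(s, a)])
        s2 = min(max(s + a, 0), N_STATES - 1)
        r = 1.0 if s2 == N_STATES - 1 else -0.1   # reward, or small penalty
        # Q-learning update: move Q toward reward + discounted future value
        Q[(s, a)] += alpha * (r + gamma * max(Q[(s2, b)] for b in ACTIONS)
                              - Q[(s, a)])
        s = s2

print([max(ACTIONS, key=lambda a: Q[(s, a)]) for s in range(N_STATES - 1)])
# should print all +1 steps: the learned path heads straight to the reward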
Advantages:
Maximizes performance.
Sustains change for a long period of time.
Disadvantages:
Too much reinforcement can lead to an overload of states, which can diminish the results.
Increases behavior.
Provides defiance to a minimum standard of performance.