Skip to content

Commit 551ce42

Browse files
committed
adds intro section to learning notebook
1 parent 915d55f commit 551ce42

File tree

1 file changed

+49
-3
lines changed

1 file changed

+49
-3
lines changed

learning.ipynb

Lines changed: 49 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -1,14 +1,56 @@
11
{
22
"cells": [
33
{
4-
"cell_type": "code",
5-
"execution_count": null,
4+
"cell_type": "markdown",
65
"metadata": {
76
"collapsed": false
87
},
8+
"source": [
9+
"# Learning\n",
10+
"\n",
11+
"This notebook serves as supporting material for topics covered in **Chapter 18 - Learning from Examples** , **Chapter 19 - Knowledge in Learning**, **Chapter 20 - Learning Probabilistic Models** from the book *Artificial Intelligence: A Modern Approach*. This notebook uses implementations from [learning.py](https://github.com/aimacode/aima-python/blob/master/learning.py). Let's start by importing everything from learning module."
12+
]
13+
},
14+
{
15+
"cell_type": "code",
16+
"execution_count": 1,
17+
"metadata": {
18+
"collapsed": true
19+
},
920
"outputs": [],
1021
"source": [
11-
"import learning"
22+
"from learning import *"
23+
]
24+
},
25+
{
26+
"cell_type": "markdown",
27+
"metadata": {},
28+
"source": [
29+
"## Review\n",
30+
"\n",
31+
"In this notebook, we learn about agents that can improve their behavior through diligent study of their own experiences.\n",
32+
"\n",
33+
"An agent is **learning** if it improves its performance on future tasks after making observations about the world.\n",
34+
"\n",
35+
"There are three types of feedback that determine the three main types of learning:\n",
36+
"\n",
37+
"* **Supervised Learning**:\n",
38+
"\n",
39+
"In Supervised Learning the agent observeses some example input-output pairs and learns a function that maps from input to output.\n",
40+
"\n",
41+
"**Example**: Let's think of an agent to classify images containing cats or dogs. If we provide an image containing a cat or a dog, this agent should output a string \"cat\" or \"dog\" for that particular image. To teach this agent, we will give a lot of input-output pairs like {cat image-\"cat\"}, {dog image-\"dog\"} to the aggent. The agent then learns a function that maps from an input image to one of those strings.\n",
42+
"\n",
43+
"* **Unsupervised Learning**:\n",
44+
"\n",
45+
"In Unsupervised Learning the agent learns patterns in the input even though no explicit feedback is supplied. The most common type is **clustering**: detecting potential useful clusters of input examples.\n",
46+
"\n",
47+
"**Example**: A taxi agent would develop a concept of *good traffic days* and *bad traffic days* without ever being given labeled examples.\n",
48+
"\n",
49+
"* **Reinforcement Learning**:\n",
50+
"\n",
51+
"In Reinforcement Learning the agent from a series of reinforcements—rewards or punishments.\n",
52+
"\n",
53+
"**Example**: Let's talk about an agent to play the popular Atari game—[Pong](http://www.ponggame.org). We will reward a point for every correct move and deduct a point for every wrong move from the agent. Eventually, the agent will figure out its actions prior to reinforcement were most responsible for it."
1254
]
1355
},
1456
{
@@ -38,6 +80,10 @@
3880
"nbconvert_exporter": "python",
3981
"pygments_lexer": "ipython3",
4082
"version": "3.5.1"
83+
},
84+
"widgets": {
85+
"state": {},
86+
"version": "1.1.1"
4187
}
4288
},
4389
"nbformat": 4,

0 commit comments

Comments
 (0)
pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy