Unit 3
Definition of learning, Forms of learning, Learning by taking advice,
Learning in problem solving, Induction learning
Prof. Pranjal Pandit
Contents
• Knowledge representation
• Types of knowledge
• Rules, Rule-based expert systems.
• Inference: Backward chaining, Forward chaining, Rule value approach,
Inference engine.
• Planning: Goal Tree, Non-linear planning, Hierarchical planning, Goal
stack planning
• Definition of learning, Forms of learning, Learning by taking advice,
Learning in problem solving, Induction learning
• Expert systems - Architecture of expert systems, Roles of expert
systems, Knowledge Acquisition
Learning
A standard, widely used formal definition (Tom Mitchell, 1997) is:
A computer program is said to learn from experience E with respect to some class of
tasks T and performance measure P, if its performance at tasks in T, as measured by P,
improves with experience E.
Plain language: learning = improvement on a task produced by experience/data,
measured by some performance metric.
Example: a spam filter (T = classify emails, P = classification accuracy) improves as it sees
labeled emails (E).
Key ingredients in the definition:
• Task (T) — what the system is expected to do (classification, control, planning).
• Experience (E) — data, interactions, demonstrations, rewards, etc.
• Performance measure (P) — accuracy, average reward, time to solution, etc.
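A minimal sketch tying T, E, and P together in code, assuming scikit-learn is available; the toy emails and labels are invented for illustration:

# T: classify emails as spam/ham; E: labeled emails; P: accuracy.
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.naive_bayes import MultinomialNB
from sklearn.pipeline import make_pipeline

emails = ["win a free prize now", "meeting at 10am tomorrow",
          "free prize offer click now", "project report attached"]  # part of E
labels = [1, 0, 1, 0]                                               # spam = 1

model = make_pipeline(CountVectorizer(), MultinomialNB())
model.fit(emails, labels)            # learning from experience E
print(model.score(emails, labels))   # P: accuracy, measured here on the training set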
Forms of learning
1. By type of feedback
A) Supervised learning — labeled examples (input → correct output).
Example: image classification, regression.
B) Unsupervised learning — no labels; discover structure (clusters,
manifolds).
Example: k-means clustering (sketched after this list), PCA.
C) Reinforcement learning (RL) — scalar reward signal obtained through
interaction; learn a policy to maximize long-run reward.
Example: game playing, robotic control.
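For contrast with the supervised spam example above, a minimal unsupervised sketch, assuming scikit-learn; the 2-D points are invented so that two clusters are obvious:

import numpy as np
from sklearn.cluster import KMeans

# No labels are given: k-means discovers the grouping on its own.
X = np.array([[1.0, 1.0], [1.2, 0.9], [0.8, 1.1],   # cluster near (1, 1)
              [5.0, 5.0], [5.1, 4.9], [4.8, 5.2]])  # cluster near (5, 5)
km = KMeans(n_clusters=2, n_init=10, random_state=0).fit(X)
print(km.labels_)   # e.g. [1 1 1 0 0 0]: group membership, learned without labels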
2. By interaction style
• Batch learning — learner sees a dataset and trains offline.
• Online (incremental) learning — model updates continuously as new data
arrives (see the sketch after this list).
• Active learning — learner queries an oracle (asks for labels) for the
most informative instances.
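A minimal online-learning sketch, assuming a recent scikit-learn (SGDClassifier supports incremental updates via partial_fit); the data stream is simulated:

import numpy as np
from sklearn.linear_model import SGDClassifier

# Online learning: update the model one mini-batch at a time,
# rather than training once on a full offline dataset.
rng = np.random.default_rng(0)
clf = SGDClassifier(loss="log_loss")          # logistic regression via SGD
classes = np.array([0, 1])                    # must be declared up front
for _ in range(100):                          # simulated stream of mini-batches
    X = rng.normal(size=(8, 2))
    y = (X[:, 0] + X[:, 1] > 0).astype(int)   # simple synthetic labeling rule
    clf.partial_fit(X, y, classes=classes)
print(clf.predict([[1.0, 1.0], [-1.0, -1.0]]))  # typically [1 0]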
3. By representation and method
• Instance-based (lazy) — store examples and use them at query time
(k-NN; sketched after this list).
• Model-based (parametric) — build a compact model (linear regression,
neural network).
• Symbolic vs subsymbolic — rules/logic vs neural nets.
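A minimal instance-based (lazy) learner in plain Python; the toy 2-D examples are invented:

from collections import Counter

def knn_predict(train, query, k=3):
    """Lazy prediction: no model is built in advance; the stored
    examples are consulted only when a query arrives."""
    # train: list of ((x1, x2), label) pairs
    dist = lambda a, b: sum((ai - bi) ** 2 for ai, bi in zip(a, b))
    nearest = sorted(train, key=lambda ex: dist(ex[0], query))[:k]
    return Counter(label for _, label in nearest).most_common(1)[0][0]

examples = [((0, 0), "A"), ((0, 1), "A"), ((5, 5), "B"), ((6, 5), "B")]
print(knn_predict(examples, (0.5, 0.5)))   # "A": majority vote of stored neighbors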
4. Hybrid and advanced forms
A) Semi-supervised learning — mix of labeled and unlabeled data.
B) Self-supervised learning — create supervisory signals from the data
itself (predict missing parts).
C) Imitation learning / Learning from demonstration (LfD)
— learn policies from demonstrated trajectories.
D) Transfer learning / Meta-learning — reuse knowledge
from prior tasks to speed learning on new ones.
5. By goal
• Classification / Regression — predict labels or continuous values.
• Clustering / Dimensionality reduction — discover structure.
• Policy learning / Planning — produce actions or plans.
Learning by taking advice
The learner receives advice (hints, rules, demonstrations, corrective feedback)
from a teacher/mentor/oracle and uses it to speed or guide learning.
Modes of advice
• Demonstrations: teacher shows correct behavior (e.g., driving traces →
imitation learning / behavior cloning).
• Corrective feedback / evaluative advice: teacher gives “good/bad” signals
for actions (e.g., TAMER-style human feedback).
• Hints / constraints / rules: symbolic advice like “avoid region X” or
“variable Y is important”.
• Policy advice / demonstrations for bootstrapping: initial policy from
teacher, then refine via RL.
Algorithms / approaches
• Behavioral cloning: treat demonstrations as supervised learning (state
→ action); see the sketch after this list.
• Inverse Reinforcement Learning (IRL): infer the teacher’s reward
function from demonstrations.
• Interactive RL / Reward shaping: use teacher’s evaluative feedback
to shape rewards or policies.
• DAgger (Dataset Aggregation): iteratively combines the learner's own
trajectories with teacher corrections to avoid compounding errors.
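A minimal behavioral-cloning sketch, assuming scikit-learn; the one-dimensional states, three discrete actions, and teacher demonstrations are all invented for illustration:

import numpy as np
from sklearn.tree import DecisionTreeClassifier

# Demonstrations are (state, action) pairs; the policy is learned
# as an ordinary supervised classifier over them.
demo_states  = np.array([[-2.0], [-1.0], [0.0], [1.0], [2.0]])
demo_actions = np.array([0, 0, 1, 2, 2])    # teacher's action in each state

policy = DecisionTreeClassifier().fit(demo_states, demo_actions)
print(policy.predict([[-1.5], [1.5]]))      # imitate the teacher: [0 2]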
Advantages
• Faster learning; avoids random/exploratory mistakes.
• Transfers human expertise directly.
• Useful when exploration is costly/dangerous.
Pitfalls & cautions
• Bias / suboptimal advice: poor advice can mislead the learner.
• Over-reliance: learner may fail to generalize beyond advice scope.
• Inconsistency: conflicting advice complicates learning (needs mechanisms
to weigh trust).
Practical example
A human steers a drone for a few flights (demonstrations); the agent clones
the behavior, then refines it with RL using a simulator.
Learning in problem solving
Learning can be integrated with classical search and problem-solving to make future
solutions faster or better.
Main ideas
• Learn better heuristics: use past solved problems to learn a heuristic function
h(state) that guides search (A*, IDA*). Example: learning pattern databases for
the 15-puzzle.
• Learn macro-operators / chunks: create higher-level actions (macros) that
collapse repeated subplans into single operators to speed future planning.
• Explanation-based learning (EBL): from a solved example + domain theory,
derive a general rule that makes solving similar problems trivial.
• Case-based reasoning (CBR): store solved problem cases and adapt their
solutions to new similar problems.
• RL for problem solving: learn policies that map problem states to actions (e.g.,
Q-learning for maze navigation; sketched after this list).
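A minimal tabular Q-learning sketch in plain Python; the five-state corridor "maze", actions, and rewards are invented for illustration:

import random

random.seed(0)
# Corridor of states 0..4, goal = 4. Actions: 0 = step left, 1 = step right.
# Reward 1 only on reaching the goal, 0 otherwise.
Q = {(s, a): 0.0 for s in range(5) for a in (0, 1)}
alpha, gamma, eps = 0.5, 0.9, 0.2

for _ in range(500):                                  # episodes of experience
    s = 0
    while s != 4:
        if random.random() < eps:                     # explore
            a = random.choice((0, 1))
        else:                                         # exploit current estimates
            a = max((0, 1), key=lambda act: Q[(s, act)])
        s2 = max(0, s - 1) if a == 0 else min(4, s + 1)
        r = 1.0 if s2 == 4 else 0.0
        Q[(s, a)] += alpha * (r + gamma * max(Q[(s2, 0)], Q[(s2, 1)]) - Q[(s, a)])
        s = s2

print([max((0, 1), key=lambda act: Q[(s, act)]) for s in range(4)])
# greedy learned policy: [1, 1, 1, 1], i.e. always step right toward the goal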
Example (8-puzzle):
• Experience: solve many random puzzles via search.
• Learn: estimate of distance-to-goal for common subpatterns (heuristic
table).
• Result: future search cut drastically because heuristic is more informed.
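A minimal sketch of building such a heuristic table, here by exhaustive breadth-first search backwards from the goal. This is feasible for the full 8-puzzle; real pattern databases apply the same idea to abstracted subpatterns of larger puzzles:

from collections import deque

GOAL = (1, 2, 3, 4, 5, 6, 7, 8, 0)          # 0 marks the blank tile

def neighbours(state):
    """All states reachable by sliding one tile into the blank."""
    i = state.index(0)
    r, c = divmod(i, 3)
    for dr, dc in ((-1, 0), (1, 0), (0, -1), (0, 1)):
        nr, nc = r + dr, c + dc
        if 0 <= nr < 3 and 0 <= nc < 3:
            j = nr * 3 + nc
            s = list(state)
            s[i], s[j] = s[j], s[i]
            yield tuple(s)

# BFS from the goal gives the exact distance-to-goal for every
# reachable state: a perfect heuristic table for later searches.
dist = {GOAL: 0}
queue = deque([GOAL])
while queue:
    s = queue.popleft()
    for n in neighbours(s):
        if n not in dist:
            dist[n] = dist[s] + 1
            queue.append(n)

print(len(dist))                            # 181440 = 9!/2 reachable states
print(dist[(1, 2, 3, 4, 5, 6, 0, 7, 8)])    # 2 (blank slides right twice)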
Why it helps
• Reduces search branching; speeds solution time; may enable solving
problems that were infeasible before.
Tradeoffs
• Time + memory spent to learn/store heuristics or cases.
• Risk of over-specialization to seen problems (less generalization).
Induction learning
• What is induction?
Induction is the process of forming general rules/hypotheses from
specific observed examples. In machine learning we typically induce
models (hypotheses) that generalize from training examples to
unseen instances.
Contrast with deduction
• Deduction: apply general rules to derive conclusions about specific
cases.
• Induction: infer general rules from specific observations.
Core components
• Hypothesis space (H): all candidate functions/models the learner
considers.
• Training examples: labeled instances used to evaluate hypotheses.
• Search/selection mechanism: algorithm that finds a good hypothesis
(e.g., minimize training error + regularization).
• Inductive bias: assumptions (e.g., simplicity, smoothness) that let us
prefer some hypotheses over others so generalization is possible.
Classic algorithms & ideas
• Decision trees (ID3/C4.5): induce a tree structure from examples using information
gain (see the sketch after this list).
• Linear models: learn weights to fit data (perceptron, logistic regression).
• Nearest neighbour: a lazy form of induction — use stored examples to classify new points.
• Version space learning: maintain the set of hypotheses consistent with the examples,
bounded by its most general and most specific members. (Useful as a conceptual model.)
• Statistical learning theory / PAC: formalizes conditions under which induction
generalizes (sample complexity, VC dimension).
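A minimal sketch of the information-gain calculation at the heart of ID3, in plain Python; the toy records are invented (they anticipate the isBird illustration below):

import math
from collections import Counter

def entropy(labels):
    """Shannon entropy of a label list, in bits."""
    n = len(labels)
    return -sum((c / n) * math.log2(c / n) for c in Counter(labels).values())

def info_gain(rows, labels, attr):
    """Reduction in label entropy from splitting on attr (ID3's criterion)."""
    n = len(labels)
    gain = entropy(labels)
    for value in set(r[attr] for r in rows):
        sub = [l for r, l in zip(rows, labels) if r[attr] == value]
        gain -= (len(sub) / n) * entropy(sub)
    return gain

rows = [{"feathers": True,  "swims": False},
        {"feathers": True,  "swims": True},
        {"feathers": False, "swims": True},
        {"feathers": False, "swims": False}]
labels = [True, True, False, False]            # isBird
print(info_gain(rows, labels, "feathers"))     # 1.0: perfectly informative split
print(info_gain(rows, labels, "swims"))        # 0.0: uninformative split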
Bias-variance & overfitting
• Overfitting: hypothesis fits training data too closely and fails to generalize.
• Underfitting: hypothesis too simple to capture patterns.
• Principles to manage: regularization, cross-validation, choosing model complexity,
more data.
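A quick way to see overfitting in practice, assuming scikit-learn: compare training fit against cross-validated accuracy on noisy synthetic data (invented for illustration):

import numpy as np
from sklearn.model_selection import cross_val_score
from sklearn.tree import DecisionTreeClassifier

rng = np.random.default_rng(0)
X = rng.normal(size=(200, 5))
y = (X[:, 0] > 0).astype(int)
y[rng.random(200) < 0.2] ^= 1                 # flip 20% of labels (noise)

deep = DecisionTreeClassifier()               # unconstrained: memorizes the noise
print(deep.fit(X, y).score(X, y))             # 1.0 training accuracy
print(cross_val_score(deep, X, y, cv=5).mean())   # typically noticeably lower
print(cross_val_score(DecisionTreeClassifier(max_depth=2), X, y, cv=5).mean())
# the depth-limited (regularized) tree tends to generalize better on this data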
Simple illustration
Suppose we want to learn isBird(x) from attributes:
• Examples: {(feathers=True) → Bird=True, (feathers=False, swims=True) →
Bird=False, ...}
• Induction might produce rule: isBird(x) := has_feathers(x).
Algorithmic approach: search the hypothesis space of logical rules or decision trees;
pick the rule with the best generalization.
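A minimal sketch of that search, over a hypothetical hypothesis space containing only single-attribute rules:

# Search the rule space "isBird(x) := attr(x)" and keep the rule
# that makes the fewest errors on the (invented) training examples.
examples = [({"feathers": True,  "swims": False}, True),
            ({"feathers": False, "swims": True},  False),
            ({"feathers": True,  "swims": True},  True)]

def errors(attr):
    return sum(x[attr] != label for x, label in examples)

best = min(["feathers", "swims"], key=errors)
print(f"isBird(x) := {best}(x)   ({errors(best)} training errors)")
# -> isBird(x) := feathers(x)   (0 training errors)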
Key points about induction
• It always requires bias — no purely “data-only” method can generalize without
assumptions.
• Quality of induced model judged by generalization (test performance), not
training fit alone.
• Inductive methods range from symbolic (rules, trees) to statistical (probabilistic
models, neural nets).