MODULE 2
Concept Learning
Topics
• Concept learning task
• Concept learning as search through a hypothesis space
• Maximally Specific Hypotheses (FIND-S Algorithm)
• Version Spaces
• The Candidate Elimination Algorithm
• Inductive Bias
Concept Learning
• Learning involves acquiring general concepts from specific training examples.
Example: People continually learn general concepts or categories such as "bird,"
"car," "situations in which I should study more in order to pass the exam," etc.
• Each such concept can be viewed as describing some subset of objects or events defined over a larger set.
• Alternatively, each concept can be thought of as a Boolean-valued function
defined over this larger set. (Example: A function defined over all animals, whose
value is true for birds and false for other animals).
• Concept learning: inferring a Boolean-valued function from training examples of its input and output.
Notations
• The set of items over which the concept is defined is called the set of instances, which
we denote by X.
Example: X is the set of all possible days to play water sport, each represented by the
attributes: Sky, AirTemp, Humidity, Wind, Water, and Forecast
• The concept or function to be learned is called the target concept, which we denote by c; c can be any Boolean-valued function defined over the instances X, i.e., c : X → {0, 1}.
Example: The target concept corresponds to the value of the attribute EnjoySport
(i.e., c(x) = 1 if EnjoySport = Yes, and c(x) = 0 if EnjoySport = No).
• Instances for which c(x) = 1 are called positive examples, or members of the
target concept.
• Instances for which c(x) = 0 are called negative examples, or non-members of the
target concept.
• The ordered pair (x, c(x)) describes the training example consisting of the instance x and its target concept value c(x).
• We use D to denote the set of available training examples.
• The symbol H denotes the set of all possible hypotheses that the learner may consider regarding the identity of the target concept. Each hypothesis h in H represents a Boolean-valued function defined over X, i.e., h : X → {0, 1}.
• The goal of the learner is to find a hypothesis h such that h(x) = c(x) for all x in X.
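As a concrete illustration, this notation maps directly onto simple data structures. A minimal Python sketch (the tuple encoding and the variable names are assumptions for illustration, not part of the task definition):

# An instance x is a tuple of attribute values (Sky, AirTemp, Humidity,
# Wind, Water, Forecast); its label is c(x); D collects the (x, c(x)) pairs.
D = [
    (("Sunny", "Warm", "Normal", "Strong", "Warm", "Same"), 1),   # positive example
    (("Rainy", "Cold", "High", "Strong", "Warm", "Change"), 0),   # negative example
]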
A Concept Learning Task
Consider the example task of learning the target concept
"Days on which my friend Vasu enjoys his favorite water sport."
Example Sky AirTemp Humidity Wind Water Forecast EnjoySport
1 Sunny Warm Normal Strong Warm Same Yes
2 Sunny Warm High Strong Warm Same Yes
3 Rainy Cold High Strong Warm Change No
4 Sunny Warm High Strong Cool Change Yes
The attribute EnjoySport indicates whether or not a person enjoys his favorite water sport on this day.
The task is to learn to predict the value of EnjoySport for an arbitrary day, based on the values of its other attributes.
What hypothesis representation is provided to the learner?
Let’s consider a simple representation in which each hypothesis consists of a
conjunction of constraints on the instance attributes.
Let each hypothesis be a vector of six constraints, specifying the values of the six
attributes Sky, AirTemp, Humidity, Wind, Water, and Forecast.
For each attribute, the hypothesis will either
• Indicate by a "?" that any value is acceptable for this attribute,
• Specify a single required value (e.g., Warm) for the attribute, or
• Indicate by a "Φ" that no value is acceptable
If some instance x satisfies all the constraints of hypothesis h, then h classifies
x as a positive example (h(x) = 1).
The hypothesis that a person enjoys his favorite sport only on cold days with high humidity (independent of the values of the other attributes) is represented by the expression
<?, Cold, High, ?, ?, ?>
The most general hypothesis-that every day is a positive example-is represented by
<?, ?, ?, ?, ?, ?>
The most specific possible hypothesis-that no day is a positive example-is
represented by
<Φ , Φ, Φ, Φ, Φ, Φ>
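This matching rule is easy to state in code. A minimal Python sketch (the function name satisfies and the string encodings "?" and "Φ" are illustrative assumptions):

def satisfies(h, x):
    # x satisfies h iff every constraint is "?" or equals x's value;
    # a "Φ" constraint never matches, so such an h classifies every x negative.
    return all(a == "?" or a == v for a, v in zip(h, x))

h = ("?", "Cold", "High", "?", "?", "?")
x = ("Rainy", "Cold", "High", "Strong", "Warm", "Change")
print(satisfies(h, x))  # True, i.e. h(x) = 1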
EnjoySport Concept Learning Task
Given the instances X, the hypothesis space H, the target concept c (EnjoySport), and the training examples D defined above, determine a hypothesis h in H such that h(x) = c(x) for all x in X.
The Inductive Learning Hypothesis
Any hypothesis found to approximate the target
function well over a sufficiently large set of training
examples will also approximate the target function well
over other unobserved examples.
Concept learning as Search
• Concept learning can be viewed as the task of searching through a large space of hypotheses implicitly defined
by the hypothesis representation.
• The goal of this search is to find the hypothesis that best fits the training
examples.
For example, consider the instances X and hypotheses H in the EnjoySport learning task. The attribute Sky has three possible values, and AirTemp, Humidity, Wind, Water, and Forecast each have two possible values, so the instance space X contains exactly
• 3 · 2 · 2 · 2 · 2 · 2 = 96 distinct instances, and
• 5 · 4 · 4 · 4 · 4 · 4 = 5120 syntactically distinct hypotheses within H (each attribute may additionally take the values "?" and "Φ").
Every hypothesis containing one or more "Φ" symbols represents the empty set of instances; that is, it classifies every instance as negative. Hence there are
• 1 + (4 · 3 · 3 · 3 · 3 · 3) = 973 semantically distinct hypotheses within H.
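These counts are plain products over the per-attribute choices and can be checked mechanically (a quick illustrative Python check):

sky, other = 3, 2                            # values of Sky vs. each of the other five attributes
instances = sky * other ** 5                 # 3·2·2·2·2·2 = 96
syntactic = (sky + 2) * (other + 2) ** 5     # add "?" and "Φ" per attribute: 5·4^5 = 5120
semantic = 1 + (sky + 1) * (other + 1) ** 5  # all Φ-hypotheses collapse to one empty concept: 973
print(instances, syntactic, semantic)        # 96 5120 973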
General-to-Specific Ordering of Hypotheses
• Consider the two hypotheses
h1 = <Sunny, ?, ?, Strong, ?, ?>
h2 = <Sunny, ?, ?, ?, ?, ?>
• Consider the sets of instances that are classified positive by h1 and by h2.
• Because h2 imposes fewer constraints on the instance, it classifies more instances as positive: any instance classified positive by h1 will also be classified positive by h2. Therefore, h2 is more general than h1.
• Given hypotheses hj and hk, hj is more-general-than-or-equal-to hk if and only if any instance that satisfies hk also satisfies hj.
Definition: Let hj and hk be Boolean-valued functions defined over X. Then hj is more-general-than-or-equal-to hk (written hj ≥ hk) if and only if
(∀x ∈ X) [(hk(x) = 1) → (hj(x) = 1)]
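For the conjunctive representation used here, this test reduces to an attribute-wise comparison. A minimal Python sketch (illustrative names; it ignores the corner case of "Φ" constraints, which satisfy no instance at all):

def more_general_or_equal(hj, hk):
    # hj ≥ hk iff every constraint of hj is "?" or equals hk's constraint
    return all(cj == "?" or cj == ck for cj, ck in zip(hj, hk))

h1 = ("Sunny", "?", "?", "Strong", "?", "?")
h2 = ("Sunny", "?", "?", "?", "?", "?")
print(more_general_or_equal(h2, h1))  # True: h2 ≥ h1
print(more_general_or_equal(h1, h2))  # False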
• In the figure, the box on the left represents the set X of all instances, and the box on the right represents the set H of all hypotheses.
• Each hypothesis corresponds to some subset of X: the subset of instances that it classifies positive.
• The arrows connecting hypotheses represent the more-general-than relation, with the arrow pointing toward the less general hypothesis.
• Note that the subset of instances characterized by h2 subsumes the subset characterized by h1; hence h2 is more-general-than h1.
FIND-S: Finding a Maximally Specific Hypothesis
FIND-S Algorithm:
1. Initialize h to the most specific hypothesis in H
2. For each positive training instance x
      For each attribute constraint ai in h
           If the constraint ai is satisfied by x
           Then do nothing
           Else replace ai in h by the next more general constraint that is satisfied by x
3. Output hypothesis h
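Before tracing the algorithm by hand, here is a minimal runnable Python sketch of FIND-S for the conjunctive representation above (the function name find_s and the string encodings "Φ" and "?" are illustrative assumptions):

def find_s(examples, n_attrs=6):
    h = ["Φ"] * n_attrs                 # 1. most specific hypothesis in H
    for x, label in examples:
        if label != 1:
            continue                    # negative examples are ignored
        for i in range(n_attrs):        # 2. generalize just enough to cover x
            if h[i] == "Φ":
                h[i] = x[i]             # adopt the first observed value
            elif h[i] != x[i]:
                h[i] = "?"              # next more general constraint
    return tuple(h)                     # 3. output hypothesis h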
To illustrate this algorithm, assume the learner is given the sequence of training
examples from the EnjoySport task
Example Sky AirTemp Humidity Wind Water Forecast EnjoySport
1 Sunny Warm Normal Strong Warm Same Yes
2 Sunny Warm High Strong Warm Same Yes
3 Rainy Cold High Strong Warm Change No
4 Sunny Warm High Strong Cool Change Yes
The first step of FIND-S is to initialize h to the most specific hypothesis in H
h = <Φ, Φ, Φ, Φ, Φ, Φ>
x1 = <Sunny, Warm, Normal, Strong, Warm, Same>, +
Observing the first training example, it is clear that our hypothesis is too specific. In particular, none of the "Φ" constraints in h are satisfied by this example, so each is replaced by the next more general constraint that fits the example:
h1 = <Sunny, Warm, Normal, Strong, Warm, Same>
This h is still very specific; it asserts that all instances are negative except for the single
positive training example
x2 = <Sunny, Warm, High, Strong, Warm, Same>, +
The second training example forces the algorithm to further generalize h, this time substituting a "?" in place of any attribute value in h that is not satisfied by the new example:
h2 = <Sunny, Warm, ?, Strong, Warm, Same>
x3 = <Rainy, Cold, High, Strong, Warm, Change>, -
Upon encountering the third training example, the algorithm makes no change to h. The FIND-S algorithm simply ignores every negative example.
h3 = < Sunny, Warm, ?, Strong, Warm, Same>
x4 = <Sunny, Warm, High, Strong, Cool, Change>, +
The fourth example leads to a further generalization of h
h4 = < Sunny, Warm, ?, Strong, ?, ? >
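Running the find_s sketch given earlier on these four examples reproduces the trace:

D = [
    (("Sunny", "Warm", "Normal", "Strong", "Warm", "Same"), 1),
    (("Sunny", "Warm", "High", "Strong", "Warm", "Same"), 1),
    (("Rainy", "Cold", "High", "Strong", "Warm", "Change"), 0),
    (("Sunny", "Warm", "High", "Strong", "Cool", "Change"), 1),
]
print(find_s(D))  # ('Sunny', 'Warm', '?', 'Strong', '?', '?'), i.e. h4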
The key properties of the FIND-S algorithm:
• FIND-S is guaranteed to output the most specific hypothesis within H that is
consistent with the positive training examples
• The FIND-S algorithm's final hypothesis will also be consistent with the negative examples, provided the correct target concept is contained in H and the training examples are correct.
Unanswered by FIND-S
1. Has the learner converged to the correct target concept?
2. Why prefer the most specific hypothesis?
3. Are the training examples consistent?
4. What if there are several maximally specific consistent hypotheses?