Unit 4
Syntax Level Analysis
POS Tagging
• Parts of Speech (PoS) tagging is a core task in NLP.
• It assigns each word a grammatical category such as noun, verb,
adjective, or adverb.
• By making phrase structure, and ultimately meaning, easier to recover, this
technique helps machines analyze human language more accurately.
Example
Implementation of Parts-of-Speech tagging using
NLTK
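The slide announces an NLTK implementation; below is a minimal sketch, assuming the standard NLTK tokenizer and tagger resources can be downloaded on first use (resource names can vary slightly across NLTK versions).

```python
# Minimal POS tagging sketch with NLTK.
import nltk
from nltk import word_tokenize, pos_tag

nltk.download("punkt")                       # sentence/word tokenizer models
nltk.download("averaged_perceptron_tagger")  # default English POS tagger

text = "POS tagging assigns a grammatical category to every word."
tokens = word_tokenize(text)
print(pos_tag(tokens))
# e.g. [('POS', 'NNP'), ('tagging', 'NN'), ('assigns', 'VBZ'), ...]
```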
Workflow of POS Tagging in NLP
• Tokenization: The input text is divided into individual tokens, representing words or subwords.
Tokenization is the foundational step in most NLP tasks and enables further analysis at the word
level.
• Loading a Language Model: Tools like NLTK or spaCy require a pre-trained language model to
perform POS tagging. These models are trained on large datasets and capture the
grammatical rules and structure of the language (a sketch of the full workflow follows this list).
• Text Preprocessing: The text is then cleaned to improve accuracy. Common preprocessing steps
include converting text to lowercase, removing special characters, and eliminating irrelevant
content.
• Linguistic Analysis: This stage involves parsing the sentence to understand the grammatical role
of each token. It lays the groundwork for assigning the appropriate part of speech by interpreting
the sentence’s syntactic structure.
• POS Tagging: Each token is then assigned a specific part-of-speech label. This is based on its role
in the sentence and contextual clues provided by surrounding words.
• Result Evaluation: Finally, the POS-tagged output is reviewed to ensure accuracy. Any
misclassifications or anomalies are identified and corrected as needed.
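As a sketch of this workflow, the snippet below uses spaCy; the model name en_core_web_sm and the toy text are assumptions, not part of the original slide.

```python
# Workflow sketch with spaCy: load a pre-trained model, lightly preprocess,
# then tokenize and tag (assumes: python -m spacy download en_core_web_sm).
import spacy

nlp = spacy.load("en_core_web_sm")        # loading a language model

raw = "  The quick brown Fox JUMPS over the lazy dog!!  "
clean = " ".join(raw.lower().split())     # preprocessing: lowercase, trim spaces

doc = nlp(clean)                          # tokenization + linguistic analysis
for token in doc:
    # coarse-grained (pos_) and fine-grained Penn Treebank (tag_) labels
    print(token.text, token.pos_, token.tag_)
```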
Why Is It Difficult?
• Although it seems easy, identifying part-of-speech tags is much more
complicated than simply mapping each word to a fixed tag, because many
words can take different tags depending on context.
If it is difficult, then what approaches do we have?
Word Classes
• In grammar, a part of speech (POS), also known as a word class or
grammatical category, is a category of words that
have similar grammatical properties.
• The English language has four major word classes: Nouns, Verbs,
Adjectives, and Adverbs.
• Commonly listed English parts of speech are nouns, verbs, adjectives,
adverbs, pronouns, prepositions, conjunctions, interjections,
numerals, articles, and determiners.
• These can be further categorized into open and closed classes.
Closed Class
• Closed classes are those with a relatively fixed number of words; we
rarely add new words to these classes (e.g., prepositions). Closed
class words are generally functional words like of, it, and, or
you, which tend to be very short, occur frequently, and often have
structuring uses in grammar.
• Example of closed class-
• Determiners: a, an, the
• Pronouns: she, he, I, others
• Prepositions: on, under, over, near, by, at, from, to, with
Open Class
• Open classes are mostly content-bearing, i.e., they refer to objects,
actions, and features; they are called open classes because new words are
added to them all the time.
• By contrast, nouns and verbs, adjectives, and adverbs belong to open
classes; new nouns and verbs like iPhone or to fax are continually
being created or borrowed.
• Example of open class-
• Nouns: computer, board, peace, school
• Verbs: say, walk, run, belong
• Adjectives: clean, quick, rapid, enormous
• Adverbs: quickly, softly, enormously, cheerfully
Tag set
The problem is that many words belong to more than one word class.
And to do POS tagging, a standard set needs to be chosen.
We could pick very simple/coarse tag sets such as Noun (NN), Verb
(VB), Adjective (JJ), Adverb (RB), etc.
But to reduce ambiguity, the commonly used set is finer-grained: the
University of Pennsylvania's Penn Treebank tagset, which has a total of 45 tags.
Parts of Speech Tagging
• Tagging is a disambiguation task; words are ambiguous, i.e., they have
more than one possible part of speech, and the goal is to find the
correct tag for the situation.
• For example, a book can be a verb (book that flight) or a noun (hand
me that book).
• The goal of POS tagging is to resolve these ambiguities, choosing the
proper tag for the context.
Looking into the Operational Modalities Adopted in Some of the POS Tagging Tools in Identification of Contextual Part-of-Speech of Words in Texts
[Link]
Rule-Based Tagging
• Rule-based tagging is the oldest tagging approach where we use contextual information to assign
tags to unknown or ambiguous words.
• The rule-based approach uses a dictionary to get possible tags for tagging each word. If the word
has more than one possible tag, then rule-based taggers use hand-written rules to identify the
correct tag.
• Since the rules are usually built manually, such taggers are also called knowledge-driven taggers.
The number of rules is limited, roughly 1,000 for the English language.
• One example of a rule is as follows (a toy sketch appears after this list):
• Sample Rule: If an ambiguous word “X” is preceded by a determiner and followed by a noun, tag
it as an adjective;
• A nice car: nice is an ADJECTIVE here.
• Limitations/Disadvantages of Rule-Based Approach:
• High development cost and high time complexity when applying to a large corpus of text
• Defining a set of rules manually is an extremely cumbersome process and is not scalable at all
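To make the sample rule concrete, here is a toy sketch; the mini-lexicon, tag names, and the single rule are illustrative assumptions, not a real rule set.

```python
# Toy rule-based tagger: dictionary lookup plus one hand-written contextual rule.
LEXICON = {
    "a": {"DET"}, "an": {"DET"}, "the": {"DET"},
    "nice": {"ADJ", "NOUN"},          # ambiguous word in this toy lexicon
    "car": {"NOUN"},
}

def rule_based_tag(tokens):
    tags = []
    for i, word in enumerate(tokens):
        candidates = LEXICON.get(word.lower(), {"NOUN"})   # default guess for unknown words
        if len(candidates) == 1:
            tags.append(next(iter(candidates)))
            continue
        # Sample rule: preceded by a determiner and followed by a noun -> adjective
        prev_is_det = i > 0 and tags[-1] == "DET"
        next_is_noun = (i + 1 < len(tokens)
                        and "NOUN" in LEXICON.get(tokens[i + 1].lower(), set()))
        tags.append("ADJ" if prev_is_det and next_is_noun else sorted(candidates)[0])
    return list(zip(tokens, tags))

print(rule_based_tag(["a", "nice", "car"]))
# [('a', 'DET'), ('nice', 'ADJ'), ('car', 'NOUN')]
```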
Stochastic POS Tagging
• Stochastic POS Tagger uses probabilistic and statistical information
from the corpus of labeled text (where we know the actual tags of
words in the corpus) to assign a POS tag to each word in a sentence.
• This tagger can use techniques like Word frequency
measurements and Tag Sequence Probabilities. It can either use one
of these approaches or a combination of both.
Word Frequency Measurements
• The tag encountered most frequently in the corpus is the one assigned to
ambiguous words (words having two or more possible POS tags).
• Let’s understand this approach using some example sentences :
• Ambiguous Word = “play”
• Sentence 1 : I play cricket every day. POS tag of play = VERB
• Sentence 2 : I want to perform a play. POS tag of play = NOUN
• The word frequency method will now check the most frequently used POS
tag for "play". Let's say this most frequent POS tag happens to be VERB; then we
assign the POS tag of "play" = VERB (a sketch of this approach follows this list).
• The main drawback of this approach is that it can yield invalid sequences of
tags.
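A sketch of the word-frequency idea using NLTK's UnigramTagger, which assigns each word its most frequent tag in a labeled corpus; the treebank sample corpus and split size are assumptions.

```python
# Word-frequency tagging sketch: UnigramTagger picks each word's most frequent
# tag in the training corpus (assumes nltk.download("treebank") has been run).
from nltk.corpus import treebank
from nltk.tag import UnigramTagger

train_sents = treebank.tagged_sents()[:3000]   # labeled corpus
unigram = UnigramTagger(train_sents)

# 'play' receives whichever tag it carried most often in training,
# regardless of the surrounding words.
print(unigram.tag("I play cricket every day .".split()))
```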
Tag Sequence Probabilities
• In this method, the best tag for a given word is determined by the probability that it occurs with “n”
previous tags.
• Simply put, assume we have a new sequence of 4 words, w1 , w2 , w3 , w4, and we need to identify the POS
tag of w4
• If n = 3, we will consider the POS tags of 3 words prior to w4 in the labeled corpus of text
• Let’s say the POS tags for
• w1 = NOUN, w2 = VERB , w3 = DETERMINER
• In short, N, V, D: NVD
• Then in the labeled corpus of text, we will search for this NVD sequence.
• Let’s say we found 100 such NVD sequences. Out of these -
• in 10 sequences the next word is tagged NOUN, and in 90 sequences it is tagged VERB.
• So the POS tag of w4 = VERB
• The main drawback of this technique is that sometimes the predicted sequence is not grammatically
correct. A bigram-tagger sketch of this approach follows.
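A sketch of the tag-sequence idea with NLTK's BigramTagger, which conditions a word's tag on the previous tag (one context tag) and backs off to a unigram tagger for unseen contexts; corpus choice and split size are assumptions.

```python
# Tag-sequence sketch: BigramTagger uses the previous tag as context, backing
# off to word-frequency tagging when a context was never seen in training
# (assumes nltk.download("treebank") has been run).
from nltk.corpus import treebank
from nltk.tag import UnigramTagger, BigramTagger

train_sents = treebank.tagged_sents()[:3000]
unigram = UnigramTagger(train_sents)
bigram = BigramTagger(train_sents, backoff=unigram)

print(bigram.tag("I want to perform a play .".split()))
```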
Transformation-Based Learning Tagger: TBL
• Transformation-based tagging combines the rule-based and
stochastic tagging methodologies.
• Transformation-based tagging is also called Brill tagging; a training sketch follows.
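A hedged sketch of Brill (transformation-based) tagging with NLTK: start from a stochastic baseline and learn correction rules. The brill24 template set, corpus, and rule count are assumptions.

```python
# Transformation-based (Brill) tagging sketch: a unigram baseline is corrected
# by learned transformation rules (assumes nltk.download("treebank")).
from nltk.corpus import treebank
from nltk.tag import UnigramTagger
from nltk.tag.brill import brill24
from nltk.tag.brill_trainer import BrillTaggerTrainer

train_sents = treebank.tagged_sents()[:3000]
baseline = UnigramTagger(train_sents)                     # stochastic starting point
trainer = BrillTaggerTrainer(baseline, brill24(), trace=0)
brill_tagger = trainer.train(train_sents, max_rules=20)   # learn 20 correction rules

print(brill_tagger.tag("Book that flight .".split()))
```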
Probabilistic Approach
•Idea: Pick the most likely tag for the word
•Approach
Generative: model how data is generated from each class
Discriminative: determine the class directly from the data
•Training Data: Available in the form (data, class)
•Two Types of Models
Generative
Discriminative
Generative vs. Discriminative Learning – A
Story
Zed and Zack are twin brothers.
They’re so alike that you can’t tell who’s who by looking at them.
The twins are child prodigies and jointly hold the topper’s position in their class.
Zed’s approach (Generative style):
Zed can learn everything about a given topic.
He goes in-depth and understands every little detail about a subject.
Once he’s grasped it, he never forgets it.
But this is cumbersome, especially if there’s a lot to learn under said topic.
What’s more, he has to prepare for his exams much sooner than his brother.
Zack’s approach (Discriminative style):
On the other hand, Zack studies by creating a mind map.
He gets the general idea of a topic and then learns the differences and patterns
between the subtopics.
This gives him a lot more flexibility in his thinking process.
You could say he learns by learning the differences.
Conclusion:
As we can see, the brothers have very different learning approaches but both seem to work,
as evident by the topper’s position they’ve held for so long.
Generative and Discriminative Machine
Learning Approaches – A Small Story
• Translating the analogy to our discussion:
Generative models work like Zed
Discriminative models work like Zack
[Link]
Discriminative model
• The majority of discriminative models, aka conditional models, are
used for supervised machine learning.
• They do what they ‘literally’ say, separating the data points into
different classes and learning the boundaries using probability
estimates and maximum likelihood
Generative model
• As the name suggests, generative models can be used to generate
new data points.
• These models are usually used in unsupervised machine learning
problems.
Hidden Markov Model POS Tagging: HMM
• HMM is a probabilistic sequence model, i.e., for POS tagging a given
sequence of words, it computes a probability distribution over
possible sequences of POS labels and chooses the best label
sequence.
• This makes the HMM a good and reliable probabilistic approach for
finding POS tags for a sequence of words.
Markov Model (or Markov Chain)
• Assume we have three types of weather conditions: sunny, rainy, and
foggy.
• The problem at hand is to predict the next day’s weather using the
previous day's weather.
• Let qn = the variable denoting the weather on the n-th day.
• We want to find the probability of qn given the weather conditions of the
previous n-1 days. This can be written as:
• P(qn | qn-1, qn-2, ..., q1) = ?
• According to the first-order Markov assumption -
• The weather condition on the n-th day depends only on the weather of the
(n-1)-th day: P(qn | qn-1, ..., q1) ≈ P(qn | qn-1).
• i.e. tomorrow's weather depends only on today's weather (see the sketch below).
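A minimal sketch of a first-order Markov chain for the weather example; the transition probabilities are illustrative assumptions.

```python
# First-order Markov chain: tomorrow's weather depends only on today's weather.
TRANSITIONS = {
    "sunny": {"sunny": 0.8, "rainy": 0.05, "foggy": 0.15},
    "rainy": {"sunny": 0.2, "rainy": 0.6,  "foggy": 0.2},
    "foggy": {"sunny": 0.2, "rainy": 0.3,  "foggy": 0.5},
}

def next_day_distribution(today):
    """P(q_n | q_(n-1)) under the first-order Markov assumption."""
    return TRANSITIONS[today]

print(next_day_distribution("rainy"))
# {'sunny': 0.2, 'rainy': 0.6, 'foggy': 0.2}
```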
Hidden Markov Model
• A Markov chain is useful when we need to compute a probability for a
sequence of observable events.
• In many cases, the events we are interested in are hidden, i.e., we
don’t observe them directly.
• For example, we don’t normally observe part-of-speech tags in a text.
Rather, we see words and must infer the tags from the word
sequence. We call the tags hidden because they are not observed.
• A hidden Markov model (HMM) allows us to talk about both observed
events (like words that we see in the input) and hidden events (like
part-of-speech tags).
Hidden Markov Model (HMM)
• Markov Model: Future depends only on the present, not on the past.
• Hidden Markov Model:
• States are hidden (not directly visible).
• We only see the observations.
• Goal: Predict hidden states using visible observations.
• Hidden Markov Model (HMM) =
Hidden states (not visible)
Observations (visible outcomes)
Probabilities (transition + emission)
Simple Analogy
•Example: Student’s Mood
Hidden State: Happy / Sad (not directly visible)
Observation: Smile, Cry, Study more, Study less
We guess the mood based on what we can observe.
Components of HMM
1. States (Q): Hidden variables (e.g., Noun, Verb / Happy, Sad)
2. Observations (O): What we see (e.g., Words / Smile, Cry)
3. Transition Probability (A): P(qⱼ | qᵢ), e.g. P(next tag | current tag)
4. Emission Probability (B): P(observation | state), e.g. P(word | tag)
5. Initial Probability (π): Probability of starting in a state
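The five components above, written out for the student's-mood analogy as plain Python data structures; all the numbers are illustrative assumptions.

```python
# HMM components for the mood analogy.
states = ["Happy", "Sad"]                   # Q: hidden states
observations = ["Smile", "Cry"]             # O: visible outcomes

pi = {"Happy": 0.6, "Sad": 0.4}             # initial probabilities

A = {                                       # transition probabilities P(next state | current state)
    "Happy": {"Happy": 0.7, "Sad": 0.3},
    "Sad":   {"Happy": 0.4, "Sad": 0.6},
}

B = {                                       # emission probabilities P(observation | state)
    "Happy": {"Smile": 0.9, "Cry": 0.1},
    "Sad":   {"Smile": 0.2, "Cry": 0.8},
}

# Probability of starting Happy and observing a smile on day 1:
print(pi["Happy"] * B["Happy"]["Smile"])    # 0.54
```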
Example Analogy – Student’s Mood
•Hidden state = Student’s Mood (Happy, Sad)
•Observation = Actions (Smile, Cry, Study more, Study less)
•Transition = Probability of mood changing (Happy→Sad, Sad→Happy)
•Emission = Probability of action given mood (Happy→Smile, Sad→Cry)
Example in NLP (POS Tagging)
•Sentence: “John can see Will”
•Hidden States (POS Tags): Noun, Modal, Verb
•Observations: John, can, see, Will
•HMM helps to find the most likely sequence of POS tags for the sentence.
Hidden Markov Model
• A Hidden Markov Model (HMM) is a probabilistic graphical model
used for modeling systems that exhibit sequential or temporal
behavior, where understanding the underlying states and transitions is
essential.
•Hidden part = Temperature (because we don’t
directly observe the actual temperature).
•Observed part = Weather condition (Sun / Rain /
Snow), which we can see.
•Transitions = The tendency of temperature to
change (cold to moderate, moderate to hot, etc.).
•Emissions = The kind of weather we are likely
to see given a temperature state.
POS tagging with Hidden Markov Model
• Let us consider an example proposed by Serrano and find out
how an HMM selects an appropriate tag sequence for a sentence.
[Link]
What are we trying to do?
• We want to assign the correct POS tags (Noun, Modal, Verb) to words
in sentences like “Ted will spot Will”.
• The model uses:
• Transition probability → likelihood that one tag follows another.
• Emission probability → likelihood that a word belongs to a tag.
Emission Probabilities (Word → Tag)
• Training sentences:
• Mary Jane can see Will
• Spot will see Mary
• Will Jane spot Mary?
• Mary will pat Spot
• We count how many times each word appears as Noun (N), Modal (M),
Verb (V).
• Example:
• "Mary" occurs 4 times as a Noun → Emission P(Mary|Noun) = 4/9
• "Will" occurs 1 time as Noun, 3 times as Model →
• P(Will|Noun) = 1/9
• P(Will|Modal) = 3/4
• The resulting emission-probability table gives the likelihood of each word belonging to each tag;
a counting sketch that reproduces these values appears below.
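A sketch that derives these emission probabilities by counting over the four tagged training sentences (tags: N = Noun, M = Modal, V = Verb).

```python
# Counting emission probabilities P(word | tag) from the training sentences.
from collections import Counter, defaultdict

tagged = [
    [("Mary", "N"), ("Jane", "N"), ("can", "M"), ("see", "V"), ("Will", "N")],
    [("Spot", "N"), ("will", "M"), ("see", "V"), ("Mary", "N")],
    [("Will", "M"), ("Jane", "N"), ("spot", "V"), ("Mary", "N")],
    [("Mary", "N"), ("will", "M"), ("pat", "V"), ("Spot", "N")],
]

tag_totals = Counter()
emissions = defaultdict(Counter)
for sent in tagged:
    for word, tag in sent:
        tag_totals[tag] += 1
        emissions[tag][word.lower()] += 1

print(emissions["N"]["mary"] / tag_totals["N"])   # P(Mary|Noun)  = 4/9
print(emissions["N"]["will"] / tag_totals["N"])   # P(Will|Noun)  = 1/9
print(emissions["M"]["will"] / tag_totals["M"])   # P(Will|Modal) = 3/4
```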
Transition Probabilities (Tag → Next Tag)
We add <S> (start) and <E> (end) to sentences.
Then count tag-to-tag transitions.
Example:
•<S> followed by Noun 3 times → P(N|<S>) = 3/4
•Modal followed by Verb 3 times → P(V|M) = 3/4
Each transition count is divided by the total number of occurrences of the preceding tag to obtain
the probability (a counting sketch follows).
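A matching sketch for the transition probabilities: the tag sequences of the same four sentences are wrapped in <S>/<E> markers and adjacent tag pairs are counted.

```python
# Counting transition probabilities P(next tag | previous tag) with <S>/<E> markers.
from collections import Counter, defaultdict

tag_seqs = [
    ["N", "N", "M", "V", "N"],   # Mary Jane can see Will
    ["N", "M", "V", "N"],        # Spot will see Mary
    ["M", "N", "V", "N"],        # Will Jane spot Mary ?
    ["N", "M", "V", "N"],        # Mary will pat Spot
]

counts = defaultdict(Counter)
for seq in tag_seqs:
    seq = ["<S>"] + seq + ["<E>"]
    for prev, nxt in zip(seq, seq[1:]):
        counts[prev][nxt] += 1

print(counts["<S>"]["N"] / sum(counts["<S>"].values()))   # P(N|<S>) = 3/4
print(counts["M"]["V"] / sum(counts["M"].values()))       # P(V|M)  = 3/4
```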
Evaluating a Tagged Sequence
• Test Sentence = Take a new sentence:
“Will can spot Mary”
• Suppose (wrong tagging):
• Will → Modal (M)
• Can → Verb (V)
• Spot → Noun (N)
• Mary → Noun (N)
• Step 1: Transition probabilities <S> → M → V → N → N → <E>
• Multiply row probabilities from the transition table.
• Step 2: Emission probabilities
• P(Will|M) = ¾
• P(Can|V) = 0 (because "Can" never appeared as a verb in training → zero)
• So whole sequence probability = 0.
Correct Tagging
• Correct sequence is:<S> → N (Will) → M (can) → V (spot) → N (Mary)
→ <E>
• Transition product = P(N|<S>) * P(M|N) * P(V|M) * P(N|V) * P(<E>|N) = 3/4 * 3/9 * 3/4 * 1 * 4/9
• Emission product = P(Will|N) * P(can|M) * P(spot|V) * P(Mary|N) = 1/9 * 1/4 * 1/4 * 4/9
• Final probability (after multiplying all) = 0.00025720164 (non-zero).
The next step is to delete all vertices and edges with probability zero; vertices that do not lead
to the end point are also removed.
Now there are only two paths that lead to the end, let us calculate the
probability associated with each path.
• <S>→N→M→N→N→<E> =3/4*1/9*3/9*1/4*1/4*2/9*1/9*4/9*4/9=0.00000846754
• <S>→N→M→N→V→<E>=3/4*1/9*3/9*1/4*3/4*1/4*1*4/9*4/9=0.00025720164
• Clearly, the probability of the second sequence is much higher and
hence the HMM is going to tag each word in the sentence according
to this sequence.
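A small check of these two numbers: multiplying the transition and emission probabilities along each surviving path with exact fractions reproduces the values above.

```python
# Scoring the two surviving tag paths for "Will can spot Mary".
from fractions import Fraction as F

# Path 1: <S> -> N -> M -> N -> N -> <E>
p1 = F(3, 4) * F(1, 9) * F(3, 9) * F(1, 4) * F(1, 4) * F(2, 9) * F(1, 9) * F(4, 9) * F(4, 9)

# Path 2 (correct tagging): <S> -> N -> M -> V -> N -> <E>
p2 = F(3, 4) * F(1, 9) * F(3, 9) * F(1, 4) * F(3, 4) * F(1, 4) * F(1, 1) * F(4, 9) * F(4, 9)

print(float(p1))   # ~0.00000846754
print(float(p2))   # ~0.00025720164  -> the HMM picks this tag sequence
```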
Named Entity Recognition
• Named Entity Recognition (NER) in NLP focuses on identifying and
categorizing important information known as entities in text.
• These entities can be names of people, places, organizations, dates,
etc.
• It helps transform unstructured text into structured information,
which supports tasks like text summarization, knowledge graph
creation, and question answering.
Working of Named Entity Recognition (NER)
• Analyzing the Text: It processes entire text to locate words or phrases that could represent
entities.
• Finding Sentence Boundaries: It identifies the start and end of sentences using punctuation
and capitalization, which helps preserve the meaning and context of entities.
• Tokenizing and Part-of-Speech Tagging: Text is broken into tokens (words) and each token is
tagged with its grammatical role which provides important clues for identifying entities.
• Entity Detection and Classification: Tokens or groups of tokens that match patterns of known
entities are recognized and classified into predefined categories like Person, Organization,
Location etc.
• Model Training and Refinement: Machine learning models are trained using labeled datasets and
they improve over time by learning patterns and relationships between words.
• Adapting to New Contexts: A well-trained model can generalize to different languages, styles and
unseen types of entities by learning from context.
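A minimal NER sketch using spaCy's pre-trained pipeline; the model name and the example sentence are assumptions.

```python
# Named Entity Recognition sketch with spaCy
# (assumes: python -m spacy download en_core_web_sm).
import spacy

nlp = spacy.load("en_core_web_sm")
doc = nlp("Barack Obama was born in Hawaii and worked in Washington for Google.")

for ent in doc.ents:
    print(ent.text, ent.label_)
# e.g. Barack Obama PERSON / Hawaii GPE / Washington GPE / Google ORG
```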
Semantic analysis
• Semantic Analysis → Meaning extraction from text
• Vector Space Model → Representing words & documents
mathematically
• Applications → Search engines, Chatbots, Machine Translation,
Question Answering
Semantic Relations among Lexemes
• Homonymy – Same form, different meaning (e.g., bank = river bank /
money bank)
• Polysemy – One word with related senses (e.g., mouth = of a river / of
a person)
• Synonymy – Same meaning, different words (big/large)
• Hyponymy – Hierarchical relation (rose is a type of flower)
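These relations can be explored with NLTK's WordNet interface; a sketch follows, assuming nltk.download("wordnet") has been run (the exact synsets returned depend on the WordNet version).

```python
# Exploring lexical relations with WordNet via NLTK.
from nltk.corpus import wordnet as wn

# Synonymy: lemmas that share a synset with 'big'
print(wn.synsets("big")[0].lemma_names())

# Hyponymy/hypernymy: the category one step above the first noun sense of 'rose'
rose = wn.synsets("rose", pos=wn.NOUN)[0]
print(rose.hypernyms())

# Homonymy/polysemy: several distinct senses of 'bank'
for sense in wn.synsets("bank")[:3]:
    print(sense.name(), "-", sense.definition())
```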
Word Sense Disambiguation (WSD)
• Definition: Identifying the correct sense of a word in context
• Example: “He deposited money in the bank” vs “The boat is near the
bank”
• Approaches to WSD:
• Knowledge-based (WordNet, dictionaries)
• Supervised ML (training data with senses)
• Unsupervised (clustering by context words)
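A knowledge-based WSD sketch using the Lesk algorithm from NLTK on the two "bank" sentences above; Lesk is a simple baseline, so the senses it picks may be imperfect.

```python
# Knowledge-based word sense disambiguation with the Lesk algorithm
# (assumes nltk.download("wordnet") and nltk.download("punkt")).
from nltk import word_tokenize
from nltk.wsd import lesk

s1 = word_tokenize("He deposited money in the bank")
s2 = word_tokenize("The boat is near the bank of the river")

sense1 = lesk(s1, "bank")
sense2 = lesk(s2, "bank")
print(sense1, "-", sense1.definition())
print(sense2, "-", sense2.definition())
```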
Vector Space Models (VSM)
• Vector space models represent data as vectors and consider the
relationships between those vectors.
• They are popular in information retrieval systems but are also useful for other
purposes. Generally, this lets us compare the similarity of two
vectors from a geometric perspective (see the sketch below).
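A tiny vector space model sketch in plain Python: two documents as term-count vectors over an assumed vocabulary, compared with cosine similarity.

```python
# Vector space model sketch: documents as term-count vectors, cosine similarity.
import math

def cosine(u, v):
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0

# Vocabulary (assumed): [cat, dog, runs, sleeps]
doc1 = [2, 0, 1, 1]   # "cat cat runs sleeps"
doc2 = [1, 1, 1, 0]   # "cat dog runs"

print(round(cosine(doc1, doc2), 3))   # geometric similarity of the two documents
```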