Statistical Methods for Ambiguity Resolution
Statistical methods rely on probability distributions, corpora, and machine learning models to
disambiguate by choosing the most likely interpretation.
1. N-gram Models
Predict the likelihood of a word based on the previous (n−1) words.
Helps in resolving lexical and syntactic ambiguity.
Example:
o P("bank account") > P("bank river") → likely meaning: financial bank
2. Part-of-Speech (POS) Tagging with Hidden Markov Models (HMM)
Tags words with their parts of speech based on probabilities.
Resolves ambiguity in word classes.
e.g., "record" as a noun or verb.
HMM chooses the sequence of tags with highest probability.
3. Word Sense Disambiguation (WSD) using Bayesian Methods
Uses Bayes' theorem to find the most probable sense of a word given the context.
P(sense | context) = P(context | sense) × P(sense) / P(context)
Naïve Bayes is often used for practical WSD (see the sketch after this overview).
4. Maximum Entropy Models (Logistic Regression)
Uses features like surrounding words, POS tags, and syntactic roles.
Selects the sense or parse with the highest probability.
Unlike Naïve Bayes, it makes fewer independence assumptions.
5. Conditional Random Fields (CRF)
Widely used in sequence labeling tasks like POS tagging, Named Entity Recognition.
Can incorporate many contextual features for disambiguation.
6. Statistical Parsing (Probabilistic Context-Free Grammars - PCFGs)
Assigns probabilities to grammar rules.
Resolves syntactic ambiguity by selecting the parse tree with the highest probability.
Example:
Sentence: “I saw her duck.”
Possible meanings:
1. "duck" = noun (the bird)
2. "duck" = verb (she dodged)
Using a statistical POS tagger trained on real-world sentences, we might find that:
P(“duck” as noun | “her duck”) > P(“duck” as verb | “her duck”)
→ So the system infers “duck” as a noun.
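Before looking at each of these methods in more detail, here is a minimal Naïve Bayes sketch of the Bayesian WSD idea from point 3. All priors and likelihoods below are invented toy values, not corpus estimates:

context = ["sat", "on", "the", "river"]   # context words around "bank"

# P(sense): hypothetical priors
prior = {"FINANCE": 0.6, "RIVER": 0.4}

# P(word | sense): hypothetical likelihoods for a few context words
likelihood = {
    "FINANCE": {"sat": 0.01, "on": 0.05, "the": 0.20, "river": 0.001},
    "RIVER":   {"sat": 0.03, "on": 0.08, "the": 0.20, "river": 0.050},
}

def naive_bayes_score(sense):
    """P(sense) x product of P(word | sense) over the context (Naive Bayes)."""
    score = prior[sense]
    for word in context:
        score *= likelihood[sense].get(word, 1e-4)   # small default for unseen words
    return score

scores = {sense: naive_bayes_score(sense) for sense in prior}
print(scores)
print("Chosen sense:", max(scores, key=scores.get))   # RIVER wins with these toy numbers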
An N-gram model is a probabilistic language model used in Natural Language Processing (NLP) to
predict the next item (usually a word) in a sequence based on the previous (N−1) items. It’s a
fundamental statistical technique for language modeling, text generation, speech recognition, and
ambiguity resolution.
What is an N-Gram?
An N-gram is a sequence of N items (typically words) from a given text or speech.
| N | Name | Example (from sentence “I love data mining”) |
|---|---------|-----------------------------------------------|
| 1 | Unigram | I, love, data, mining |
| 2 | Bigram | I love, love data, data mining |
| 3 | Trigram | I love data, love data mining |
| 4 | 4-gram | I love data mining |
How N-Gram Models Work
The probability of a word depends only on the previous (N−1) words.
For a sequence of words, the chain rule gives:
P(w₁, w₂, ..., wₙ) = P(w₁) × P(w₂|w₁) × P(w₃|w₁,w₂) × ...
A bigram model (N = 2) approximates this as:
P(w₁, w₂, ..., wₙ) ≈ P(w₁) × P(w₂|w₁) × P(w₃|w₂) × ...
This simplifies computation by assuming the Markov property (only the most recent history matters).
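A minimal sketch of this estimate in code; the corpus below is a tiny made-up word list, purely to show the mechanics of counting:

from collections import Counter

# Tiny made-up corpus, just to show the mechanics of bigram estimation.
corpus = "i love data mining . i love data science . students love data".split()

unigrams = Counter(corpus)
bigrams = Counter(zip(corpus, corpus[1:]))

def bigram_prob(w1, w2):
    """Maximum-likelihood estimate: P(w2 | w1) = count(w1 w2) / count(w1)."""
    return bigrams[(w1, w2)] / unigrams[w1]

print(bigram_prob("love", "data"))    # 1.0  (every "love" is followed by "data")
print(bigram_prob("data", "mining"))  # 0.333... (1 of 3 occurrences of "data")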
Why Use N-Gram Models?
Efficient: Reduces complexity by focusing only on a fixed number of previous words.
Effective: Captures common language patterns.
Useful for: Spell-checking, predictive typing, machine translation, and disambiguation.
Limitations of N-Gram Models
Sparsity: Many word combinations never appear in training data.
Context Loss: Long-range dependencies are ignored.
Data Hungry: Requires a large corpus for accuracy.
Smoothing Techniques
To overcome sparsity, smoothing adjusts probabilities of unseen N-grams:
Laplace Smoothing (Add-1)
Good-Turing Estimation
Backoff and Interpolation
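A small sketch of add-1 (Laplace) smoothing applied to the bigram estimate from the previous sketch (V is the vocabulary size):

def laplace_bigram_prob(w1, w2, bigrams, unigrams, vocab_size):
    """Add-1 smoothed estimate: (count(w1 w2) + 1) / (count(w1) + V).
    Unseen bigrams get a small non-zero probability instead of zero."""
    return (bigrams[(w1, w2)] + 1) / (unigrams[w1] + vocab_size)

# Reusing the counts from the previous sketch, an unseen bigram such as
# ("love", "mining") now gets a small probability instead of 0:
# laplace_bigram_prob("love", "mining", bigrams, unigrams, len(unigrams))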
Applications of N-Grams
Speech Recognition
Spell Correction
Predictive Text Input
Machine Translation
POS Tagging
Example: Lexical Ambiguity Resolution using Bigram Model
Sentence:
"He sat on the bank."
Ambiguity:
bank could mean:
1. A financial institution
2. A riverbank
Bigram Probability Analysis:
From a large language corpus (like Google Ngram or Brown Corpus), we find bigram frequencies:
| Phrase | Frequency | Estimated Probability |
|----------------|-----------|------------------------|
| "on the bank" | 5000 | High |
| "at the bank" | 10000 | High |
| "bank account" | 12000 | High (finance-related) |
| "river bank" | 3000 | Moderate |
We also look at contextual word sequences, such as:
“sat on” – typically used for physical surfaces
“on the bank” – more commonly used in nature contexts
We then compute:
P(bank = riverbank | "sat on the") > P(bank = financial institution | "sat on the")
So, the model prefers “riverbank”, because:
“sat on the bank” is more frequent in nature-related texts
“sat at the bank” would be expected for financial meaning
Resolution:
Interpreted Meaning = He sat on the riverbank
Because “sat on the bank” is more statistically likely in a corpus to refer to a physical location than to
a financial institution.
Behind the Scenes:
Using the n-gram estimate (here conditioning on the two previous words):
P("bank" | "on the") = frequency("on the bank") / frequency("on the")
Let's walk through a detailed explanation of Part-of-Speech (POS) tagging with a suitable example, including how ambiguity is resolved using context and how an HMM helps pick the correct tags.
Part-of-Speech (POS) Tagging – Explained
POS tagging is the process of assigning word categories like noun, verb, adjective, etc., to each word
in a sentence.
Why POS Tagging Needs Ambiguity Resolution
Some words can have multiple parts of speech depending on context.
Example Sentence:
“Can you can a can as a canner can can a can?”
This sentence contains the word “can” used with different meanings and POS tags:
| Word | Possible Tags |
|--------|------------------------|
| can | modal verb, noun, verb |
| canner | noun |
| you | pronoun |
| a | article |
| as | conjunction |
Correct POS Tagging (Using Context):
“Can/MODAL you/PRON can/VERB a/DET can/NOUN as/CONJ a/DET canner/NOUN
can/MODAL can/VERB a/DET can/NOUN?”
Breakdown:
| Word | POS Tag | Explanation |
|--------|---------|------------------------------------|
| Can | MODAL | Helping verb: Can you...? |
| you | PRON | Subject pronoun |
| can | VERB | Main verb: to can (preserve) |
| a | DET | Article |
| can | NOUN | Object noun: a can (container) |
| as | CONJ | Comparison word |
| a | DET | Article |
| canner | NOUN | A person who cans food |
| can | MODAL | Helping verb: the canner can... |
| can | VERB | Main verb: to can |
| a | DET | Article |
| can | NOUN | Noun again (a can) |
Ambiguity Without Context:
If we saw just the word "can", we wouldn't know if it's:
A modal verb (Can you swim?)
A noun (Open the can.)
A verb (She can the tomatoes.)
How HMM Helps:
Using a trained Hidden Markov Model, we calculate:
Transition Probability:
P(current tag | previous tag)
E.g., P(VERB | MODAL) = high (like “can go”)
Emission Probability:
P(word | tag)
E.g., P("can" | VERB) vs. P("can" | NOUN)
The HMM chooses the sequence of POS tags that gives the highest total probability.
Let's walk through a detailed example of POS tagging using a Hidden Markov Model (HMM), showing how it resolves ambiguity using transition and emission probabilities.
POS Tagging with Hidden Markov Model – Step-by-Step Example
Goal:
Tag the sentence:
“He can fish.”
❗ Ambiguity:
"can" could be a modal verb or a main verb
"fish" could be a noun (the animal) or a verb (to catch fish)
Possible Interpretations:
| Interpretation | Meaning |
|-------------------------------|------------------------------------|
| He/PRON can/MODAL fish/VERB | He is able to catch fish (correct) |
| He/PRON can/NOUN fish/VERB | Makes little sense |
| He/PRON can/NOUN fish/NOUN | Grammatically odd |
Using HMM to Resolve Ambiguity
Hidden Markov Model uses two types of probabilities:
1. Emission Probability
P(word | tag): How likely is a word to appear as a certain POS?
| Word | POS | P(word | tag) (estimated) |
|--------|-----------|----------------------------|
| he | PRON | 0.9 |
| can | MODAL | 0.7 |
| can | NOUN | 0.2 |
| fish | NOUN | 0.5 |
| fish | VERB | 0.4 |
2. Transition Probability
P(current_tag | previous_tag): Likelihood of one tag following another.
| Previous → Current | P(tag₂ | tag₁) |
|--------------------|----------------|
| START → PRON | 0.8 |
| PRON → MODAL | 0.7 |
| PRON → NOUN | 0.2 |
| MODAL → VERB | 0.9 |
| VERB → NOUN | 0.3 |
| MODAL → NOUN | 0.1 |
| NOUN → VERB | 0.4 |
| NOUN → NOUN | 0.3 |
Calculate Probabilities for Tag Sequences
➤ Option 1 (Correct):
“He/PRON can/MODAL fish/VERB”
P = P(PRON∣START) ⋅ P(he∣PRON) ⋅ P(MODAL∣PRON) ⋅ P(can∣MODAL) ⋅ P(VERB∣MODAL) ⋅ P(fish∣VERB)
P = 0.8 ⋅ 0.9 ⋅ 0.7 ⋅ 0.7 ⋅ 0.9 ⋅ 0.4 ≈ 0.1270
➤ Option 2 (Less likely):
“He/PRON can/NOUN fish/VERB”
P = 0.8 ⋅ 0.9 ⋅ 0.2 ⋅ 0.2 ⋅ 0.4 ⋅ 0.4 ≈ 0.0046
➤ Option 3 (Nonsensical):
“He/PRON can/NOUN fish/NOUN”
P = 0.8 ⋅ 0.9 ⋅ 0.2 ⋅ 0.2 ⋅ 0.3 ⋅ 0.5 ≈ 0.0043
Most Probable Tagging (Highest Score):
He/PRON can/MODAL fish/VERB
Interpretation:
“He is able to catch fish.”
How HMM Helps:
Uses contextual patterns in the form of transition probabilities.
Considers word-tag probabilities (emissions).
Viterbi algorithm efficiently computes the best path (tag sequence).
Why It’s Effective for Ambiguity
The word “can” has multiple meanings.
The model chooses “MODAL” for “can” because it follows a pronoun and precedes a verb,
both highly probable transitions.
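The same result can be checked with a minimal Viterbi sketch over the toy tables above. Transitions or emissions not listed are treated as 0.0, and the NOUN → VERB / NOUN → NOUN values are the assumed ones used in Options 2 and 3:

tags = ["PRON", "MODAL", "NOUN", "VERB"]

# Emission probabilities P(word | tag) from the table above
emission = {
    ("he", "PRON"): 0.9,
    ("can", "MODAL"): 0.7, ("can", "NOUN"): 0.2,
    ("fish", "NOUN"): 0.5, ("fish", "VERB"): 0.4,
}

# Transition probabilities P(tag2 | tag1) from the table above;
# NOUN -> VERB and NOUN -> NOUN are the assumed values used in Options 2 and 3.
transition = {
    ("START", "PRON"): 0.8,
    ("PRON", "MODAL"): 0.7, ("PRON", "NOUN"): 0.2,
    ("MODAL", "VERB"): 0.9, ("MODAL", "NOUN"): 0.1,
    ("VERB", "NOUN"): 0.3,
    ("NOUN", "VERB"): 0.4, ("NOUN", "NOUN"): 0.3,
}

def viterbi(words):
    """Return (probability, tag sequence) of the most probable path."""
    best = {"START": (1.0, [])}          # best[tag] = (path probability, path)
    for word in words:
        new_best = {}
        for tag in tags:
            e = emission.get((word, tag), 0.0)
            if e == 0.0:
                continue                  # this tag cannot emit this word
            # extend every surviving path with this tag, keep the best one
            candidates = [(p * transition.get((prev, tag), 0.0) * e, path + [tag])
                          for prev, (p, path) in best.items()]
            new_best[tag] = max(candidates)
        best = new_best
    return max(best.values())

prob, path = viterbi(["he", "can", "fish"])
print(path, round(prob, 4))   # ['PRON', 'MODAL', 'VERB'] 0.127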
Let's walk through a clear example that demonstrates Maximum Entropy Modeling (Logistic Regression) in NLP using a Part-of-Speech (POS) tagging task.
Example: POS Tagging Using Maximum Entropy (Logistic Regression)
✍️Sentence to Tag:
“Book that flight”
Ambiguity:
“Book” can be:
o a Noun (I read a book)
o a Verb (Book a flight)
Goal:
Determine the correct POS tag for “Book” using contextual features and a trained Maximum Entropy
classifier.
Step 1: Define Feature Set for Each Word
Let’s extract features for the first word “Book”:
| Feature | Value |
|------------------------------------------|------------------------------|
| Word itself | “Book” |
| Is first word in sentence? | Yes |
| Is capitalized? | Yes |
| Following word | “that” |
| Part-of-speech of next word | Determiner / Pronoun (varies) |
| Word shape (title case, lowercase, etc.) | Title |
These features form the input x to the MaxEnt model.
Step 2: Maximum Entropy Model Prediction
The model evaluates P(tag | features) for each candidate tag using a weighted combination of the features:
P(tag | features) = exp( Σᵢ λᵢ · fᵢ(features, tag) ) / Z(features)
where each fᵢ is a feature, λᵢ its learned weight, and Z(features) a normalizing constant.
Assume the trained model computes the probabilities:
P(Book = VERB | features) = 0.85
P(Book = NOUN | features) = 0.15
Since the verb tag has the highest probability, the model selects:
“Book” → VERB
Tagged Sentence Output:
“Book/VERB that/DET flight/NOUN”
Why the Classifier Chose VERB:
“Book” is capitalized and sentence-initial → a neutral clue (any first word is capitalized).
The next word is “that”, a determiner introducing the noun phrase “that flight” → suggests a verb taking an object.
The overall sentence pattern resembles a command (imperative).
Python Code (Using NLTK’s MaxEnt Classifier)
Here’s a simplified version using NLTK's built-in classifier:
import nltk
from nltk.classify import MaxentClassifier

# Training data (simplified): list of (feature dict, label) tuples
train_set = [
    ({"word": "Book", "next_word": "that", "capitalized": True}, "VERB"),
    ({"word": "book", "next_word": "is", "capitalized": False}, "NOUN"),
    ({"word": "flight", "capitalized": False}, "NOUN"),
]

# Train the classifier
classifier = MaxentClassifier.train(train_set, algorithm='iis', trace=0, max_iter=10)

# Test features for the word "Book"
test_features = {"word": "Book", "next_word": "that", "capitalized": True}

# Predict the POS tag
predicted_tag = classifier.classify(test_features)
print("Predicted Tag for 'Book':", predicted_tag)
🧾 Output:
Predicted Tag for 'Book': VERB
Summary
Maximum Entropy (Logistic Regression) chooses the most likely tag based on features, not just
word frequency.
It works well when context and features are important (like capitalization, nearby words,
position).
Unlike Naïve Bayes, it doesn’t assume independence between features.
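Because a MaxEnt classifier is mathematically a multinomial logistic regression, the same toy example can also be sketched with scikit-learn (assuming scikit-learn is installed; with only three training examples the output is purely illustrative):

from sklearn.feature_extraction import DictVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline

# Same toy feature dictionaries as in the NLTK example above
train_features = [
    {"word": "Book", "next_word": "that", "capitalized": True},
    {"word": "book", "next_word": "is", "capitalized": False},
    {"word": "flight", "capitalized": False},
]
train_labels = ["VERB", "NOUN", "NOUN"]

# DictVectorizer turns the feature dicts into numeric vectors;
# LogisticRegression plays the role of the maximum-entropy classifier.
model = make_pipeline(DictVectorizer(sparse=False), LogisticRegression(max_iter=1000))
model.fit(train_features, train_labels)

test = {"word": "Book", "next_word": "that", "capitalized": True}
print(model.predict([test])[0])                      # expected: VERB (toy data)
print(dict(zip(model.classes_, model.predict_proba([test])[0])))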
Let's walk through a complete, step-by-step example of how a Conditional Random Field (CRF) is used in Named Entity Recognition (NER), a common NLP task, to resolve ambiguity.
Example: Named Entity Recognition (NER) with CRF
Sentence:
"Steve Jobs founded Apple in California."
Goal:
Identify named entities in the sentence, such as:
Steve Jobs → PERSON
Apple → ORGANIZATION
California → LOCATION
We want to assign a label (NER tag) to each word.
1. Tokenized Sentence:
["Steve", "Jobs", "founded", "Apple", "in", "California", "."]
2. Tags to Predict:
| Word | NER Tag |
|------------|----------------------|
| Steve | B-PER (Begin Person) |
| Jobs | I-PER (Inside Person) |
| founded | O (Outside) |
| Apple | B-ORG |
| in | O |
| California | B-LOC |
| . | O |
3. Feature Extraction per Word
Let’s extract features for each word — CRFs learn patterns between features and labels, and between
labels themselves.
🔹 Features for “Apple”:
| Feature | Value |
|---------------------|------------------------------|
| Word | “Apple” |
| Is Capitalized? | Yes |
| Prefix (2 letters) | “Ap” |
| Suffix (3 letters) | “ple” |
| Previous Word | “founded” |
| Next Word | “in” |
| Previous Tag | O (learned during training) |
These features help the CRF recognize “Apple” as a likely organization.
4. How CRF Resolves Ambiguity
Let’s say a model needs to tag "Apple" in different contexts:
“Apple is tasty.” → “Apple” = B-FRUIT (or O)
“Apple released a new iPhone.” → “Apple” = B-ORG
CRF looks at the surrounding words and tags:
If the previous word is “founded”, it learns that the next word is likely an organization.
If the previous word is “an”, and the next word is “tree”, it may tag “Apple” as a fruit.
Prediction:
“founded” → followed by capitalized noun = likely organization
So, CRF tags "Apple" as → B-ORG
5. CRF Learns These Patterns:
| Pattern (Learned from Training Data) | Likely Tag |
|----------------------------------------------------|------------|
| Capitalized word after “founded” | ORG |
| Two consecutive capitalized words at the beginning | PER |
| Word “in” followed by capitalized word | LOC |
Final Tagged Output:
| Word | Predicted Tag |
|------------|---------------|
| Steve | B-PER |
| Jobs | I-PER |
| founded | O |
| Apple | B-ORG |
| in | O |
| California | B-LOC |
| . | O |
How CRF Helps Resolve Ambiguity:
CRF considers word-level features AND neighboring tags.
It learns that:
o “Steve Jobs” is a name → PERSON
o “Apple” after “founded” → ORGANIZATION
o “California” after “in” → LOCATION
Unlike HMM, CRF can use rich contextual and lexical features without assuming
independence.
Summary:
CRF is ideal for tasks like NER and POS tagging where context matters.
It uses features and label sequences to make smart, consistent decisions.
CRFs are accurate and flexible, often used in real-world NLP systems.
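A minimal sketch of this pipeline using the third-party sklearn-crfsuite package (assumed installed via pip install sklearn-crfsuite). With a single training sentence it only illustrates the mechanics of feature extraction, training, and prediction, not a realistic model:

import sklearn_crfsuite   # third-party: pip install sklearn-crfsuite

sentence = ["Steve", "Jobs", "founded", "Apple", "in", "California", "."]
labels   = ["B-PER", "I-PER", "O", "B-ORG", "O", "B-LOC", "O"]

def word_features(sent, i):
    """Feature dictionary for token i, mirroring the feature table above."""
    word = sent[i]
    return {
        "word": word.lower(),
        "is_capitalized": word[0].isupper(),
        "prefix2": word[:2],
        "suffix3": word[-3:],
        "prev_word": sent[i - 1].lower() if i > 0 else "<START>",
        "next_word": sent[i + 1].lower() if i < len(sent) - 1 else "<END>",
    }

# CRF training data: one list of feature dicts and one list of labels per sentence
X_train = [[word_features(sentence, i) for i in range(len(sentence))]]
y_train = [labels]

crf = sklearn_crfsuite.CRF(algorithm="lbfgs", max_iterations=50)
crf.fit(X_train, y_train)

# Predicting on the training sentence only shows the call; a real model
# needs many annotated sentences.
print(crf.predict(X_train)[0])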
Statistical Parsing with Probabilistic Context-Free Grammars (PCFGs)
What Is Statistical Parsing?
Statistical parsing is the process of analyzing the grammatical structure of a sentence using
probability-based models, to choose the most likely parse tree among several possibilities.
A Probabilistic Context-Free Grammar (PCFG) is an extension of a regular Context-Free
Grammar (CFG) that assigns a probability to each production rule.
1. Context-Free Grammar (CFG) – Recap
A CFG consists of:
Terminals (actual words like “dog”, “runs”)
Non-terminals (syntactic categories like NP, VP)
Production Rules, e.g.:
S → NP VP
NP → Det Noun
VP → Verb NP
2. What Is a PCFG?
In a PCFG, each rule has a probability, e.g.:
S → NP VP [1.0]
NP → Det Noun [0.5]
NP → ProperNoun [0.5]
VP → Verb NP [0.8]
VP → Verb [0.2]
The probabilities are learned from a treebank (a parsed corpus).
For each non-terminal, the sum of rule probabilities must be 1.0.
3. Example Sentence:
“She sees a dog”
Let’s define a basic PCFG:
📜 Grammar:
S → NP VP [1.0]
NP → Pronoun [0.4]
NP → Det Noun [0.6]
VP → Verb NP [1.0]
Det → “a” [1.0]
Noun → “dog” [1.0]
Pronoun → “She” [1.0]
Verb → “sees” [1.0]
4. All Possible Parse Trees
Only one parse tree is possible here:
S
├── NP → Pronoun → She
└── VP
    ├── Verb → sees
    └── NP
        ├── Det → a
        └── Noun → dog
🔢 Probability of Parse Tree:
Multiply the probabilities of the applied rules:
P(S → NP VP) = 1.0
P(NP → Pronoun) = 0.4
P(Pronoun → She) = 1.0
P(VP → Verb NP) = 1.0
P(Verb → sees) = 1.0
P(NP → Det Noun) = 0.6
P(Det → a) = 1.0
P(Noun → dog) = 1.0
Total = 1.0 × 0.4 × 1.0 × 1.0 × 1.0 × 0.6 × 1.0 × 1.0 = 0.24
5. Why Use PCFGs for Ambiguity Resolution?
Consider a more ambiguous sentence:
“I saw the man with the telescope.”
Possible parses:
1. I saw [the man with the telescope].
2. I saw [the man] [with the telescope].
PCFG will score each parse using the rule probabilities and pick the most probable one, effectively
resolving syntactic ambiguity.
6. Python Example with NLTK
import nltk
from nltk import PCFG
from nltk.parse import ViterbiParser

# Define a PCFG (the same grammar as the worked example above)
pcfg_grammar = PCFG.fromstring("""
    S -> NP VP [1.0]
    NP -> Det N [0.6] | Pronoun [0.4]
    VP -> V NP [1.0]
    Det -> 'a' [1.0]
    N -> 'dog' [1.0]
    V -> 'sees' [1.0]
    Pronoun -> 'She' [1.0]
""")

# Create the Viterbi parser (returns the most probable parse)
parser = ViterbiParser(pcfg_grammar)

# Parse a sentence
sentence = ['She', 'sees', 'a', 'dog']
for tree in parser.parse(sentence):
    print(tree)
    print("Probability:", tree.prob())
Output:
(S
(NP (Pronoun She))
(VP (V sees) (NP (Det a) (N dog))))
Probability: 0.24
Advantages of PCFGs
Automatically chooses the most likely parse using learned probabilities.
Can resolve ambiguity better than basic CFGs.
Trainable from corpora like Penn Treebank.
Python Example: Resolving PP-Attachment Ambiguity (“I saw the man with the telescope”)
import nltk
from nltk import PCFG
from nltk.parse import ViterbiParser

# Define the grammar (rule probabilities for each non-terminal must sum to 1.0)
pcfg_grammar = PCFG.fromstring("""
    S -> NP VP [1.0]
    VP -> V NP [0.5] | VP PP [0.5]
    NP -> Pronoun [0.3] | Det N [0.4] | NP PP [0.3]
    PP -> P NP [1.0]
    Det -> 'the' [1.0]
    N -> 'man' [0.5] | 'telescope' [0.5]
    P -> 'with' [1.0]
    V -> 'saw' [1.0]
    Pronoun -> 'I' [1.0]
""")

# Parser: prints the single most probable parse and its probability
parser = ViterbiParser(pcfg_grammar)

sentence = ['I', 'saw', 'the', 'man', 'with', 'the', 'telescope']
for tree in parser.parse(sentence):
    print(tree)
    print("Probability:", tree.prob())