1. Text Summarization Tool using NLP
a) What is NLP and how does it help? (1.5 marks)
NLP (Natural Language Processing) means teaching computers to understand and work with human language. For a
summarization tool, it helps the program quickly read, understand, and condense large texts such as news articles.
b) Difference: Syntactic vs. Semantic analysis (1.5 marks)
- Syntactic = grammar/structure check.
Example: finding the subject and verb in "She runs fast."
- Semantic = meaning check.
Example: understanding whether "Apple" means the fruit or the company depending on the sentence.
c) How do n-grams help? (2 marks)
N-grams are sequences of n consecutive words, such as pairs/triples like "global warming" or "new policy". They
help find common phrases in the text that can be reused in summaries, as in the sketch below.
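A minimal sketch of pulling frequent bigrams out of a text in plain Python (the sample sentence is invented for illustration):

# Count word pairs (bigrams) and keep the most frequent ones as candidate key phrases.
from collections import Counter

text = "global warming is rising and global warming will shape new policy on energy"
words = text.lower().split()

# Pair each word with the word that follows it.
bigrams = list(zip(words, words[1:]))

# The most common bigrams suggest phrases worth keeping in a summary.
print(Counter(bigrams).most_common(3))
# e.g. [(('global', 'warming'), 2), ...]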
2. Spam Detection using NLP
a) What is NLP and why is it useful for spam detection? (1.5 marks)
NLP helps a program understand email text, so it can check whether a message is spam based on the words and patterns used.
b) Rule-based vs. Machine learning (1.5 marks)
- Rule-based: uses fixed, hand-written rules.
Example: mark an email as spam if it contains "You won a prize".
- ML-based: learns from past spam emails.
Example: a Naive Bayes model trained on many spam and non-spam emails.
c) How do n-grams help detect spam? (2 marks)
Spam emails often repeat word patterns like "free money now". N-gram models learn these patterns and use them to
flag spam, as in the sketch below.
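A minimal sketch of an ML-based filter that uses single words and bigrams as features with a Naive Bayes classifier (scikit-learn; the tiny labelled set is invented for illustration):

from sklearn.feature_extraction.text import CountVectorizer
from sklearn.naive_bayes import MultinomialNB

# Toy labelled data: 1 = spam, 0 = not spam (invented examples).
emails = ["free money now", "you won a prize", "meeting at 10 am", "project report attached"]
labels = [1, 1, 0, 0]

# ngram_range=(1, 2) turns both single words and word pairs into features.
vectorizer = CountVectorizer(ngram_range=(1, 2))
X = vectorizer.fit_transform(emails)

model = MultinomialNB()
model.fit(X, labels)

# Classify a new, unseen email.
test = vectorizer.transform(["claim your free money"])
print(model.predict(test))  # expected: [1] (spam)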
3. Perplexity in Language Models
a) What is perplexity? (2 marks)
It measures how well a model can predict the next word in a sentence: Perplexity = P(w1 ... wN)^(-1/N). Lower perplexity = better model.
b) Calculate perplexity (3 marks)
Given:
P(Dogs) = 0.25
P(bark | Dogs) = 0.15
P(at | bark) = 0.1
P(night | at) = 0.2
P(loudly | night) = 0.1
Multiply all the probabilities:
0.25 × 0.15 × 0.1 × 0.2 × 0.1 = 0.000075
Take the 5th root of 1 / 0.000075:
Perplexity = (1 / 0.000075)^(1/5) ≈ 6.68
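The same calculation can be checked in a few lines of Python:

# Chain-rule probabilities for "Dogs bark at night loudly".
probs = [0.25, 0.15, 0.1, 0.2, 0.1]

p_sentence = 1.0
for p in probs:
    p_sentence *= p              # joint probability = 0.000075

n = len(probs)                   # 5 words
perplexity = (1 / p_sentence) ** (1 / n)
print(round(p_sentence, 6), round(perplexity, 2))  # 7.5e-05 6.68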
4. Bigrams and Smoothing
a) Bigrams starting with AI: (1 mark)
"AI solves", "AI learns"
b) Why raw bigrams are bad? (1 mark)
If a word pair never appears in the training data, its probability becomes 0, so the model assigns zero probability to any sentence containing an unseen pair.
c) Add-1 smoothing: P(solves | AI) (3 marks)
Use the add-1 (Laplace) smoothing formula:
P(solves | AI) = (count(AI solves) + 1) / (count(AI) + V), where V is the vocabulary size
= (1 + 1) / (2 + 7) = 2/9 ≈ 0.22
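The same add-1 step, written out in Python with the counts from the question:

# Add-1 (Laplace) smoothing for P(solves | AI).
count_bigram = 1     # count("AI solves")
count_ai = 2         # count("AI")
vocab_size = 7       # V: number of distinct words in the corpus

p_smoothed = (count_bigram + 1) / (count_ai + vocab_size)
print(round(p_smoothed, 2))  # 0.22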
5. POS Tagging and NER
a) What is POS tagging? (2 marks)
It labels each word's grammatical role (part of speech) in a sentence.
Examples:
- "She eats cake." eats = verb
- "The cake is tasty." cake = noun
b) What is NER? (2 marks)
NER finds names of people, places, etc.
Examples:
- "India" = Location
- "Elon Musk" = Person
c) How POS helps NER? (1 mark)
It shows which words are nouns or proper nouns, helping to find names more accurately.
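A minimal sketch of both steps with spaCy (assumes the small English model en_core_web_sm has already been downloaded):

import spacy

nlp = spacy.load("en_core_web_sm")
doc = nlp("Elon Musk visited India.")

# POS tagging: each word gets a part-of-speech label.
for token in doc:
    print(token.text, token.pos_)    # e.g. Elon PROPN, visited VERB

# NER: proper nouns are grouped into named entities with a type.
for ent in doc.ents:
    print(ent.text, ent.label_)      # e.g. Elon Musk PERSON, India GPE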
6. CBOW (Word2Vec)
a) Goal of CBOW? (1.5 marks)
It learns word meanings by guessing a word using nearby words.
b) How CBOW works? (2 marks)
In "The cat sat on the mat", to guess "sat", CBOW looks at "The", "cat", "on", "the".
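A minimal sketch (plain Python) of how CBOW would form its (context words → target word) training examples with a window of 2:

sentence = "the cat sat on the mat".split()
window = 2

# The surrounding words are the input; the centre word is what CBOW tries to guess.
for i, target in enumerate(sentence):
    context = sentence[max(0, i - window):i] + sentence[i + 1:i + 1 + window]
    print(context, "->", target)
# e.g. ['the', 'cat', 'on', 'the'] -> sat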
c) One advantage and one limitation (1.5 marks)
+ Advantage: fast training
- Limitation: not great for rare words
7. Skip-gram (Word2Vec)
a) Goal of Skip-gram? (1.5 marks)
It guesses nearby words using the current word.
b) Example: (2 marks)
In "AI solves problems", for the word "solves", skip-gram tries to guess "AI" and "problems".
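The same idea in the opposite direction, sketched in plain Python: skip-gram forms (centre word → context word) pairs:

sentence = "AI solves problems".split()
window = 1

# Each centre word is used to guess every word inside its window.
for i, center in enumerate(sentence):
    for j in range(max(0, i - window), min(len(sentence), i + window + 1)):
        if j != i:
            print(center, "->", sentence[j])
# e.g. solves -> AI, solves -> problems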
c) Advantage and limitation (1.5 marks)
+ Advantage: works well for rare words
- Limitation: slower to train than CBOW
8. Word Embeddings
a) What are embeddings vs. one-hot? (4 marks)
- One-hot: a sparse vector with a single 1 at the word's position, like [0, 0, 1, 0]. It only says which word is present; every pair of words looks equally unrelated.
- Embeddings: dense vectors of real numbers that capture meaning, like [0.2, -0.3, 0.7].
Embeddings help find similar words (e.g., "king" and "queen" get nearby vectors), as in the sketch below.
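A small sketch of the difference (numpy; the embedding numbers are invented): one-hot vectors of different words are always orthogonal, while dense vectors can show that related words are close:

import numpy as np

def cosine(a, b):
    return np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b))

# One-hot: "king" and "queen" share no dimensions, so similarity is 0.
king_onehot = np.array([1, 0, 0, 0])
queen_onehot = np.array([0, 1, 0, 0])
print(cosine(king_onehot, queen_onehot))          # 0.0

# Embeddings (toy, made-up numbers): related words point in similar directions.
king_vec = np.array([0.8, 0.6, 0.1])
queen_vec = np.array([0.7, 0.7, 0.2])
print(round(cosine(king_vec, queen_vec), 2))      # close to 1 (about 0.99)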
b) How Word2Vec learns embeddings? (3 marks)
Trains a small neural network:
- CBOW: uses surrounding words to guess the middle word.
- Skip-gram: uses middle word to guess surrounding words.
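A minimal training sketch with gensim's Word2Vec (the tiny corpus is invented; the sg flag switches between the two modes):

from gensim.models import Word2Vec

# Toy corpus: each sentence is a list of tokens.
sentences = [
    ["the", "cat", "sat", "on", "the", "mat"],
    ["ai", "solves", "problems"],
    ["dogs", "bark", "at", "night"],
]

# sg=0 -> CBOW (guess the middle word), sg=1 -> skip-gram (guess the context).
model = Word2Vec(sentences, vector_size=50, window=2, min_count=1, sg=0)

print(model.wv["cat"][:5])           # first 5 dimensions of the learned embedding
print(model.wv.most_similar("cat"))  # words whose vectors are closest to "cat"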
c) vec("king") - vec("man") + vec("woman") vec("queen") (3 marks)
This shows that word vectors capture meaning and relationships such as gender.
Useful in search, translation, and chatbot understanding.
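A sketch of checking this analogy with pretrained GloVe vectors via gensim (the downloader fetches the vectors on first use, so it needs an internet connection):

import gensim.downloader as api

# Load small pretrained word vectors.
vectors = api.load("glove-wiki-gigaword-50")

# vec("king") - vec("man") + vec("woman") should land near vec("queen").
result = vectors.most_similar(positive=["king", "woman"], negative=["man"], topn=1)
print(result)  # expected to return 'queen' as the closest word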