Natural Language Processing Overview

Uploaded by

Piyush Singh
Copyright
© All Rights Reserved
Artificial Intelligence

(CS F407)

Understanding Natural Languages

Chittaranjan Hota
Professor, Computer Science & Information Systems Department
BITS Pilani Hyderabad Campus, Hyderabad
[email protected]
Natural Language Processing

Make computers learn our language rather than have us learn theirs.

Yahoo, Google, Microsoft → Information Retrieval
आप कहाँ जा रहे → Machine Translation
Ask Jeeves → Question Answering
Myspace, Facebook → Processing of User-Generated Content

All “Big Guys” have (several) strong NLP research labs: IBM, Microsoft, AT&T, Xerox, Sun, etc.
Where does it fit in the CS taxonomy?

Computer Science
  Databases
  Artificial Intelligence
    ML
    Natural Language Processing
      Information Retrieval
      Machine Translation
      Language Analysis
        Semantics
        Parsing
    Search
  Systems prog.
  Networking
Information Extraction (IE)

• Identify phrases in language that refer to specific types of entities and relations in the text.
• Named entity recognition is the task of identifying names of people, places, organizations, etc. in text.
  • Rajesh Gopinathan is the CEO of TCS who lives in Mumbai.
• Relation extraction identifies specific relations between entities.
  • Rajesh Gopinathan is the CEO of TCS who lives in Mumbai.
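As a rough illustration of relation extraction on the example sentence, the Python sketch below uses two hand-written regular expressions. The pattern names (CEO_of, lives_in) and the capitalized-word heuristic for names are assumptions made for this example only; practical systems typically rely on trained models rather than fixed patterns.

```python
import re

# Toy heuristic (an assumption): a name is one or more capitalized words.
NAME = r"[A-Z][A-Za-z]*(?: [A-Z][A-Za-z]*)*"

# Hand-written patterns, one per relation; purely illustrative.
PATTERNS = {
    "CEO_of": re.compile(rf"({NAME}) is the CEO of ({NAME})"),
    "lives_in": re.compile(rf"({NAME}) .*lives in ({NAME})"),
}

def extract_relations(sentence):
    """Return (subject, relation, object) triples matched by the toy patterns."""
    triples = []
    for relation, pattern in PATTERNS.items():
        match = pattern.search(sentence)
        if match:
            triples.append((match.group(1), relation, match.group(2)))
    return triples

sentence = "Rajesh Gopinathan is the CEO of TCS who lives in Mumbai."
relations = extract_relations(sentence)

# The named entities are simply the arguments of the extracted relations.
entities = sorted({name for subj, _, obj in relations for name in (subj, obj)})

print(relations)
print(entities)
```

The patterns only cover this sentence shape; any rewording ("TCS's CEO, Rajesh Gopinathan, ...") would need new patterns, which is one reason learned extractors are preferred.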
Text Summarization

Source text (excerpt):
Birla Institute of Technology & Science, Pilani (shortened BITS Pilani or BITS) is an Indian institute of higher education and a deemed university under Section 3 of the UGC Act 1956.

Summarized text:
BITS conducts the all-India computerized entrance examination, BITSAT (BITS Admission Test).

http://textsummarization.net/text-summarizer
Machine Translation (MT)

• Translate a sentence from one natural language to another.

• आप कहाँ जा रहे
• Where are you going

• बिट्स हैदराबाद कह हे
• BITS Hyderabad said,

http://translate.google.com/#
Knowledge of Language

• Phonology – concerns how words are related to the sounds that realize them (e.g., thumb, dumb).
• Morphology – concerns how words are constructed from more basic meaning units called morphemes.
  • nonperishable has 3 morphemes: non-, perish, and -able
• Syntax – concerns the proper ordering of words and its effect on meaning.
  • The dog bit the boy.
  • The boy bit the dog.
  • Bit boy dog the the.
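The morpheme split of "nonperishable" can be sketched with a naive affix-stripping routine in Python. The affix lists and the minimum-stem-length rule below are assumptions chosen for this example; a real morphological analyzer would consult a lexicon and handle spelling changes.

```python
# Tiny affix lists, assumed for illustration only.
PREFIXES = ["non", "un", "re"]
SUFFIXES = ["able", "ing", "ed"]

def segment(word):
    """Split a word into at most one prefix, a stem, and at most one suffix."""
    morphemes = []
    for prefix in PREFIXES:
        # Require at least three letters to remain so short words stay whole.
        if word.startswith(prefix) and len(word) > len(prefix) + 2:
            morphemes.append(prefix + "-")
            word = word[len(prefix):]
            break
    suffix_found = None
    for suffix in SUFFIXES:
        if word.endswith(suffix) and len(word) > len(suffix) + 2:
            suffix_found = suffix
            word = word[:-len(suffix)]
            break
    morphemes.append(word)
    if suffix_found:
        morphemes.append("-" + suffix_found)
    return morphemes

print(segment("nonperishable"))  # ['non-', 'perish', '-able']
```

Pure string matching will over-segment words that merely look affixed, so the output should be read as candidate analyses, not linguistic fact.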
Continued…
• Semantics – concerns the (literal) meaning of words, phrases, and sentences.
  • “plant” is a manufacturing facility
  • “plant” is the act of sowing
• Pragmatics – concerns the overall communicative and social context and its effect on interpretation.
  • John thinks mushrooms.
• Discourse – concerns how the immediately preceding sentences affect the interpretation of the next sentence.
• World Knowledge – includes general knowledge about the world.
Ambiguity (I made him duck)
• Some interpretations:
• I cooked duck for him.
• I cooked duck belonging to him.
• I created a toy duck which he owns.
• I caused him to quickly lower his head or body.
• I bowled him out before he scored any runs.
• I used magic and turned him into a duck.
• duck – morphologically and syntactically
ambiguous: noun or verb.
• make – semantically ambiguous: cook, bowl, or
create.
[Figure: NLP processing pipeline. Stages: Morphological Analysis, Syntactic Analysis, Semantic Analysis, Pragmatic Analysis, Discourse Analysis. Labels: user, surface form, stems, lexicon, internal representation, resolve references, perform action.]

Lexicon is a vocabulary data bank that contains the language's words and their linguistic information.
Source: H. Al-Muhtaseb
Surface form → stems

Surface form:
I want to print Ali’s .init file

Stems:
I (pronoun)
want (verb)
to (prep)
to (infinitive)
print (verb)
Ali (noun)
’s (possessive)
.init (adj)
file (noun)
file (verb)
Source: H. Al-Muhtaseb
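A minimal Python sketch of this step, assuming the lexicon is a plain dictionary holding exactly the entries listed on the slide. The tokenizer and its possessive-splitting rule are simplifications introduced for this example.

```python
import re

# Lexicon entries copied from the slide; a real lexicon would be far larger.
LEXICON = {
    "I": ["pronoun"],
    "want": ["verb"],
    "to": ["prep", "infinitive"],
    "print": ["verb"],
    "Ali": ["noun"],
    "'s": ["possessive"],
    ".init": ["adj"],
    "file": ["noun", "verb"],
}

def tokenize(text):
    """Split on whitespace and separate a trailing possessive 's."""
    tokens = []
    for chunk in text.replace("’", "'").split():
        match = re.fullmatch(r"(.+)('s)", chunk)
        if match:
            tokens.extend(match.groups())
        else:
            tokens.append(chunk)
    return tokens

def analyze(text):
    """Pair each token with every category the lexicon lists for it."""
    return [(tok, LEXICON.get(tok, ["unknown"])) for tok in tokenize(text)]

analysis = analyze("I want to print Ali's .init file")
for token, categories in analysis:
    print(token, categories)
```

Ambiguous entries such as "to" and "file" keep all their categories at this stage; choosing among them is left to syntactic analysis.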
stems → Parse tree

S
  NP
    PRO: I
  VP
    V: want
    S
      NP
        PRO: I
      VP
        V: print
        NP
          ADJ: Ali’s
          NP
            ADJ: .init
            N: file
Source: H. Al-Muhtaseb
Parsing: Another example
s --> np, vp.
np --> det, noun.
np --> proper_noun.
vp --> v, np.
vp --> v.
det --> [a].
det --> [an].
det --> [the].
noun --> [apple].
noun --> [orange].
proper_noun --> [john].
proper_noun --> [mary].
v --> [eats].
v --> [loves].

E.g., john eats an apple.

Parse tree:
s
  np
    proper_noun: john
  vp
    v: eats
    np
      det: an
      noun: apple
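The grammar above is written as a Prolog DCG. As a rough Python counterpart, the sketch below transcribes the same rules into dictionaries and parses top-down with backtracking. The function names (parse, parse_sequence, parse_sentence) and the tuple-based tree format are choices made for this example, not part of the original slides.

```python
# Grammar and lexicon transcribed from the DCG rules.
RULES = {
    "s": [["np", "vp"]],
    "np": [["det", "noun"], ["proper_noun"]],
    "vp": [["v", "np"], ["v"]],
}
WORDS = {
    "det": {"a", "an", "the"},
    "noun": {"apple", "orange"},
    "proper_noun": {"john", "mary"},
    "v": {"eats", "loves"},
}

def parse(symbol, tokens, start):
    """Yield (tree, next_position) for each way symbol matches tokens[start:]."""
    if symbol in WORDS:
        if start < len(tokens) and tokens[start] in WORDS[symbol]:
            yield (symbol, tokens[start]), start + 1
        return
    for body in RULES[symbol]:
        for children, end in parse_sequence(body, tokens, start):
            yield (symbol, *children), end

def parse_sequence(symbols, tokens, start):
    """Yield (list_of_trees, next_position) for a sequence of symbols."""
    if not symbols:
        yield [], start
        return
    for tree, mid in parse(symbols[0], tokens, start):
        for rest, end in parse_sequence(symbols[1:], tokens, mid):
            yield [tree] + rest, end

def parse_sentence(sentence):
    """Return the first parse that consumes every token, or None."""
    tokens = sentence.lower().rstrip(".").split()
    for tree, end in parse("s", tokens, 0):
        if end == len(tokens):
            return tree
    return None

tree = parse_sentence("john eats an apple.")
print(tree)
```

Generators give the same try-the-next-rule behavior that Prolog gets from backtracking. This approach works here because the grammar has no left-recursive rules; a left-recursive grammar would need a different strategy such as chart parsing.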
Semantic Net
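Parse tree → Semantic net

[Figure: the parse tree of "I want to print Ali’s .init file", repeated from the previous slide.]

Semantic net edges:
want --who--> I
want --what--> print
print --who--> I
print --what--> file
file --who’s--> Ali
file --type--> .init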
Source: H. Al-Muhtaseb
Semantic analysis
• To whom does the pronoun ‘I’ refer?
• To whom does the proper noun ‘Ali’ refer?
• Which files are to be printed?

[Figure: the semantic net from the previous slide, repeated.]

Execute the command:
lpr /ali/stuff.init
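The step from semantic net to command can be sketched in Python as follows. The edge list is one reading of the slide's figure (edge directions are an assumption), and HOME_DIRS, FILES, and COMMANDS are hypothetical stand-ins for the world knowledge a real system would need to resolve "Ali" and find matching files.

```python
# Semantic net as (node, relation, node) triples; directions are assumed.
SEMANTIC_NET = [
    ("want", "who", "I"),
    ("want", "what", "print"),
    ("print", "who", "I"),
    ("print", "what", "file"),
    ("file", "who's", "Ali"),
    ("file", "type", ".init"),
]

# Hypothetical world knowledge used to resolve references.
HOME_DIRS = {"Ali": "/ali"}
FILES = {"/ali": ["stuff.init", "notes.txt"]}
COMMANDS = {"print": "lpr"}

def lookup(node, relation):
    """Return the target of the first edge leaving node with this relation."""
    for source, rel, target in SEMANTIC_NET:
        if source == node and rel == relation:
            return target
    return None

def build_commands():
    """Walk the net from 'want' and emit one shell command per matching file."""
    action = lookup("want", "what")    # "print"
    obj = lookup(action, "what")       # "file"
    owner = lookup(obj, "who's")       # "Ali"
    suffix = lookup(obj, "type")       # ".init"
    directory = HOME_DIRS[owner]
    return [
        f"{COMMANDS[action]} {directory}/{name}"
        for name in FILES[directory]
        if name.endswith(suffix)
    ]

commands = build_commands()
print(commands)  # ['lpr /ali/stuff.init']
```

The commands are only built as strings, not executed, so the sketch is safe to run anywhere.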
• Who/when/where/what … are involved in an event?
• How to connect the semantic representations of different sentences?
• What is the cause of an event and what is the consequence of an action?
• Practical usage of language: what a sentence means in practice?
Thank You!
