Tagging and Morphological Disambiguation of Turkish Text: Kemal Oflazer and İlker Kuruöz

The document discusses a POS tagger for Turkish text that utilizes a full-scale two-level morphological specification and a lexicon of approximately 24,000 root words. It highlights the importance of morphological disambiguation in tagging due to the agglutinative nature of Turkish, which often leads to ambiguities in lexical forms. The tagger achieves high accuracy rates and integrates functionalities such as multi-word construct recognition and statistical analysis to enhance performance in natural language processing applications.

Tagging and Morphological Disambiguation of Turkish Text

Kemal Oflazer and İlker Kuruöz

Department of Computer Engineering and Information Science
Bilkent University
Bilkent, Ankara, TURKEY
fko,[email protected]

cmp-lg/9407026, 29 Jul 1994

Abstract

Automatic text tagging is an important component in higher level analysis of text corpora, and its output can be used in many natural language processing applications. In languages like Turkish or Finnish, with agglutinative morphology, morphological disambiguation is a crucial process in tagging, as the structures of many lexical forms are morphologically ambiguous. This paper describes a POS tagger for Turkish text based on a full-scale two-level specification of Turkish morphology that is based on a lexicon of about 24,000 root words. This is augmented with a multi-word and idiomatic construct recognizer and, most importantly, a morphological disambiguator based on local neighborhood constraints, heuristics and a limited amount of statistical information. The tagger also has functionality for statistics compilation and fine tuning of the morphological analyzer, such as logging erroneous morphological parses, commonly used roots, etc. Preliminary results indicate that the tagger can tag about 98-99% of the texts accurately with very minimal user intervention. Furthermore, for sentences morphologically disambiguated with the tagger, an LFG parser developed for Turkish generates, on the average, 50% fewer ambiguous parses and parses almost 2.5 times faster. The tagging functionality is not specific to Turkish, and can be applied to any language with a proper morphological analysis interface.

1 Introduction

As a part of a large scale project on natural language processing for Turkish, we have undertaken the development of a number of tools for analyzing Turkish text. This paper describes one such tool, a text tagger for Turkish. The tagger is based on a full scale two-level morphological specification of Turkish (Oflazer, 1993), implemented on the PC-KIMMO environment (Antworth, 1990). In this paper, we describe the functionality and the performance of our tagger along with various techniques that we have employed to deal with various sources of ambiguities.

2 Tagging Text

Automatic text tagging is an important step in discovering the linguistic structure of large text corpora. Basic tagging involves annotating the words in a given text with various pieces of information, such as part-of-speech and other lexical features. Part-of-speech tagging facilitates higher-level analysis, such as parsing, essentially by performing a certain amount of ambiguity resolution using relatively cheaper methods.

The most important functionality of a tagger is the resolution of the structure and parts-of-speech of the lexical items in the text. This, however, is not a trivial task since many words are in general ambiguous in their part-of-speech for various reasons. In English, for example, a word such as make can be a verb or a noun. In Turkish, even though there are ambiguities of this sort, the agglutinative nature of the language usually helps resolution of such ambiguities due to morphotactical restrictions. On the other hand, this very nature introduces another kind of ambiguity, where a lexical form can be morphologically interpreted in many ways. For example, the word evin can be broken down as:[1]

    evin                  POS   English
    1. N(ev)+2SG-POSS     N     (your) house
    2. N(ev)+GEN          N     of the house
    3. N(evin)            N     wheat germ

[1] Output of the morphological analyzer is edited for clarity.

If, however, the local context is considered, it may be possible to resolve the ambiguity as in:
    .. sen-in         ev-in ..
       PN(you)+GEN    N(ev)+2SG-POSS
       your house

    .. evin           kapı-sı ..
       N(ev)+GEN      N(door)+3SG-POSS
       door of the house

using genitive-possessive agreement constraints.

As a more complex case we can give the following:

    alınmış
    1. ADJ(al)+2SG-POSS+NtoV()+NARR+3SG [2]   (V)    (it) was your red (one)
    2. ADJ(al)+GEN+NtoV()+NARR+3SG            (V)    (it) belongs to the red (one)
    3. N(alın)+NtoV()+NARR+3SG                (V)    (it) was a forehead
    4. V(al)+PASS+VtoAdj(mis)                 (ADJ)  (a) taken (object)
    5. V(al)+PASS+NARR+3SG                    (V)    (it) was taken
    6. V(alın)+VtoAdj(mis)                    (ADJ)  (an) offended (person)
    7. V(alın)+NARR+3SG                       (V)    (s/he) was offended

[2] In Turkish, all adjectives can be used as nouns, hence with very minor differences adjectives have the same morphotactics as nouns.

It is in general rather hard to select one of these interpretations without doing substantial analysis of the local context, and even then one cannot fully resolve such (usually semantic) ambiguities.

An additional problem that can be off-loaded to the tagger is the recognition of multi-word or idiomatic constructs. In Turkish, which abounds with such forms, such a recognizer can recognize these very productive multi-word constructs, like

    koş-a          koş-a
    run+OPT+3SG    run+OPT+3SG

    yap-ar         yap-ma-z
    do+AOR+3SG     do+NEG+AOR+3SG

where both components are verbal but the compound construct is a manner or temporal adverb. This relieves the parser from dealing with them at the syntactic level. Furthermore, it is also possible to recognize various proper nouns with this functionality. Such help from a tagging functionality would simplify the development of parsers for Turkish (Demir, 1993; Güngördü, 1993).

Researchers have used a number of different approaches for building text taggers. Karlsson (Karlsson, 1990) has used a rule-based approach where the central idea is to maximize the use of morphological information. Local constraints expressed as rules basically discard many alternative parses whenever possible. Brill (Brill, 1992) has designed a rule-based tagger for English. The tagger works by automatically recognizing rules and remedying its weaknesses, thereby incrementally improving its performance. More recently, there has been a rule-based approach implemented with finite-state machines (Koskenniemi et al., 1992; Voutilainen and Tapanainen, 1993).

A completely different approach to tagging uses statistical methods (e.g., Church, 1988; Cutting et al., 1993). These systems essentially train a statistical model using a previously hand-tagged corpus and provide the capability of resolving ambiguity on the basis of the most likely interpretation. The models that have been widely used assume that the part-of-speech of a word depends on the categories of the two preceding words. However, the applicability of such approaches to word-order free languages remains to be seen.

2.1 An example

We can describe the process of tagging by showing the analysis for the sentence:

    İşten döner dönmez evimizin yakınında bulunan derin gölde yüzerek gevşemek en büyük zevkimdi.
    (Relaxing by swimming in the deep lake near our house, as soon as I return from work, was my greatest pleasure.)

which we assume has been processed by the morphological analyzer with the following output:

    Word        Parse                                  POS
    işten       1. N(iş)+ABL                           N+
    döner       1. N(döner)                            N
                2. V(dön)+AOR+3SG                      V+
                3. V(dön)+VtoAdj(er)                   ADJ
    dönmez      1. V(dön)+NEG+AOR+3SG                  V+
                2. V(dön)+VtoAdj(mez)                  ADJ
    evimizin    1. N(ev)+1PL-POSS+GEN                  N+
    yakınında   1. ADJ(yakın)+3SG-POSS+LOC             N+
                2. ADJ(yakın)+2SG-POSS+LOC             N
    bulunan     1. V(bul)+PASS+VtoADJ(yan)             ADJ
                2. V(bulun)+VtoADJ(yan)                ADJ+
    derin       1. N(deri)+2SG-POSS                    N
                2. ADJ(derin)                          ADJ+
                3. V(der)+IMP+2PL                      V
                4. V(de)+VtoADJ(er)+2SG-POSS           N
                5. V(de)+VtoADJ(er)+GEN                N
    gölde       1. N(göl)+LOC                          N+
    yüzerek     1. V(yüz)+VtoADV(yerek)                ADV+
    gevşemek    1. V(gevşe)+VtoINF(mak)                V+
    en          1. N(en)                               N
                2. ADV(en)                             ADV+
    büyük       1. ADJ(büyük)                          ADJ+
    zevkimdi    1. N(zevk)+1SG-POSS+NtoV()+PAST+3SG    V+
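To make the size of the search space concrete, the analyzer output above can be written down as a list of candidate parse sets, one per word. The following is a minimal sketch, not the authors' implementation: the data layout and the function names are assumptions made for illustration, the Turkish spellings are restored by hand, and the index of the preferred parse simply mirrors the "+" mark in the listing.

```python
from math import prod

# (word, [(parse, final POS), ...], index of the parse marked "+")
# Transcribed from the analyzer output above; the layout is illustrative only.
ANALYSES = [
    ("işten", [("N(iş)+ABL", "N")], 0),
    ("döner", [("N(döner)", "N"), ("V(dön)+AOR+3SG", "V"),
               ("V(dön)+VtoAdj(er)", "ADJ")], 1),
    ("dönmez", [("V(dön)+NEG+AOR+3SG", "V"), ("V(dön)+VtoAdj(mez)", "ADJ")], 0),
    ("evimizin", [("N(ev)+1PL-POSS+GEN", "N")], 0),
    ("yakınında", [("ADJ(yakın)+3SG-POSS+LOC", "N"),
                   ("ADJ(yakın)+2SG-POSS+LOC", "N")], 0),
    ("bulunan", [("V(bul)+PASS+VtoADJ(yan)", "ADJ"),
                 ("V(bulun)+VtoADJ(yan)", "ADJ")], 1),
    ("derin", [("N(deri)+2SG-POSS", "N"), ("ADJ(derin)", "ADJ"),
               ("V(der)+IMP+2PL", "V"), ("V(de)+VtoADJ(er)+2SG-POSS", "N"),
               ("V(de)+VtoADJ(er)+GEN", "N")], 1),
    ("gölde", [("N(göl)+LOC", "N")], 0),
    ("yüzerek", [("V(yüz)+VtoADV(yerek)", "ADV")], 0),
    ("gevşemek", [("V(gevşe)+VtoINF(mak)", "V")], 0),
    ("en", [("N(en)", "N"), ("ADV(en)", "ADV")], 1),
    ("büyük", [("ADJ(büyük)", "ADJ")], 0),
    ("zevkimdi", [("N(zevk)+1SG-POSS+NtoV()+PAST+3SG", "V")], 0),
]


def ambiguous_words(analyses):
    """Words for which the analyzer returned more than one parse."""
    return [word for word, parses, _ in analyses if len(parses) > 1]


def total_readings(analyses):
    """Number of distinct parse assignments for the whole sentence."""
    return prod(len(parses) for _, parses, _ in analyses)


def marked_tags(analyses):
    """(word, POS) pairs for the parses marked "+" in the listing."""
    return [(word, parses[i][1]) for word, parses, i in analyses]


if __name__ == "__main__":
    print(ambiguous_words(ANALYSES))
    print(total_readings(ANALYSES))
```

Six of the thirteen words are ambiguous, and the product of the parse counts gives 240 possible assignments for this one sentence, which is the space a parser would otherwise have to explore.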
Although there are a number of choices for tags for the lexical items in the sentence, almost all except one set of choices give rise to ungrammatical or implausible sentence structures.[3] There are a number of points that are of interest here:

• The construct döner dönmez, formed by two tensed verbs, is actually a temporal adverb meaning ... as soon as .. return(s), hence these two lexical items can be coalesced into a single lexical item and tagged as a temporal adverb.
• The second person singular possessive interpretation of yakınında is not possible since this word forms a simple compound noun phrase with the previous lexical item, and the third person singular possessive morpheme functions as the compound marker, agreeing with the agreement of the previous genitive case-marked form.
• The word derin (deep) is the modifier of a simple compound noun derin göl (deep lake), hence the second choice can safely be selected. The verbal root in the third interpretation is very unlikely to be used in text, let alone in second person imperative form. The fourth and the fifth interpretations are not very plausible either. The first interpretation (meaning your skin) may be a possible choice but can be discarded in the middle of a longer compound noun phrase.
• The word en preceding an adjective indicates a superlative construction, and hence the noun reading can be discarded.

[3] The correct choices of tags are marked with +.

3 The Tagging Tool

The tagging tool that we have developed integrates the following functionality with a user interface, as shown in Figure 1, implemented under X-windows. It can be used interactively, though user interaction is very rare and (optionally) occurs only when the disambiguation cannot be done by the tagger.

1. Morphological analysis with error logging,
2. Multi-word and idiomatic construct recognition,
3. Morphological disambiguation by using constraints, heuristics and certain statistics,
4. Root and lexical form statistics compilation.

Figure 1: User interface of tagging tool

The second and the third functionalities are implemented by a rule-based subsystem which allows one to write rules of the following form:

    C1 : A1; C2 : A2; ... Cn : An.

where each Ci is a set of constraints on a lexical form, and the corresponding Ai is an action to be executed on the set of parses associated with that lexical form, only when all the conditions are satisfied.

The conditions refer to any available morphological or positional feature associated with a lexical form such as:

• absolute or relative lexical position (e.g., sentence initial or final, or 1 after the current word, etc.),
• root and final POS category,
• derivation type,
• case, agreement (number and person), and certain semantic markers, for nominal forms,
• aspect and tense, subcategorization requirements, verbal voice, modality, and sense for verbal forms,
• subcategorization requirements for postpositions.

Conditions may refer to absolute feature values or variables (as in Prolog, denoted by the prefix _ in the following examples) which are then used to link conditions. All occurrences of a variable have to unify for the match to be considered successful. This feature is powerful and lets us specify in a rather general way (possibly long distance) feature constraints in complex NPs, PPs and VPs. This is a part of our approach that distinguishes it from other constraint-based approaches.

The actions are of the following types:

• Null action: Nothing is done on the matching parse.
• Delete: Removes the matching parse if more than one parse for the lexical form is still in the set associated with the lexical form.
• Output: Removes all but the matching parse from the set, effectively tagging the lexical form with the matching parse.
• Compose: Composes a new parse from various matching parses, for multi-word constructs.

These rules are ordered and applied in the given order, and actions licensed by any matching rule are applied. One rule formalism is used to encode both multi-word constructs and constraints.

3.1 The Multi-word Construct Processor

As mentioned before, tagging text on a lexical item basis may generate spurious or incorrect results when multiple lexical items act as a single syntactic or semantic entity. For example, in the sentence Şirin mi şirin bir köpek koşa koşa geldi (A very cute dog came running) the fragment şirin mi şirin constitutes a duplicated emphatic adjective in which there is an embedded[4] question suffix mi (written separately in Turkish), and the fragment koşa koşa is a duplicated verbal construction, which has the grammatical role of manner adverb in the sentence, though both of the constituent forms are verbal constructions. The purpose of the multi-word construct processor is to detect and tag such productive constructs in addition to various other semantically coalesced forms such as proper nouns, etc.

[4] If, however, the adjective şirin was not repeated, then we would have a question formation.

The following is a set of multi-word constructs for Turkish that we handle in our tagger. This list is not meant to be comprehensive, and new construct specifications can easily be added. It is conceivable that such a functionality can be used in almost any language.

1. duplicated optative and 3SG verbal forms functioning as manner adverb, e.g., koşa koşa, aorist verbal forms with root duplications and sense negation functioning as temporal adverbs, e.g., yapar yapmaz, and duplicated verbal and derived adverbial forms with the same verbal root acting as temporal adverbs, e.g., gitti gideli,
2. duplicated compound nominal form constructions that act as adjectives, e.g., güzeller güzeli, and emphatic adjectival forms involving the question suffix, e.g., güzel mi güzel,
3. adjective or noun duplications that act as manner adverbs, e.g., hızlı hızlı, ev ev,
4. idiomatic word sequences with specific usage whose semantics is not compositional, e.g., yanı sıra, hiç olmazsa, and idiomatic forms which are never used singularly, e.g., gurul gurul,
5. proper nouns, e.g., Jimmy Carter, Topkapı Sarayı (Topkapı Palace),
6. compound verb formations which are formed by a lexically adjacent direct or oblique object and a verb, which, for the purposes of syntactic analysis, may be considered as a single lexical item.

We can give the following example for specifying a multi-word construct:[5]

    Lex=_W1, Root=_R1, Cat=V, Aspect=AOR, Agr=3SG, Sense=POS: ;
    Lex=_W2, Root=_R1, Cat=V, Aspect=AOR, Agr=3SG, Sense=NEG:
        Compose=((*CAT* ADV)(*R* "_W1 _W2 (_R1)")(*SUB* TEMP)).

[5] The output of the morphological analyzer is actually a feature-value list in the standard LISP format.

This rule would match any adjacent verbal lexical forms with the same root, both with the aorist aspect and 3SG agreement. The first verb has to be positive and the second one negated. When found, a composite lexical form with a temporal adverb part-of-speech is generated. The original verbal root may be recovered from the root of the composed form for any subcategorization checks at the syntactic level.

3.2 Using constraints for morphological ambiguity resolution

Morphological analysis does not have access to syntactic context, so when the morphological structure of a lexical form has several distinct analyses, it is not possible to disambiguate such cases except maybe by using root usage frequencies. For disambiguation one may have to use information provided by sentential position and the local morphosyntactic context. Voutilainen and Heikkilä (Voutilainen et al., 1992) have proposed a constraint grammar approach where one specifies constraints on the local context of a word to disambiguate among multiple readings of a word. Their approach has, however, been applied to English, where morphological information has rather little use in such resolution.

In our tagger, constraints are applied on each word, and check if the forms within a specified neighborhood of the word satisfy certain morphosyntactic or positional restrictions, and/or agreements. Our constraint pattern specification is very similar to the multi-word construct specification. The use of variables, operators and actions is the same, except that the compose action does not make sense here. The following is an example constraint that is used to select the postpositional reading of a certain word when it is preceded by a yet unresolved nominal form with a certain case. The only requirement is that the case of the nominal form agrees with the case subcategorization requirement of the following postposition. (LP = 0 refers to the current word, LP = 1 refers to the next word.)

    LP = 0, Case = _C : Output;
    LP = 1, Cat = POSTP, Subcat = _C : Output.

When a match is found, the matching parses from both words are selected and the others are discarded. This one constraint disambiguates almost all of the postpositions and their arguments, the exceptions being nominal words which semantically convey the information provided by the case (such as words indicating direction, which may be used as if they have a dative case).

Finally, the following example constraint deletes the sentence-final adjectival readings derived from verbs, effectively preferring the verbal reading (as Turkish is an SOV language).

    Cat = V, Finalcat = ADJ, SP = END : Delete.

4 Performance of the Tagger

We have performed some preliminary experiments to assess the effectiveness of our tagger. We have used about 250 constraints for Turkish. Some of these constraints are very general, like the postposition rule above, while some are geared towards recognition of NPs of various sorts, and a small number apply certain syntactic heuristics. In this section, we summarize our preliminary results. Table 1 presents some preliminary results from our tagging experiments.

Although the texts that we have experimented with are rather small, the results indicate that our approach is effective in disambiguating morphological structures, and hence POS, with minimal user intervention. Currently, the speed of the tagger is limited essentially by that of the morphological analyzer, but we have ported the morphological analyzer to the XEROX TWOL system developed by Karttunen and Beesley (Karttunen and Beesley, 1992). This system can analyze Turkish word forms at about 1000 forms/sec on SparcStation 10's. We intend to integrate this into our tagger soon, improving its speed considerably.

We have tested the impact of morphological disambiguation on the performance of an LFG parser developed for Turkish (Güngördü, 1993; Güngördü and Oflazer, 1994). The input to the parser was disambiguated using the tool developed, and the results were compared to the case when the parser had to consider all possible morphological ambiguities itself. For a set of 80 sentences, Table 2 shows that morphological disambiguation enables almost a factor of two reduction in the average number of parses generated and over a factor of two speed-up in time.

5 Conclusions

This paper has presented an overview of a tool for tagging text along with various issues that have come up in disambiguating morphological parses of Turkish words. We have noted that the use of constraints is very effective in morphological disambiguation. Preliminary results indicate that the tagger can tag about 98-99% of the texts accurately with very minimal user intervention, though it is conceivable that it may do worse on more substantial text, and there is certainly room for improvement in the mechanisms provided. The tool also provides for recognition of multi-word constructs that behave as a single syntactic and semantic entity in higher level analysis, and the compilation of information for fine-tuning of the morphological analyzer and the tagger itself. We, however, feel that our approach does not deal satisfactorily with most aspects of word-order freeness. We are currently working on an extension whereby the rules do not apply immediately but vote on their preferences, and a final global vote tally determines the assignments.

6 Acknowledgment

This research was supported in part by a NATO Science for Stability Program Grant, TU-LANGUAGE.

References

E. L. Antworth. 1990. PC-KIMMO: A Two-level Processor for Morphological Analysis. Summer Institute of Linguistics, Dallas, Texas.
Table 1: Statistics on texts tagged, and tagging and disambiguation results

                     Morphological Parse Distribution
    Text   Words     0       1       2       3       4       5
    1        468     7.3%    28.7%   41.1%   11.1%    7.1%    4.7%
    2        573     1.0%    30.2%   37.3%   13.1%   11.1%    7.3%
    3        533     3.8%    24.8%   38.1%   19.1%    9.2%    5.0%
    4       7004     3.9%    17.2%   41.5%   15.6%   11.7%   10.1%

Note: Words with zero parses are proper names which are not in the lexicon of the morphological analyzer.

           % Correctly     % Tagged   % Correctly    Automatic Disambiguation by
           Tagged          by User    Tagged Total   Multi-word    Constraints
    Text   Automatically                             Rules
    1      98.5            1.0        99.1           10.1          67.7
    2      98.5            0.3        98.8            7.5          74.4
    3      97.8            1.1        98.9            3.1          74.5
    4      95.4            1.7        97.1            4.2          76.4
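The parse distribution in Table 1 also shows how much of each text is ambiguous before any disambiguation. The sketch below only re-derives that share from the percentages in the table; the variable and function names are assumptions made for illustration and are not part of the tagger.

```python
# Percent of words with 0, 1, 2, 3, 4 and 5 parses, copied from Table 1.
PARSE_DISTRIBUTION = {
    1: [7.3, 28.7, 41.1, 11.1, 7.1, 4.7],
    2: [1.0, 30.2, 37.3, 13.1, 11.1, 7.3],
    3: [3.8, 24.8, 38.1, 19.1, 9.2, 5.0],
    4: [3.9, 17.2, 41.5, 15.6, 11.7, 10.1],
}


def ambiguous_share(row):
    """Percent of words that received two or more parses."""
    return round(sum(row[2:]), 1)


if __name__ == "__main__":
    for text, row in PARSE_DISTRIBUTION.items():
        # Each row of the table should account for all words of the text.
        assert abs(sum(row) - 100.0) < 0.05
        print(text, ambiguous_share(row))
```

With these figures the ambiguous share comes out as 64.0%, 68.8%, 71.4% and 78.9% for texts 1 to 4. The calculation does not depend on whether the last column means exactly five parses or five and more, which the table does not state.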

Table 2: Impact of disambiguation on parsing performance

                   No disambiguation       With disambiguation     Ratios
    Avg. length    Avg.      Avg.          Avg.      Avg.
    (words)        parses    time (sec)    parses    time (sec)    parses    speed-up
    5.7            5.78      29.11         3.30      11.91         1.97      2.38

Note: The ratios are the averages of the sentence by sentence ratios.
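The note under Table 2 matters when reading the last two columns: an average of per-sentence ratios is generally not the same number as the ratio of the two column averages. The following sketch illustrates the difference. The four averages are taken from Table 2; the per-sentence parse counts are invented purely for illustration and are not data from the paper.

```python
from statistics import mean

# Column averages reported in Table 2.
AVG_PARSES_BEFORE, AVG_PARSES_AFTER = 5.78, 3.30
AVG_TIME_BEFORE, AVG_TIME_AFTER = 29.11, 11.91


def ratio_of_means(before, after):
    """Ratio of the two averages (what dividing table columns gives)."""
    return mean(before) / mean(after)


def mean_of_ratios(before, after):
    """Average of sentence-by-sentence ratios (what Table 2 reports)."""
    return mean(b / a for b, a in zip(before, after))


if __name__ == "__main__":
    # Dividing the reported averages does not reproduce the "Ratios" columns.
    print(round(AVG_PARSES_BEFORE / AVG_PARSES_AFTER, 2))  # vs. 1.97 in Table 2
    print(round(AVG_TIME_BEFORE / AVG_TIME_AFTER, 2))      # vs. 2.38 in Table 2

    # HYPOTHETICAL per-sentence parse counts, for illustration only.
    before = [2, 4, 12]
    after = [1, 2, 9]
    print(ratio_of_means(before, after), mean_of_ratios(before, after))
```

Dividing the reported averages gives about 1.75 for parses and 2.44 for time, so the 1.97 and 2.38 in the table should be read as per-sentence averages, as the note says.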

E. Brill. 1992. A simple rule-based part-of-speech tagger. In Proceedings of the Third Conference on Applied Computational Linguistics, Trento, Italy.

K. W. Church. 1988. A stochastic parts program and noun phrase parser for unrestricted text. In Proceedings of the Second Conference on Applied Natural Language Processing (ACL), pages 136-143.

D. Cutting, J. Kupiec, J. Pedersen, and P. Sibun. 1993. A practical part-of-speech tagger. Technical report, Xerox Palo Alto Research Center.

C. Demir. 1993. An ATN grammar for Turkish. Master's thesis, Department of Computer Engineering and Information Sciences, Bilkent University, Ankara, Turkey, July.

Z. Güngördü and K. Oflazer. 1994. Parsing Turkish using the Lexical-Functional Grammar formalism. In Proceedings of COLING-94, the 15th International Conference on Computational Linguistics, Kyoto, Japan.

Z. Güngördü. 1993. A Lexical-Functional Grammar for Turkish. Master's thesis, Department of Computer Engineering and Information Sciences, Bilkent University, Ankara, Turkey, July.

F. Karlsson. 1990. Constraint grammar as a framework for parsing running text. In Proceedings of COLING-90, the 13th International Conference on Computational Linguistics, volume 3, pages 168-173, Helsinki, Finland.

L. Karttunen and K. R. Beesley. 1992. Two-level rule compiler. Technical Report, XEROX Palo Alto Research Center.

K. Koskenniemi, P. Tapanainen, and A. Voutilainen. 1992. Compiling and using finite-state syntactic rules. In Proceedings of COLING-92, the 14th International Conference on Computational Linguistics, volume 1, pages 156-162, Nantes, France.

K. Oflazer. 1993. Two-level description of Turkish morphology. In Proceedings of the Sixth Conference of the European Chapter of the Association for Computational Linguistics, April. A full version appears in Literary and Linguistic Computing, Vol.9 No.2, 1994.

A. Voutilainen and P. Tapanainen. 1993. Ambiguity resolution in a reductionistic parser. In Proceedings of EACL'93, Utrecht, Holland.

A. Voutilainen, J. Heikkilä, and A. Anttila. 1992. Constraint Grammar of English. University of Helsinki.
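As a closing illustration of the rule formalism of Section 3, the sketch below shows one way that condition/action pairs with Prolog-style variables could be matched against sets of parses. It is a toy reconstruction under stated assumptions, not the tagger's implementation: the dictionary-based parse representation, the feature names, the function names, the example parses for eve doğru, and the "superlative" rule (a hypothetical encoding of the en heuristic from Section 2.1) are all invented for illustration. Only relative position is modelled, and the Compose action and sentential-position conditions are left out.

```python
from itertools import product


def unify(conditions, parse, bindings):
    """Return bindings extended by this parse, or None if it does not match."""
    new = dict(bindings)
    for feature, wanted in conditions.items():
        actual = parse.get(feature)
        if actual is None:
            return None
        if isinstance(wanted, str) and wanted.startswith("_"):
            # Variable: every occurrence must take the same value.
            if new.setdefault(wanted, actual) != actual:
                return None
        elif wanted != actual:
            return None
    return new


def apply_rule(rule, words, start):
    """Match rule against consecutive words from `start`; act on the first match."""
    window = words[start:start + len(rule)]
    if len(window) < len(rule):
        return False
    for combo in product(*(word["parses"] for word in window)):
        bindings = {}
        for (conditions, _), parse in zip(rule, combo):
            bindings = unify(conditions, parse, bindings)
            if bindings is None:
                break
        else:
            for (_, action), word, parse in zip(rule, window, combo):
                if action == "output":          # keep only the matching parse
                    word["parses"] = [parse]
                elif action == "delete" and len(word["parses"]) > 1:
                    word["parses"].remove(parse)
            return True
    return False


# The postposition constraint of Section 3.2, in this toy notation.
POSTPOSITION_RULE = [
    ({"case": "_C"}, "output"),
    ({"cat": "POSTP", "subcat": "_C"}, "output"),
]

# HYPOTHETICAL rule: drop the noun reading of "en" before an adjective.
SUPERLATIVE_RULE = [
    ({"root": "en", "cat": "N"}, "delete"),
    ({"cat": "ADJ"}, None),                     # null action
]

if __name__ == "__main__":
    # Illustrative parses for "eve doğru" (towards the house).
    phrase = [
        {"lex": "eve", "parses": [{"cat": "N", "root": "ev", "case": "DAT"}]},
        {"lex": "doğru", "parses": [
            {"cat": "ADJ", "root": "doğru"},
            {"cat": "POSTP", "root": "doğru", "subcat": "DAT"},
        ]},
    ]
    apply_rule(POSTPOSITION_RULE, phrase, 0)
    print(phrase[1]["parses"])
```

Trying every combination of parses in the window keeps the sketch short; for the small neighborhoods the rules describe this is cheap, but a real system would presumably index parses by feature instead.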
