2013
In this paper we present our experiments in parsing Telugu. We explore two data-driven parsers, Malt and MST, and compare the results of both. We describe the data and parser settings used in detail; some of these are specific to one particular Indian language or to Indian languages in general. The averages of the best unlabeled attachment, labeled attachment and labeled accuracy scores are 88.43%, 69.71% and 70.01% respectively. We also present which parser gives the best results for different sentence types in Telugu.
2019
In this paper, we describe a manually annotated Telugu corpus developed by following the DS guidelines (2009), and we experiment with our Telugu dependency treebank data on data-driven parsers, Malt (Nivre et al., 2007a) and MST (McDonald et al., 2006), for parsing Telugu sentences. In the dependency annotation, we link heads and dependents with their dependency relations (drels), assigning kāraka and non-kāraka relations to them. The annotated Telugu data contains tokens with their morph information, POS tags, chunks and drels. We used our final Telugu treebank data in CoNLL format for parsing with the Malt and MST parsers. We evaluated the labeled attachment score (LAS), unlabeled attachment score (UAS) and labeled accuracy (LA) for both parsers and also compared their scores on individual dependency relations. Finally, we analyzed the most frequent errors that occurred after parsing the sentences and explained them with relevant examples and appropriate linguistic analysis, so that we can improve the...
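The CoNLL token format used by both Malt and MST can be read with a few lines of code. A minimal sketch, assuming the CoNLL-X column layout (ID, FORM, LEMMA, CPOSTAG, POSTAG, FEATS, HEAD, DEPREL); the Telugu tokens, morph features and drel labels in the sample are illustrative, not drawn from the treebank:

```python
# Minimal CoNLL-X reader: blank lines separate sentences, tabs separate columns.
def read_conll(text):
    cols = ["id", "form", "lemma", "cpos", "pos", "feats", "head", "deprel"]
    sent, sents = [], []
    for line in text.splitlines():
        line = line.strip()
        if not line:                      # sentence boundary
            if sent:
                sents.append(sent)
                sent = []
            continue
        tok = dict(zip(cols, line.split("\t")))
        tok["id"], tok["head"] = int(tok["id"]), int(tok["head"])
        sent.append(tok)
    if sent:
        sents.append(sent)
    return sents

# Illustrative two-token sentence "rAmu vaccADu" (hypothetical morph/drel values).
sample = ("1\trAmu\trAmu\tn\tNN\tcase-nom\t2\tk1\n"
          "2\tvaccADu\tvaccu\tv\tVM\ttense-past\t0\troot\n")
sents = read_conll(sample)
```

The `head` column holds the 1-based index of each token's head, with 0 marking the root, which is all a parser's evaluation of UAS/LAS needs.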
Journal of King Saud University - Computer and Information Sciences, 2017
In this paper we explore different statistical dependency parsers for parsing Telugu. We consider five popular dependency parsers, namely MaltParser, MSTParser, TurboParser, ZPar and the Easy-First Parser. We experiment with different parser and feature settings and show the impact of each. We also provide a detailed analysis of the performance of all the parsers on major dependency labels. We report our results on the test data of the Telugu dependency treebank provided in the ICON 2010 tools contest on Indian language dependency parsing. We obtain state-of-the-art performance of 91.8% unlabeled attachment score and 70.0% labeled attachment score. To the best of our knowledge, ours is the only work that has explored all five popular dependency parsers and compared their performance under different feature settings for Telugu.
Translation Today, 2021
This paper is an attempt in building a rule-based dependency parser for Telugu which can parse simple sentences. This study adopts Pāṇini's Grammatical (PG) tradition i.e., the dependency model to parse sentences. A detailed description of mapping semantic relations to vibhaktis (case suffixes and postpositions) in Telugu using PG is presented. The paper describes the algorithm and the linguistic knowledge employed while developing the parser. The research further provides results, which suggest that enriching the current parser with linguistic inputs can increase the accuracy and tackle ambiguity better than existing data-driven methods.
Very few attempts at dependency parsing for Tamil have been reported in the literature. In this paper, we report results obtained for Tamil dependency parsing with rule-based and corpus-based approaches. We designed an annotation scheme partially based on the Prague Dependency Treebank (PDT) and manually annotated Tamil data (about 3,000 words) with dependency relations. For the corpus-based approach, we used two well-known parsers, MaltParser and MSTParser; for the rule-based approach, we implemented a series of linguistic rules (for resolving coordination, complementation, predicate identification and so on) to build dependency structures for Tamil sentences. Our initial results show that both the rule-based and corpus-based approaches achieved an accuracy of more than 74% on the unlabeled task and more than 65% on the labeled task. Rule-based parsing accuracy dropped considerably when the input was tagged automatically.
ACM Transactions on Asian and Low-Resource Language Information Processing, 2015
We show that Combinatory Categorial Grammar (CCG) supertags can improve Telugu dependency parsing. In this process, we first extract a CCG lexicon from the dependency treebank. Using both the CCG lexicon and the dependency treebank, we create a CCG treebank using a chart parser. Exploring different morphological features of Telugu, we develop a supertagger using maximum entropy models. We provide CCG supertags as features to the Telugu dependency parser (MST parser). We get an improvement of 1.8% in the unlabelled attachment score and 2.2% in the labelled attachment score. Our results show that CCG supertags improve the MST parser, especially on verbal arguments for which it has weak rates of recovery.
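One simple way to expose supertags to a CoNLL-trained parser is to fold them into the FEATS column of each token line. A sketch under stated assumptions: the CoNLL-X column order, and a `stag=` feature key that is invented here for illustration, not taken from the paper:

```python
def add_supertag(conll_line, supertag):
    """Append a predicted CCG supertag to the FEATS column (6th field)
    of a tab-separated CoNLL-X token line."""
    fields = conll_line.split("\t")
    feats = fields[5]
    # "_" is the CoNLL convention for "no features"
    fields[5] = f"stag={supertag}" if feats == "_" else f"{feats}|stag={supertag}"
    return "\t".join(fields)

line = "2\tvaccADu\tvaccu\tv\tVM\t_\t0\troot"
tagged = add_supertag(line, "S\\NP")   # an intransitive-verb category
```

Because most data-driven parsers already read the FEATS column, this kind of augmentation requires no change to the parser itself, only to its feature configuration.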
We present a comparative error analysis of two parsers, MALT and MST, on Telugu Dependency Treebank data. MALT and MST are currently two of the most dominant data-driven dependency parsers. We discuss the performance of both parsers in relation to the Telugu language. We also discuss in detail both the algorithmic issues of the parsers and the language-specific constraints of Telugu. The purpose is to better understand how to help the parsers deal with complex structures, make sense of implicit language-specific cues, and build a more informed treebank.
2013
In this paper, we present our approach to dependency parsing of Hindi as part of the Hindi Shared Task on Parsing at COLING 2012. Our approach studies the effect of using the different settings available in MaltParser, following the two-step parsing strategy, i.e., splitting the data into interChunks and intraChunks, to obtain the best possible LAS, UAS and LA accuracy. Our system achieved the best LAS of 90.99% on the Gold Standard track and the second-best LAS of 83.91% on the Automated track.
In this paper we present our experiments in parsing Hindi. We first explored the Malt and MST parsers. Considering the strengths of both, we developed a hybrid approach that combines the output of the two parsers in an intuitive manner. We report our results on both the development and test data provided in the Hindi Shared Task on Parsing at the Workshop on MT and Parsing in Indian Languages, COLING 2012. Our system secured labeled attachment scores of 90.66% and 80.77% on the gold standard and automatic tracks respectively. These accuracies are the 3rd best and 5th best for the gold standard and automatic tracks respectively.
Proceedings of the NAACL …, 2010
This paper analyzes the relative importance of different linguistic features for data-driven dependency parsing of Hindi, using a feature pool derived from two state-of-the-art parsers. The analysis shows that the greatest gain in accuracy comes from the addition of morpho-syntactic features related to case, tense, aspect and modality. Combining features from the two parsers, we achieve a labeled attachment score of 76.5%, which is 2 percentage points better than the previous state of the art. We finally provide a detailed ...
2010
DeSR is a statistical transition-based dependency parser that learns from annotated corpora which actions to perform for building parse trees while scanning a sentence. We describe the experiments performed for the ICON 2010 Tools Contest on Indian Dependency Parsing. DeSR was configured to exploit specific features of the Indian treebanks. The submitted run used a stacked combination of four configurations of the DeSR parser and achieved the best unlabeled accuracy scores in all languages. The contribution of various choices to the result is analyzed.
2016
This paper describes a dependency grammar framework for a Marathi parser. Dependency grammar is a grammar formalism that captures direct word-to-word relations in a sentence. A parser is a tool that automatically analyzes a sentence and draws its syntactic tree, and a grammar formalism is the mechanism for developing such a parser. Today, the fields of computational linguistics, natural language processing and artificial intelligence work with two kinds of grammar formalism: phrase structure grammar and dependency grammar. Both formalisms have their own limitations for developing a parser. In this paper I use the computational Paninian approach to dependency grammar. Computational Paninian grammar has a 37-label dependency tagset, and these tags have been used to annotate Indian languages such as Hindi, Telugu and Bangla. I examine this dependency tagset for Marathi and annotate a corpus that is useful for developing a Marathi parser. To annotate d...
Syntactic parsing in NLP is the task of working out the grammatical structure of sentences. Some purely formal approaches to parsing, such as phrase structure grammar and dependency grammar, have been successfully employed for a variety of languages. While phrase-structure-based constituent analysis is possible for fixed-order languages such as English, dependency analysis between grammatical units has been suitable for many free word order languages such as the Indian languages. All these parsing approaches rely on identifying linguistic units based on their formal syntactic properties and establishing the relationships between such units in the form of a tree. The Dravidian languages spoken in Southern India are morphologically rich, agglutinative languages whose characterization in purely structural terms such as adjectives, adverbs, conjunctions and postpositions, as well as traditional interpretations of tense and finiteness, poses problems for their syntactic analysis that are well discussed in the literature. We propose that the morpho-syntactic structures of Dravidian languages are better analysed from the theoretical perspectives of “Cognitive Grammar” or “Construction Grammar”, where every grammatical structure is treated as a symbol that directly maps to meaningful conceptualizations. In other words, natural language is not treated as a formal system but as a functional system that is entirely symbolic or semiotic, right from lexicon to grammar. Through linguistic evidence we point out that morpho-syntactic structures in Dravidian languages have their basis in meaningful discourse conceptualizations. Subsequently we hierarchically arrange all these conceptualizations into construction schemas that exhibit multiple-inheritance relationships, and we explain all concrete morpho-syntactic structures as instances of these schemas.
Based on this fresh theoretical grounding, we model parsing as the automatic identification of meaningful dependency relations between such meaningful construction units. We formulated an annotation scheme for labelling the construction units and the dependency relations that can exist between them. Our approach to full parser annotation shows an average MALT LAS of 82.21% on a Tamil gold-annotated corpus of 935 sentences in a five-fold validation experiment. We conducted experiments varying the training data size, the annotation scheme, sentence length in terms of number of chunks, and the granularity of tags, and report the parser results for these scenarios. Finally, we build a pipeline with a splitter, construction labeller and grouper as intermediate layers before the MALT parser input, and release the working full parser module.
In this paper we address two dependency parsers for a free-word-order Indian language, namely Bengali. One of the parsers is grammar-driven, whereas the second is data-driven. The grammar-driven parser is an extension of a previously developed parser, whereas the data-driven parser is the MaltParser customized for Bengali. Both parsers are evaluated on two datasets: the ICON NLP Tool Contest data and Dataset-II (developed by us). The evaluation shows that the grammar-based parser outperforms the MaltParser on the ICON data, based on which the demand frames of the Bengali verbs were developed, but its performance degrades when dealing with completely unknown data, i.e., Dataset-II. However, MaltParser performs better on Dataset-II and on the whole data. Evaluation and error analysis further reveal that the parsers show some complementary capabilities, which indicates a future scope for their integration to improve overall parsing efficiency.
Lecture Notes in Computer Science, 2010
This paper describes an effort towards building a Telugu Dependency Treebank. We discuss the basic framework and the issues we encountered while annotating. 1,487 sentences have been annotated in the Paninian framework. We also discuss how some of the annotation decisions would affect the development of a parser for Telugu. (Hindi is a South Asian language and an official language of India spoken by 300 million people; Telugu is a Dravidian language and an official language of India spoken by 75 million people.)
IAEME PUBLICATION, 2021
Malt and Maximum Spanning Tree (MST) parsers are two popular approaches, and the base parsers, in dependency parsing; they are also known as transition-based and graph-based parsers respectively. Each parser has its own method of constructing a dependency tree. This paper describes approaches for integrating transition-based and graph-based parsers for parsing Telugu sentences. Combining these parsers at learning time is called stacking, and combining them at parsing time is called ensembling. Stacking has two levels, level-0 and level-1. In level-0, a model is trained under the transition-based approach and generates augmented training data, which is used in level-1. The augmented data is then trained with the graph-based parser in level-1. This has shown better results than the base parsers. The ensembled approach uses variations of the base parsers: six variations were built, of which four are transition-based and two are graph-based. Majority, Attardi and Eisner are the three ensembling methods used for evaluating the parsing results. The majority approach outperformed the Attardi and Eisner methods. Different numbers of parsers were used for evaluating performance, and good results were obtained with three variations of the base parsers: Covington projective, Covington non-projective, and non-projective with second-order features. The stacking and ensembling methods have shown improved results compared to the transition-based and graph-based base parsers alone.
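The majority method described above can be sketched as a per-token vote over the head predictions of the base parsers. A minimal illustration with hypothetical predictions (ties broken toward the first parser; note that a plain per-token vote does not guarantee a well-formed tree, which is what the Attardi and Eisner combination methods address by reparsing the weighted graph):

```python
from collections import Counter

def majority_heads(head_lists):
    """Per-token majority vote over head indices from several base parsers."""
    voted = []
    for preds in zip(*head_lists):        # preds = all parsers' heads for one token
        votes = Counter(preds)
        top_count = votes.most_common(1)[0][1]
        tied = [h for h, c in votes.items() if c == top_count]
        # break ties toward the first parser's prediction when possible
        voted.append(preds[0] if preds[0] in tied else tied[0])
    return voted

# Three hypothetical base parsers predicting heads for a 4-token sentence
# (1-based head indices, 0 = root).
ensembled = majority_heads([[2, 0, 2, 3],
                            [2, 0, 4, 3],
                            [2, 0, 2, 2]])
```

With an odd number of base parsers and a fixed tie-break order, the vote is deterministic, which keeps evaluation reproducible across runs.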
Syntactic parsing in NLP is the task of working out the grammatical structure of sentences. Some purely formal approaches to parsing, such as phrase structure grammar and dependency grammar, have been successfully employed for a variety of languages. While phrase-structure-based constituent analysis is possible for fixed-order languages such as English, dependency analysis between grammatical units has been suitable for many free word order languages. These approaches rely on identifying linguistic units based on their formal syntactic properties and establishing the relationships between such units in the form of a tree. Instead, we characterize every morphosyntactic unit as a mapping between form and function along the lines of Construction Grammar, and parsing as the identification of dependency relations between such conceptual units. Our approach to parser annotation shows an average MALT LAS of 82.21% on a Tamil gold-annotated corpus of 935 sentences in a five-fold validation experiment.
ACM Transactions on Asian and Low-Resource Language Information Processing
Building computational resources and tools for under-resourced languages is strenuous for any Natural Language Processing (NLP) task. This paper presents the first dependency parser for an under-resourced Indian language, Nepali. A prerequisite for developing a parser for a language is a corpus annotated with the desired linguistic representations, known as a treebank. With an aim of cross-lingual learning and typological research, we use a Bengali treebank to build a Bengali-Nepali parallel corpus and apply the method of annotation projection from the Bengali treebank to build a treebank for Nepali. With the developed treebank, MaltParser (with all algorithms for projective dependency structures) and a neural-network-based parser have been used to build Nepali parser models. The neural-network-based parser produced state-of-the-art results with 81.2 Unlabeled Attachment Score (UAS), 73.2 Label Accuracy (LA) and 66.1 Labeled Attachment Score (LAS) on the gold test data. The pars...
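Annotation projection of the kind used here can be sketched in its simplest case: dependency heads carried across a 1-to-1 word alignment. A toy illustration, assuming 1-based token indices with 0 for the root; real projection must also handle 1-to-many links and unaligned words, which this sketch merely marks as unknown:

```python
def project_heads(src_heads, align, tgt_len):
    """src_heads: 1-based head index per source token (0 = root).
    align: 1-based source index -> 1-based target index (1-to-1).
    Returns target heads; -1 marks tokens whose head could not be projected."""
    tgt_heads = [-1] * tgt_len
    for s, t in align.items():
        h = src_heads[s - 1]
        if h == 0:                        # source root projects to target root
            tgt_heads[t - 1] = 0
        elif h in align:                  # head word is aligned: follow the link
            tgt_heads[t - 1] = align[h]
    return tgt_heads

# Toy 3-token sentence pair: source heads [2, 0, 2], alignment swaps the
# order of tokens 2 and 3 on the target side.
projected = project_heads([2, 0, 2], {1: 1, 2: 3, 3: 2}, 3)
```

The unprojected (-1) positions are exactly where filtering or heuristic completion is needed before the projected trees can serve as training data for MaltParser or a neural parser.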