A Computational Model of Finnish Sentence Structure

Aarno Lehtola

1983

Abstract

A parser based on this model is being implemented as a component of a larger system, namely a natural language data base interface. There it will follow a component of morphological analysis (see Jäppinen et al. [83]); hence, throughout the present paper it is assumed that all relevant morphological and lexical information is computationally available for all words in a sentence. Even though we have a data base application in mind, sentence analysis will be based on general linguistic knowledge. All application-dependent inferences are left to subsequent modules, which are not discussed here.
