Discovering Collocations in Modern Greek Language

CHRISTOS SKOURLAS

2004

Abstract

In this paper two statistical methods for extracting collocations from text corpora written in Modern Greek are described: the mean and variance method and a method based on the χ² test. The mean and variance method calculates distances ("offsets") between words in a corpus and looks for specific patterns of distance. The χ² test is combined with the formulation of a null hypothesis H₀ for a sample of occurrences, and we check whether there are associations between the words. The χ² test does not assume that the words in the corpus have normally distributed probabilities and hence seems to be more flexible. The two methods extract interesting collocations that are useful in various applications, e.g. computational lexicography, language generation and machine translation.
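As a rough illustration of the mean and variance method, the sketch below collects the signed offsets between two words inside a fixed window and reports their mean and sample standard deviation. It is a minimal sketch, not the paper's implementation: the function names `offsets` and `offset_stats`, the default window size and the toy token list are all assumptions made for this example.

```python
from statistics import mean, stdev


def offsets(tokens, w1, w2, window=5):
    """Signed distances (j - i) from each occurrence of w1 to every
    occurrence of w2 that lies at most `window` tokens away."""
    found = []
    for i, tok in enumerate(tokens):
        if tok != w1:
            continue
        lo = max(0, i - window)
        hi = min(len(tokens), i + window + 1)
        for j in range(lo, hi):
            if j != i and tokens[j] == w2:
                found.append(j - i)
    return found


def offset_stats(tokens, w1, w2, window=5):
    """Mean and sample standard deviation of the offsets, or None when
    fewer than two co-occurrences are available."""
    d = offsets(tokens, w1, w2, window)
    if len(d) < 2:
        return None
    return mean(d), stdev(d)


# Toy, made-up token sequence (not taken from any corpus).
toy = ["a", "b", "c", "a", "b", "d", "a", "x", "b"]
print(offset_stats(toy, "a", "b", window=1))
```

A pair whose offsets have a small standard deviation tends to occur at a fixed relative position, which is the kind of pattern the method looks for; a large deviation suggests the two words merely share a context.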

Natural Language Understanding and Cognitive Science, 2004

In this paper we describe and apply two statistical methods for extracting collocations from text corpora written in Modern Greek. The first is the mean and variance method, which calculates "offsets" (distances) between words in a corpus and looks for patterns of distances with low spread. The second method is based on the χ² test. This approach seems to be more flexible because it does not assume normally distributed probabilities of the words in the corpus. The two techniques produce interesting collocations that are useful in various applications, e.g. computational lexicography, language generation and machine translation.
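The χ² test for a word pair can be sketched with the standard Pearson statistic for a 2×2 contingency table of adjacent-word counts, under the null hypothesis that the two words occur independently. This is an illustrative sketch only: the helper names `bigram_table` and `chi_square_2x2` and the toy token list are assumptions for this example, and the paper's own counting scheme may differ.

```python
# Critical value of the chi-square distribution with 1 degree of
# freedom at significance level 0.05.
CRITICAL_0_05 = 3.841


def bigram_table(tokens, w1, w2):
    """Observed 2x2 counts over adjacent token pairs:
    (w1 w2, w1 not-w2, not-w1 w2, not-w1 not-w2)."""
    o11 = o12 = o21 = o22 = 0
    for a, b in zip(tokens, tokens[1:]):
        if a == w1 and b == w2:
            o11 += 1
        elif a == w1:
            o12 += 1
        elif b == w2:
            o21 += 1
        else:
            o22 += 1
    return o11, o12, o21, o22


def chi_square_2x2(o11, o12, o21, o22):
    """Pearson chi-square statistic for a 2x2 table."""
    n = o11 + o12 + o21 + o22
    denom = (o11 + o12) * (o11 + o21) * (o12 + o22) * (o21 + o22)
    if denom == 0:
        raise ValueError("a row or column total is zero")
    return n * (o11 * o22 - o12 * o21) ** 2 / denom


# Toy, made-up token sequence (not taken from any corpus).
toy = ["a", "b", "a", "b", "c", "d"]
score = chi_square_2x2(*bigram_table(toy, "a", "b"))
print(score, score > CRITICAL_0_05)
```

When the statistic exceeds the critical value, the null hypothesis of independence is rejected at the 5% level and the pair becomes a collocation candidate. With counts as small as those in the toy list the χ² approximation is unreliable, so the example shows the mechanics only.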
