Building an annotated corpus for Amazighe

Mohamed Outahajala

Building an annotated corpus for Amazighe

Mohamed Outahajala

2013

visibility

…

description

13 pages

link

1 file

Sign up for access to the world's latest research

checkGet notified about relevant papers

checkSave papers to use in your research

checkJoin the discussion with peers

checkTrack your impact

Abstract

This paper gives an overview of the morpho-syntactic features of the Amazighe language and corpus encoding, afterwards we present our experience of constructing an annotated corpus with part-of-speech (POS) information. The annotated corpora consist of 20,667 Moroccan Amazighe tokens chosen from different materials; it is to our knowledge the first one dealing with Amazighe language. The experience is also meant to give a handle on the encoding and tagging processes of the aforementioned corpus.

Mohamed Outahajala

2014

Log In

Building an annotated corpus for Amazighe

Sign up for access to the world's latest research

Abstract

Related papers

Related papers