This is a treebank of Haitian creole. It contains 144 sentences selected from 3 major genres: bible, literary texts, newspapers.
Kreyòl (Kreyòl Ayisyen, Haitian Creole, iso-639-1: ht) is the main language of Haïti. The dialect described here is the Cap Haïtien dialect which differs slightly in its lexicon with Center and South varieties.
This treebank contains a selection of sentences from the following sources:
- the bible in Haitian creole
- extracts of a novel: Roy (2021) "Lanmou titato"
- newspaper texts from "VOA kreyol" and "PAPDA"
The corpus contains 144 sentences and 3418 tokens. The annotation was done in ArboratorGrew in the SUD format and automatically converted to the UD format..
This treebank is the outcome of a Master internship project by Sandra Jagodzińska (LACITO, CNRS, France) and Claudel Pierre-Louis (LISN, Université Paris-Saclay, CNRS, France). It was funded by:
- an AIP project at the LISN laboratory et the Paris-Saclay University
- the French ANR AUTOGRAMM project (ANR-21-CE38-0017)
- Sandra Jagodzińska, Claudel Pierre-Louis, Sylvain Kahne, Agata Savary, Emmanuel Schang (submitted) Le premier corpus arboré en créole haïtien, in Journée d’études "Le créole haïtien : histoire, évolution, grammaire et lexique", Université d'Etat d'Haïti
- 2024-05-15 v2.14
- Review of some POS tags, mainly DET vs PRON
- 2023-11-15 v2.13
- Initial release in Universal Dependencies.
=== Machine-readable metadata (DO NOT REMOVE!) ================================ Data available since: UD v2.13 License: CC BY-SA 4.0 Includes text: yes Parallel: no Genre: grammar-examples Lemmas: manual native UPOS: manual native XPOS: not available Features: manual native Relations: converted from manual Contributors: Pierre-Louis, Claudel; Jagodzińska, Sandra; Kahane, Sylvain; Savary, Agata; Schang, Emmanuel Contributing: here Contact: [email protected] ===============================================================================