Skip to content

UniversalDependencies/UD_Ottoman_Turkish-DUDU

 
 

Repository files navigation

Summary

An Ottoman Turkish dependency treebank annotated in UD style. Created by Enes Yılandiloğlu.

Introduction

This project comprises 1,782 sentences that are firstly automaticaly annotated via machamp (Van der Goot et al., 2021). During the training phase, multiple modern Turkish UD treebanks were used. Subsequently, the sentences were manually corrected. The sentences were written between 14th to 20th century in various genres such as fiction, news, article, registry record, and religious preach. Unfortunately, for this version, the genres can not be told apart by sentence ids. The training set, translated by the contributor of the treebank, is the direct translation of Cairo Cicling Corpus (CCC). In this treebank, Ottoman Turkish transcription alphabet is used.

Acknowledgments

I am immensely grateful to Fatma Elcan for her tremendous help in providing me with sentences.

Changelog

  • 2024-05-15 v2.14
    • Initial release in Universal Dependencies.
=== Machine-readable metadata (DO NOT REMOVE!) ================================
Data available since: UD v2.14
License: CC BY-SA 4.0
Includes text: yes
Parallel: cairo
Genre: news fiction nonfiction poetry
Lemmas: manual native
UPOS: manual native
XPOS: manual native
Features: manual native
Relations: manual native
Contributors: Yılandiloğlu, Enes
Contributing: here
Contact: [email protected]
===============================================================================

About

No description, website, or topics provided.

Resources

License

Contributing

Stars

Watchers

Forks

Packages

No packages published

Contributors 2

  •  
  •