UD_Kyrgyz-KTMU is dependency parsing based treebank in Kyrgyz language. The dataset mostly contains headlines from Kyrgyz news websites.
The treebank consists of 2460 sentences (23K tokens) for now and its domain is mainly news headlines. Kyrgyz UD treebank follows the Universal Dependencies (UD) annotation standard.
We would like to thank all the people who contributed to this corpus: Assoc.Prof.Dr. Bakyt Sharshembaev
An academic paper describing this resource is pending, for the time being please use the repository URL to cite this dataset.
- 2023-05-15 v2.12
- Initial release in Universal Dependencies.
=== Machine-readable metadata (DO NOT REMOVE!) ================================ Data available since: UD v2.12 License: CC BY-SA 4.0 Includes text: yes Parallel: no Genre: news fiction Lemmas: manual native UPOS: manual native XPOS: manual native Features: manual native Relations: manual native Contributors: Benli, İbrahim Contributing: here Contact: [email protected] ===============================================================================