This treebank consists of contemporary written Sinhala text taken from a 10M corpus maintained by UCSC, Sri Lanka. The corpus contains novels, short stories, Sinhala translations, critiques and Sinhala newspapers.
Sinhala is an Indo-Aryan language spoken by about 20 million people around the world. It is one of the two official languages of Sri Lanka and is spoken by 75% of its population. In addition to Sanskrit and Pali, Sinhala has been influenced by Portuguese, Dutch, English and Tamil.
Gunasekara, A. M. (1891). A Comprehensive Grammar of the Sinhalese Language. Godage International Publishers, Sri Lanka.
Karunatillake, W. S. (2009). Sinhala bhasha vyakaranaya. M. D. Gunasena & Co. Ltd, Sri Lanka.
Kumarathunga, M. (1993). kriya viwaranaya. M.D. Gunasena & Company Limited, Sri Lanka.
Kumarathunga, M. (2000). vyakarana vivaranaya. S. Godage & Brothers, Sri Lanka.
Sumanasara, T. (2007). Sinhala Bhashave Vyakaranaya. Wijesooriya Grantha Kendraya, Sri Lanka.
Sumangala, H. (1937). Sinhala vyakarana pari:kshanaya. D. C. Karunanayaka, Sri Lanka.
- 2021-11-07 v2.11
- Initial release in Universal Dependencies.
=== Machine-readable metadata (DO NOT REMOVE!) ================================
Data available since: UD v2.11
License: CC BY-SA 4.0
Includes text: yes
Parallel: no
Genre: fiction government news nonfiction web
Lemmas: manual native
UPOS: manual native
XPOS: manual native
Features: manual native
Relations: manual native
Contributors: Chamila, Liyanage; Kengatharaiyer, Sarveswaran
Contributing: elsewhere
Contact: [email protected], [email protected]
===============================================================================