


default search action
VarDial@COLING 2020: Barcelona, Spain (Online)
- Marcos Zampieri, Preslav Nakov, Nikola Ljubesic, Jörg Tiedemann, Yves Scherrer:

Proceedings of the 7th Workshop on NLP for Similar Languages, Varieties and Dialects, VarDial@COLING 2020, Barcelona, Spain (Online), December 13, 2020. International Committee on Computational Linguistics (ICCL) 2020, ISBN 978-1-952148-47-7 - Mihaela Gaman, Dirk Hovy, Radu Tudor Ionescu, Heidi Jauhiainen, Tommi Jauhiainen, Krister Lindén, Nikola Ljubesic, Niko Partanen, Christoph Purschke, Yves Scherrer, Marcos Zampieri:

A Report on the VarDial Evaluation Campaign 2020. 1-14 - Iuliia Nigmatulina, Tannon Kew, Tanja Samardzic:

ASR for Non-standardised Languages with Dialectal Variation: the case of Swiss German. 15-24 - Janine Siewert, Yves Scherrer, Martijn Wieling, Jörg Tiedemann:

LSDC - A comprehensive dataset for Low Saxon Dialect Classification. 25-35 - Amirhossein Tebbifakhr, Matteo Negri, Marco Turchi:

Machine-oriented NMT Adaptation for Zero-shot NLP tasks: Comparing the Usefulness of Close and Distant Languages. 36-46 - Michael Gasser, Binyam Ephrem Seyoum, Nazareth Amlesom Kifle:

Character Alignment in Morphologically Complex Translation Sets for Related Languages. 47-56 - Bharathi Raja Chakravarthi, Navaneethan Rajasekaran, Mihael Arcan, Kevin McGuinness, Noel E. O'Connor, John P. McCrae:

Bilingual Lexicon Induction across Orthographically-distinct Under-Resourced Dravidian Languages. 57-69 - Sina Ahmadi:

Building a Corpus for the Zaza-Gorani Language Family. 70-78 - Ainara Estarrona, Izaskun Etxeberria, Ricardo Etxepare, Manuel Padilla-Moyano, Ander Soraluze:

Dealing with dialectal variation in the construction of the Basque historical corpus. 79-89 - Chahan Vidal-Gorène, Victoria Khurshudyan, Anaïd Donabédian-Demopoulos:

Recycling and Comparing Morphological Annotation Models for Armenian Diachronic-Variational Corpus Processing. 90-101 - Maja Popovic, Alberto Poncelas, Marija Brkic, Andy Way:

Neural Machine Translation for translating into Croatian and Serbian. 102-113 - Sina Ahmadi:

A Tokenization System for the Kurdish Language. 114-127 - Badr M. Abdullah, Jacek Kudera, Tania Avgustinova, Bernd Möbius

, Dietrich Klakow:
Rediscovering the Slavic Continuum in Representations Emerging from Neural Models of Spoken Language Identification. 128-139 - Aleksandra Miletic, Myriam Bras, Marianne Vergez-Couret, Louise Esher, Clamença Poujade, Jean Sibille:

A Four-Dialect Treebank for Occitan: Building Process and Parsing Experiments. 140-149 - Andrea Zugarini, Matteo Tiezzi, Marco Maggini:

Vulgaris: Analysis of a Corpus for Middle-Age Varieties of Italian Language. 150-159 - Alyssa Hwang, William R. Frey, Kathleen R. McKeown:

Towards Augmenting Lexical Resources for Slang and African American English. 160-172 - Tommi Jauhiainen, Heidi Jauhiainen, Niko Partanen, Krister Lindén:

Uralic Language Identification (ULI) 2020 shared task dataset and the Wanca 2017 corpora. 173-185 - Çagri Çöltekin:

Dialect Identification under Domain Shift: Experiments with Discriminating Romanian and Moldavian. 186-192 - Cristian Popa, Vlad Stefanescu:

Applying Multilingual and Monolingual Transformer-Based Models for Dialect Identification. 193-201 - Yves Scherrer, Nikola Ljubesic:

HeLju@VarDial 2020: Social Media Variety Geolocation with BERT Models. 202-211 - Petru Rebeja, Dan Cristea:

A dual-encoding system for dialect classification. 212-219 - Tommi Jauhiainen, Heidi Jauhiainen, Krister Lindén:

Experiments in Language Variety Geolocation and Dialect Identification. 220-231 - George-Eduard Zaharia, Andrei-Marius Avram, Dumitru-Clementin Cercel, Traian Rebedea:

Exploring the Power of Romanian BERT for Dialect Identification. 232-241 - Mihaela Gaman, Radu Tudor Ionescu:

Combining Deep Learning and String Kernels for the Localization of Swiss German Tweets. 242-253 - Fernando Benites, Manuela Hürlimann, Pius von Däniken, Mark Cieliebak:

ZHAW-InIT - Social Media Geolocation at VarDial 2020. 254-264 - Andrea Ceolin, Hong Zhang:

Discriminating between standard Romanian and Moldavian tweets using filtered character ngrams. 265-272 - Gabriel Bernier-Colborne, Cyril Goutte:

Challenges in Neural Language Identification: NRC at VarDial 2020. 273-282 - Piyush Mishra:

Geolocation of Tweets with a BiLSTM Regression Model. 283-289

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID














