Tanara Zingano Kuhn

University of Coimbra, Celga-Iltec, Researcher

Universidade de Lisboa, Departamento de Linguistica Geral e Românica, Alumna

PhD in Applied Linguistics
Address: Universidade de Coimbra|University of Coimbra
Centro de Estudos de Linguística Geral e Aplicada (CELGA-ILTEC)|Centre for General and Applied Linguistics Studies (CELGA-ILTEC)
Faculdade de Letras • Largo da Porta Férrea • 3004-530 COIMBRA • PORTUGAL
E-mail: [email protected] |[email protected]

Papers by Tanara Zingano Kuhn

State-of-the-art on monolingual lexicography for Brazil (Brazilian Portuguese)

Slovenščina 2.0: Empirične, Aplikativne in Interdisciplinarne Raziskave, Nov 13, 2019

This paper is a minireview of the current status of monolingual lexicography in Brazil. First, a brief contextualization of the origins of Brazilian Portuguese dictionary-making is provided. Then, an account of contemporary monolingual dictionaries is given and a more detailed overview of print, digital, spelling, and school dictionaries is presented. Next, research into dictionary use is reviewed. Finally, the perception among Brazilians with regard to corpora and the use of crowdsourcing in lexicography is discussed.

Data preparation in crowdsourcing for pedagogical purposes: the case of the CrowLL game

by Tanara Zingano Kuhn and Kristina Koppel

Slovenščina 2.0, 2022

One way to stimulate the use of corpora in language education is by making pedagogically appropriate corpora, labeled with different types of problems (sensitive content, offensive language, structural problems). However, manually labeling corpora is extremely time-consuming and a better approach should be found. We thus propose a combination of two approaches to the creation of problem-labeled pedagogical corpora of Dutch, Estonian, Slovene and Brazilian Portuguese: the use of games with a purpose and of crowdsourcing for the task. We conducted initial experiments to establish the suitability of the crowdsourcing task, and used the lessons learned to design the Crowdsourcing for Language Learning (CrowLL) game in which players identify problematic sentences, classify them, and indicate problematic excerpts. The focus of this paper is on data preparation, given the crucial role that such a stage plays in any crowdsourcing project dealing with the creation of language learning resources. We present the methodology for data preparation, offering a detailed presentation of source corpora selection, pedagogically oriented GDEX configurations, and the creation of lemma lists, with a special focus on common and language-dependent decisions. Finally, we offer a discussion of the challenges that emerged and the solutions that have been implemented so far.

O desenho de uma aplicação de MAVL em PLE destinado a aprendentes chineses

by Tanara Zingano Kuhn and Margarita Correia

Entrepalavras, Fortaleza, 2022

This paper presents the design of UVA, a Mobile-assisted Vocabulary Learning (MAVL) application for Portuguese as a Foreign Language (PLE) aimed at Chinese learners. The content of the design is based on research into foreign-language vocabulary teaching and learning (NATION, 1990, 2000; MA, 2006, 2009; BEATTY, 2010a; BEATTY, 2010b; JIANG, 2000) and on an adaptation of the strategies of O’Malley and Chamot (1990) and Oxford (1990a). In addition, the learning process in the application draws on several studies in technology-assisted learning (GOODFELLOW, 2006; LAUFER et al., 2000; GROOT, 2000). UVA is intended to account for the reality of Portuguese vocabulary learning and for the habits and needs of Chinese learners in using MAVL applications. To that end, a survey was administered to 133 Chinese learners, and its results provided essential information for designing an application better suited to the target audience. UVA is structured in five modules: Choice of Vocabulary to Learn; Vocabulary Learning (subdivided into three stages: deduction, consolidation and review); Dictionary; Learning Management; and Social Field. It is a novel resource that seeks to facilitate and make more flexible ...

Crowdsourcing pedagogical corpora for lexicographical purposes

by Tanara Zingano Kuhn and Rina Zviel-Girshin

Proceedings of EURALEX 2020 Conference, Volume II. Komotini: SynMorPhoSe Lab, Democritus University of Thrace, v.2., 2021

Corpora are valuable sources for the development of language learning materials (e.g., books, grammars, dictionaries, exercises), because they contain language as produced in natural contexts. Even though corpora are getting larger, mainly due to crawling data from the web, their pedagogical use remains rather challenging. Not all texts are appropriate for language learning or teaching purposes, as they can contain sensitive or offensive content, in addition to exhibiting structural problems and errors, among other issues. Corpus cleaning for pedagogical purposes is, however, a very time-consuming task if done manually. In this paper we present a new and more effective method for creating problem-labelled pedagogical corpora for a group of languages, namely Portuguese, Serbian, Slovene, Dutch and Estonian, by means of crowdsourcing. First, we report on an experiment aimed at verifying the adequacy of crowdsourcing as a technique for corpus labelling. We then outline the lessons learned and discuss how these have led us to explore an alternative way of compiling pedagogical corpora through gamification.

O Corpus de Português Escrito em Periódicos - CoPEP

DELTA: Documentação de Estudos em Lingüística Teórica e Aplicada, 2020

This study describes the challenges and solutions encountered in compiling the Corpus de Português Escrito em Periódicos (CoPEP), which contains approximately 40 million words, is balanced between the Brazilian and European varieties of Portuguese in number of words, and covers six broad areas of knowledge. First, we present the context in which CoPEP was created, namely the development of an online dictionary of Portuguese for university students, for which it served as the primary source of linguistic evidence. It was thus the characteristics of this lexicographic project that informed the criteria for CoPEP's design and the resulting decisions. Next, we describe the data acquisition methodology, with a special focus on the challenges faced and the solutions found. We conclude with a description of the final compilation phase, in which we applied a series of procedures to achieve balance.

Português como Língua Adicional no Brasil - perfis e contextos implicados (Bulla & Kuhn, 2020)

by Gabriela Bulla and Tanara Zingano Kuhn

Revista Virtual de Estudos da Linguagem - ReVEL, 2020

The field of Portuguese as an Additional Language (PLA) traditionally encompasses issues of education and language policy involving Portuguese for speakers of other languages, that is, in contexts where it is not the language of initial socialization of the student/examinee or of a given community. In this article, we introduce the field of PLA by discussing some terminological variation in the very name of the field in Brazil, and by briefly exploring the audiences and contexts in which PLA professionals can work in terms of teaching, assessment, research, technical-scientific production and language policy.

Português como Língua Adicional: uma entrevista com Marisa Mendonça [2020]

by Gabriela Bulla and Tanara Zingano Kuhn

Revista Virtual de Estudos da Linguagem - ReVEL, 2020

Professor Marisa Mendonça opens this interview with the history of how the field of PLA took shape in Mozambique. She then presents the socio-historical, linguistic and cultural characteristics of the African continent in order to contextualize the specificities and challenges found there in the teaching and learning of PLA. She also reflects on the role of the IILP in the field of PLA and shares her expert opinion on what she considers essential for a curriculum of initial and continuing education for PLA teachers, highlighting the main challenges and issues facing the field in the future. Finally, she offers reading recommendations for those interested in entering this field of study.

STATE-OF-THE-ART ON MONOLINGUAL LEXICOGRAPHY FOR BRAZIL

Slovenščina 2.0, 2019

Zingano Kuhn, Tanara: State-of-the-art on monolingual lexicography for Brazil (Brazilian Portuguese). Slovenščina 2.0, 7 (1): 98-112. This paper is a minireview of the current status of monolingual lexicography in Brazil. First, a brief contextualization of the origins of Brazilian Portuguese dictionary-making is provided. Then, an account of contemporary monolingual dictionaries is given and a more detailed overview of print, digital, spelling, and school dictionaries is presented. Next, research into dictionary use is reviewed. Finally, the perception among Brazilians with regard to corpora and the use of crowdsourcing in lexicography is discussed.

Identification and automatic extraction of good dictionary examples: the case(s) of GDEX

by Kristina Koppel and Tanara Zingano Kuhn

International Journal of Lexicography

Examples have always been an important part of a dictionary entry. As Rundell and Atkins (2008: 454) point out, ‘you sometimes find that an entry is almost incomprehensible without its examples.’ This argument is strengthened by the recent findings of Frankenberg-Garcia (2012, 2014) that several corpus examples can sometimes be even more useful than the definition. ... Selecting examples is a great challenge to lexicographers, not only because they need to find examples that meet criteria of a good dictionary example (criteria may differ depending on the target users) but also because the sources of examples, i.e. corpora, are getting larger and larger, nowadays containing several billion words or more, and it is inconceivable that...

THE IMAGE OF THE MONOLINGUAL DICTIONARY ACROSS EUROPE. RESULTS OF THE EUROPEAN SURVEY OF DICTIONARY USE AND CULTURE

International Journal of Lexicography, 2018

The article presents the results of a survey on dictionary use in Europe, focusing on general monolingual dictionaries. The survey is the broadest survey of dictionary use to date, covering close to 10,000 dictionary users (and non-users) in nearly thirty countries. Our survey covers varied user groups, going beyond the students and translators who have tended to dominate such studies thus far. The survey was delivered via an online survey platform, in language versions specific to each target country. It was completed by 9,562 respondents, over 300 respondents per country on average. The survey consisted of the general section, which was translated and presented to all participants, as well as country-specific sections for a subset of 11 countries, which were drafted by collaborators at the national level. The present report covers the general section.

A Design Proposal of an Online Corpus-Driven Dictionary of Portuguese for University Students (Dissertation abstract)

Journal of Portuguese Linguistics, 2019

The objective of this PhD project was to propose the design of an online corpus-driven dictionary of Portuguese for university students (DOPU), aimed at speakers of Portuguese both as a mother tongue and as an additional language, and covering the Brazilian and European varieties of Portuguese. To that end, the highly innovative semi-automated approach to dictionary-making (Gantar, Kosem and Krek 2016) was adopted, which involves automatic extraction of data from the corpus and its import into a dictionary writing system. As the method had never been applied to lexicographical projects of the Portuguese language, the approach had to be tested for the first time. Thus, all the prerequisites were newly developed, namely a corpus of academic texts, a sketch grammar, a GDEX configuration, and a specially tailored procedure for automatic extraction of data. The experiment indicated that not only can this approach be successfully used as a means to provide lexical content for the design of DOPU, but it can also be beneficial to other lexicographical projects of Portuguese.

The CPLP Corpus: A Pluricentric Corpus for the Common Portuguese Spelling Dictionary (VOC)

by Tanara Zingano Kuhn and Margarita Correia

Proceedings of Euralex 2018, 2018

The Pluricentric Corpus of the Portuguese Language (CPLP Corpus) aims to provide comparable corpora for the national varieties of the countries where Portuguese is an official language, making it possible to undertake corpus-based comparisons among the varieties of these countries. It is intended as a publicly available corpus for comparative linguistics and language resource development, but furthermore constitutes one of the pillars of the Vocabulário Ortográfico Comum da Língua Portuguesa (VOC), the official spelling dictionary for Portuguese. The headword list in VOC is partly derived from lexicographic tradition, which is to date based almost exclusively on the European and Brazilian varieties, and partly made up of words retrieved from the CPLP corpus, many of them included for the first time in official language resources for Portuguese. This double inclusion route aims at presenting an integral (i.e., non-contrastive) and increasingly balanced perspective on all the varieties. This paper describes the general design of the corpus, the challenges faced in its development, as well as the way it was used in the compilation of VOC.

DEVISING A SKETCH GRAMMAR FOR ACADEMIC PORTUGUESE

by Tanara Zingano Kuhn and Iztok Kosem

Slovenščina 2.0: empirical, applied and interdisciplinary research, 2016

This paper presents the development of a new sketch grammar designed specifically for CoPEP, a newly compiled 40-million-word corpus comprising texts from academic journals, tagged with Freeling v3, the default tagger available in the Sketch Engine for corpora of Portuguese. We first provide an overview and evaluation of existing sketch grammars for Portuguese, followed by a detailed description of the development of a new sketch grammar, and the presentation of some of the problems encountered. We conclude by summarizing the main findings, highlighting important implications, and offering suggestions for further improvement of the sketch grammar. More accurate and varied word sketch results than those offered by the current default sketch grammar indicate that our sketch grammar can be used for advanced lexicographic tasks such as automatic extraction of lexical data from CoPEP, the methodology of knowledge acquisition planned for the compilation of a dictionary of Portuguese for university students. Moreover, this new sketch grammar can be used with any other corpus of Portuguese tagged with Freeling v3, which makes it an important resource for lexicographic and corpus linguistic research of the Portuguese language.

Trabalhando gêneros orais em um curso Técnico em Biotecnologia: sugestão de tarefa para estudar a organização interna de uma palestra

Revista Bem Legal, 2015

Resenha Oxford Learner's Dictionary of Academic English

BELT - Brazilian English Language Teaching Journal, 2015

Monolingual dictionaries for learners of an additional language differ from general-language dictionaries in that, as a rule, they seek not only to facilitate text comprehension but also to support text production. The Oxford Learner's Dictionary of Academic English (hereafter OLDAE), being a dictionary for foreign learners (as its name already indicates), shares this characteristic, with the distinguishing feature that the language described here is specific: it is English as used in academic contexts.

VOCABULÁRIO CONTROLADO E REDAÇÃO DE DEFINIÇÕES EM DICIONÁRIOS DE PORTUGUÊS PARA ESTRANGEIROS: ENSAIOS PARA UMA LÉXICO-ESTATÍSTICA TEXTUAL

by Aline Evers, Aline Maciel Pereira, and Tanara Zingano Kuhn

Initial study in lexical-textual statistics that aims at collecting data to support the construction of a basic controlled vocabulary (CV) to be a reference for writing definitions in a Portuguese learner’s dictionary. We used vocabulary frequency data from Brazilian popular newspapers and we also analyzed three different corpora. After comparing the most frequent words of each source, we evaluated the use of CVs to prepare a set of test entries. The results demonstrate the proper use of these corpora for the composition of a CV and the relevance of statistical linguistics for its compilation.

Foregrounding the Development of an Online Dictionary for Intermediate-level Learners of Brazilian Portuguese as an Additional Language: Initial Contributions

Proceedings of Euralex 2012, 2012

The present PhD project intends to contribute to the design of a monolingual online dictionary for intermediate-level learners of Brazilian Portuguese as an additional language. Considering that the development of such a reference work involves the investigation of a series of theoretical-methodological aspects, this research will be narrowed down to one specific issue: the use of simplified Portuguese language patterns in the writing of the definitions. Therefore, the steps to be taken entail a thorough bibliographical review on lexicographical definitions for monolingual learners' dictionaries and the use of defining vocabulary for their writing; Brazilian Portuguese corpus research in order to compile a defining vocabulary list (DVL); and tests with learners to verify which kind of definition (written with or without the use of the DVL) is better for the user. Since pedagogical (meta)lexicography regarding Brazilian Portuguese as an Additional Language (BPAL) is to a fairly large degree still incipient, especially when compared to what has been done in the area of English as a Foreign Language (EFL), this project is expected to make a substantial contribution to new knowledge.

On the proposal of an on-line Brazilian Portuguese dictionary for speakers of Asian languages: an ongoing experiment

Proceedings of ASIALEX 2011, 2011

Marcadores discursivos em conversa no português brasileiro: proposta de atividade didática para nível básico o ensino de português como língua adicional

Atas CONFERÊNCIA DA KALUBS, 2011

Uso de vocabulário controlado em dicionários de português como língua estrangeira em formato on-line: uma experiência em andamento para uso de aprendizes coreanos

Atas do IIISIMELP: A formação de novas gerações de falantes de português no mundo, 2011

State-of-the-art on monolingual lexicography for Brazil (Brazilian Portuguese)

Slovenščina 2.0: Empirične, Aplikativne in Interdisciplinarne Raziskave, Nov 13, 2019

This paper is a minireview of the current status of monolingual lexicography in Brazil. Firstly, ... more This paper is a minireview of the current status of monolingual lexicography in Brazil. Firstly, a brief contextualization of the origins of Brazilian Portuguese dictionary-making is provided. Then, an account of contemporary monolingual dictionaries is given and a more detailed overview on print, digital, spelling, and school dictionaries is presented. Next, research into dictionary use is reviewed. Finally, the perception among the Brazilians with regards to corpora and use of crowdsourcing in lexicography is discussed.

Data preparation in crowdsourcing for pedagogical purposes: the case of the CrowLL game

by Tanara Zingano Kuhn and Kristina Koppel

Slovenščina 2.0, 2022

One way to stimulate the use of corpora in language education is by making pedagogically appropri... more One way to stimulate the use of corpora in language education is by making pedagogically appropriate corpora, labeled with different types of problems (sensitive content, offensive language, structural problems). However, manually labeling corpora is extremely time-consuming and a better approach should be found. We thus propose a combination of two approaches to the creation of problem-labeled pedagogical corpora of Dutch, Estonian, Slovene and Brazilian Portuguese: the use of games with a purpose and of crowdsourcing for the task. We conducted initial experiments to establish the suitability of the crowdsourcing task, and used the lessons learned to design the Crowdsourcing for Language Learning (CrowLL) game in which players identify problematic sentences, classify them, and indicate problematic excerpts. The focus of this paper is on data preparation, given the crucial role that such a stage plays in any crowdsourcing project dealing with the creation of language learning resources. We present the methodology for data preparation, offering a detailed presentation of source corpora selection, pedagogically oriented GDEX configurations, and the creation of lemma lists, with a special focus on common and language-dependent decisions. Finally, we offer a discussion of the challenges that emerged and the solutions that have been implemented so far.

O desenho de uma aplicação de MAVL em PLE destinado a aprendentes chineses

by Tanara Zingano Kuhn and Margarita Correia

Entrepalavras, Fortaleza, 2022

O presente trabalho tem como objetivo apresentar o desenho de uma aplicação1 de Mobile-assisted V... more O presente trabalho tem como objetivo apresentar o desenho de uma
aplicação1 de Mobile-assisted Vocabulary Learning (MAVL) em Português como Língua Estrangeira (PLE) destinada a aprendentes chineses, a UVA. O conteúdo do desenho é baseado em investigações sobre ensino-aprendizagem de vocabulário em língua estrangeira (NATION, 1990, 2000; MA, 2006, 2009; BEATTY, 2010a; BEATTY, 2010b; JIANG, 2000) e na adaptação das estratégias de O’Malley e Chamot (1990)
e Oxford (1990a). Além disso, o processo de aprendizagem na aplicação baseia-se em diversos estudos no âmbito da
aprendizagem assistida por tecnologia (GOODFELLOW, 2006; LAUFER et
al., 2000; GROOT, 2000). Na UVA, pretende-se dar conta da realidade da
aprendizagem de vocabulário de língua portuguesa e dos hábitos e necessidades no uso de aplicações de MAVL dos
aprendentes chineses. Para isso, foi aplicado um inquérito2 a 133 aprendentes chineses, cujos resultados nos ofereceram informação imprescindível para um desenho da aplicação mais adequado ao público-alvo. A estrutura da UVA consiste em cinco módulos: Escolha de Vocabulário a aprender; Aprendizagem de Vocabulário
(subdividido em três etapas: dedução, consolidação e retomada); Dicionário; Administração de Aprendizagem e Campo Social. Trata-se de um recurso inédito que busca facilitar e flexibilizar

Crowdsourcing pedagogical corpora for lexicographical purposes

by Tanara Zingano Kuhn and Rina zviel-girshin

Proceedings of EURALEX 2020 Conference, Volume II. Komotini: SynMorPhoSe Lab, Democritus University of Thrace, v.2., 2021

Corpora are valuable sources for the development of language learning materials (e.g., books, gra... more Corpora are valuable sources for the development of language learning materials (e.g., books, grammars, dictionaries, exercises), because they contain language as produced in natural contexts. Even though corpora are getting larger, mainly due to crawling data from the web, their pedagogical use remains rather challenging. Not all texts are appropriate for language learning or teaching purposes as they can potentially contain sensitive or offensive content, in addition to exhibit structural problems, errors, among other problems. Corpus cleaning for pedagogical purposes is however a very time-consuming task if done manually. In this paper we present a new and more effective method for creating problem-labelled pedagogical corpora for a group of languages, namely Portuguese, Serbian, Slovene, Dutch and Estonian, by means of crowdsourcing. First, we report on an experiment aimed at verifying the adequacy of crowdsourcing as a technique for corpus labelling. We then outline the lessons learned and discuss how these have led us to explore an alternative way of compiling pedagogical corpora through gamification.

O Corpus de Português Escrito em Periódicos - CoPEP

DELTA: Documentação de Estudos em Lingüística Teórica e Aplicada, 2020

O presente estudo tem como objetivo descrever os desafios e soluções encontrados na compilação do... more O presente estudo tem como objetivo descrever os desafios e soluções encontrados na compilação do Corpus de Português Escrito em Periódicos - CoPEP, que contém aproximadamente 40 milhões de palavras, é equilibrado entre as variedades português brasileiro e português europeu em número de palavras e cobre seis grandes áreas de conhecimento. Primeiramente, apresentaremos o contexto de criação do CoPEP, qual seja, a elaboração de um dicionário on-line de português para universitários, para o qual serviu como fonte primária de obtenção de evidências linguísticas. Assim, foram as características desse projeto lexicográfico que informaram os critérios de criação do desenho do CoPEP e as consequentes tomadas de decisão. A seguir, descreveremos a metodologia de aquisição de dados, com foco especial nos desafios enfrentados e nas soluções encontradas. Terminaremos com a descrição da fase final de compilação, na qual aplicamos uma série de procedimentos para obtenção de equilíbrio.

Português como Língua Adicional no Brasil - perfis e contextos implicados (Bulla & Kuhn, 2020)

by Gabriela Bulla and Tanara Zingano Kuhn

Revista Virtual de Estudos da Linguagem - ReVEL, 2020

A área de Português como Língua Adicional (PLA) tradicionalmente abarca questões relativas à educ... more A área de Português como Língua Adicional (PLA) tradicionalmente abarca questões relativas à educação4 e políticas linguísticas5 envolvendo o português para falantes de outras línguas, ou seja, em contextos em que não é a língua de socialização inicial do estudante/examinando ou de determinada comunidade. Neste artigo, apresentamos uma introdução à área de PLA por meio da discussão de algumas variações terminológicas no que tange ao próprio nome da área no Brasil, bem como da breve exploração de públicos e contextos em que profissionais de PLA podem atuar em termos de ensino, avaliação, pesquisa, produção técnico-científica e políticas linguísticas.

Português como Língua Adicional: uma entrevista com Marisa Mendonça [2020]

by Gabriela Bulla and Tanara Zingano Kuhn

Revista Virtual de Estudos da Linguagem - ReVEL, 2020

A professora Marisa Mendonça inicia esta entrevista com a história da constituição da área de PLA... more A professora Marisa Mendonça inicia esta entrevista com a história da constituição da área de PLA em Moçambique. Em seguida, oferece uma apresentação das características sócio-históricas, linguísticas e culturais do continente africano de modo a contextualizar as especificidades e os desafios ali encontrados em relação ao ensino e à aprendizagem de PLA. Também reflete sobre o papel do IILP para a área de PLA e compartilha sua opinião especializada quanto ao que entende ser essencial para um currículo de formação inicial e continuada de professores de PLA, destacando os principais desafios e problemáticas para a área de PLA no futuro. Por fim, nos deixa indicações de leituras para interessados em ingressar nessa área de estudos.

STATE-OF-THE-ART ON MONOLINGUAL LEXICOGRAPHY FOR BRAZIL

Slovenščina 2.0, 2019

Zingano Kuhn, Tanara: State-of-the-art on monolingual lexicography for Brazil (Brazilian Portugue... more Zingano Kuhn, Tanara: State-of-the-art on monolingual lexicography for Brazil (Brazilian Portuguese). Slovenščina 2.0, 7 (1): 98-112. This paper is a minireview of the current status of monolingual lexicography in Brazil. Firstly, a brief contextualization of the origins of Brazilian Portuguese dictionary-making is provided. Then, an account of contemporary monolingual dictionaries is given and a more detailed overview on print, digital, spelling, and school dictionaries is presented. Next, research into dictionary use is reviewed. Finally, the perception among the Brazilians with regards to corpora and use of crowdsourcing in lexicography is discussed.

Identification and automatic extraction of good dictionary examples: the case(s) of GDEX

by Kristina Koppel and Tanara Zingano Kuhn

International Journal of Lexicography

Examples have always been an important part of a dictionary entry. As Rundell and Atkins (2008: 4... more Examples have always been an important part of a dictionary entry. As Rundell and Atkins (2008: 454) point out, ‘you sometimes find that an entry is almost incomprehensible without its examples.’ This argument is strengthened by the recent findings of Frankenberg-Garcia (2012, 2014) that several corpus examples can sometimes be even more useful than the definition. ... Selecting examples is a great challenge to lexicographers, not only because they need to find examples that meet criteria of a good dictionary example (criteria may differ depending on the target users) but also because the sources of examples, i.e. corpora, are getting larger and larger, nowadays containing several billion words or more, and it is inconceivable that...

THE IMAGE OF THE MONOLINGUAL DICTIONARY ACROSS EUROPE. RESULTS OF THE EUROPEAN SURVEY OF DICTIONARY USE AND CULTURE

International Journal of Lexicography, 2018

The article presents the results of a survey on dictionary use in Europe, focusing ongeneral mon... more The article presents the results of a survey on dictionary use in Europe, focusing ongeneral monolingual dictionaries. The survey is the broadest survey of dictionaryuse to date, covering close to 10,000 dictionary users (and non-users) in nearly thirtycountries. Our survey covers varied user groups, going beyond the students andtranslators who have tended to dominate such studies thus far. The survey wasdelivered via an online survey platform, in language versions speciﬁc to each targetcountry. It was completed by 9,562 respondents, over 300 respondents per countryon average. The survey consisted of the general section, which was translated andpresented to all participants, as well as country-speciﬁc sections for a subset of 11countries, which were drafted by collaborators at the national level. The present re-port covers the general section.

A Design Proposal of an Online Corpus-Driven Dictionary of Portuguese for University Students (Dissertation abstract)

Journal of Portuguese Linguistics, 2019

The objective of this PhD project was to propose the design of an online corpus-driven dictionary of Portuguese for university students (DOPU), aimed at speakers of Portuguese both as a mother tongue and as an additional language, and covering the Brazilian and European varieties of Portuguese. To that end, the highly innovative semi-automated approach to dictionary-making (Gantar, Kosem and Krek 2016) was adopted, which involves automatic extraction of data from the corpus and its import into a dictionary writing system. As the method had never been applied in lexicographical projects for the Portuguese language, it was necessary to test the approach for the first time. Thus, all the prerequisites were newly developed, namely a corpus of academic texts, a sketch grammar, a GDEX configuration, and a specially tailored procedure for automatic extraction of data. The experiment indicated that not only can this approach be successfully used as a means to provide lexical content for the design of DOPU, but it can also be beneficial to other lexicographical projects of Portuguese.

The CPLP Corpus: A Pluricentric Corpus for the Common Portuguese Spelling Dictionary (VOC)

by Tanara Zingano Kuhn and Margarita Correia

Proceedings of Euralex 2018, 2018

The Pluricentric Corpus of the Portuguese Language (CPLP Corpus) aims to provide comparable corpora for the national varieties of the countries where Portuguese is an official language, making it possible to undertake corpus-based comparisons among the varieties of these countries. It is intended as a publicly available corpus for comparative linguistics and language resource development, but furthermore constitutes one of the pillars of the Vocabulário Ortográfico Comum da Língua Portuguesa (VOC), the official spelling dictionary for Portuguese. The headword list in VOC is partly derived from lexicographic tradition, which is to date based almost exclusively on the European and Brazilian varieties, and partly made up of words retrieved from the CPLP Corpus, many of them included for the first time in official language resources for Portuguese. This double inclusion route aims at presenting an integral (i.e., non-contrastive) and increasingly balanced perspective on all the varieties. This paper describes the general design of the corpus, the challenges faced in its development, as well as the way it was used in the compilation of VOC.

DEVISING A SKETCH GRAMMAR FOR ACADEMIC PORTUGUESE

by Tanara Zingano Kuhn and Iztok Kosem

Slovenščina 2.0: empirical, applied and interdisciplinary research, 2016

This paper presents the development of a new sketch grammar designed specifically for CoPEP, a newly compiled 40-million-word corpus comprising texts from academic journals, tagged with Freeling v3, the default tagger available in the Sketch Engine for corpora of Portuguese. We first provide an overview and evaluation of existing sketch grammars for Portuguese, followed by a detailed description of the development of a new sketch grammar, and the presentation of some of the problems encountered. We conclude by summarizing the main findings, highlighting important implications, and offering suggestions for further improvement of the sketch grammar. More accurate and varied word sketch results than those offered by the current default sketch grammar indicate that our sketch grammar can be used for advanced lexicographic tasks such as automatic extraction of lexical data from CoPEP, the methodology of knowledge acquisition planned for the compilation of a dictionary of Portuguese for university students. Moreover, this new sketch grammar can be used with any other corpus of Portuguese tagged with Freeling v3, which makes it an important resource for lexicographic and corpus linguistic research of the Portuguese language.

Trabalhando gêneros orais em um curso Técnico em Biotecnologia: sugestão de tarefa para estudar a organização interna de uma palestra

Revista Bem Legal, 2015

Resenha Oxford Learner's Dictionary of Academic English

BELT - Brazilian English Language Teaching Journal, 2015

Monolingual dictionaries for learners of an additional language differ from general-language dictionaries in that, as a rule, they seek not only to facilitate text comprehension but also to assist in text production. The Oxford Learner's Dictionary of Academic English (henceforth OLDAE), being a dictionary for foreign learners (as its name already indicates), shares this characteristic, but with the distinction that the language described here has a specific scope: it is English as used in academic contexts.

VOCABULÁRIO CONTROLADO E REDAÇÃO DE DEFINIÇÕES EM DICIONÁRIOS DE PORTUGUÊS PARA ESTRANGEIROS: ENSAIOS PARA UMA LÉXICO-ESTATÍSTICA TEXTUAL

by Aline Evers, Aline Maciel Pereira, and Tanara Zingano Kuhn

This initial study in lexical-textual statistics aims to collect data to support the construction of a basic controlled vocabulary (CV) to serve as a reference for writing definitions in a Portuguese learner’s dictionary. We used vocabulary frequency data from Brazilian popular newspapers and we also analyzed three different corpora. After comparing the most frequent words of each source, we evaluated the use of CVs to prepare a set of test entries. The results demonstrate the suitability of these corpora for the composition of a CV and the relevance of statistical linguistics for its compilation.

Foregrounding the Development of an Online Dictionary for Intermediate-level Learners of Brazilian Portuguese as an Additional Language: Initial Contributions

Proceedings of Euralex 2012, 2012

The present PhD project intends to contribute to the design of a monolingual online dictionary for intermediate-level learners of Brazilian Portuguese as an additional language. Considering that the development of such a reference work involves the investigation of a series of theoretical-methodological aspects, this research will be narrowed down to one specific issue: the use of simplified Portuguese language patterns in the writing of the definitions. Therefore, the steps to be taken entail a thorough bibliographical review on lexicographical definitions for monolingual learners' dictionaries and the use of defining vocabulary for their writing; Brazilian Portuguese corpus research in order to compile a defining vocabulary list (DVL); and tests with learners to verify which kind of definition (those written with or without the use of the DVL) is better for the user. Since pedagogical (meta)lexicography regarding Brazilian Portuguese as an Additional Language (BPAL) is to a fairly large degree still incipient, especially when compared to what has been done in the area of English as a Foreign Language (EFL), this project is expected to make a substantial contribution to new knowledge.

On the proposal of an on-line Brazilian Portuguese dictionary for speakers of Asian languages: an ongoing experiment

Proceedings of ASIALEX 2011, 2011

Marcadores discursivos em conversa no português brasileiro: proposta de atividade didática para nível básico o ensino de português como língua adicional

Atas CONFERÊNCIA DA KALUBS, 2011

Uso de vocabulário controlado em dicionários de português como língua estrangeira em formato on-line: uma experiência em andamento para uso de aprendizes coreanos

Atas do IIISIMELP: A formação de novas gerações de falantes de português no mundo, 2011

Proposta de desenvolvimento de uma Plataforma On-line de Dicionários de Colocações Acadêmicas

I Congresso de Português como Língua Estrangeira na Columbia University, 2021

Ottaiano, Adriane Orenha; Kuhn, Tanara Zingano; Valencio, Carlos Roberto; Tenório, William

The building of an Online Platform for Monolingual Dictionaries of Academic Collocations in Portuguese and English

by Tanara Zingano Kuhn and Adriane Orenha Ottaiano

56th Linguistics Colloquium, 2020

Pluricentrismo e sistemas de certificação de competências em língua portuguesa – o caso dos estudantes estrangeiros em Portugal

by Tanara Zingano Kuhn, Catarina Gaspar, and Margarita Correia

III Simpósio Internacional de Ensino de Português como Língua Adicional (SINEPLA) - programação e resumos, 2021

Mou, Xiao; Gaspar, Catarina; Correia, Margarita; Kuhn, Tanara Zingano

Gamifying the path to corpus-based pedagogical dictionaries

Electronic lexicography in the 21st century (eLex 2021): Post-editing lexicography. Book of abstracts, 2021

Corpus cleaning for language learning resource development

EUROCALL Conference, 2019

Desenvolvimento de uma configuração GDEX para um corpus de português acadêmico

VII Simpósio Mundial de Estudos de Língua Portuguesa – SIMELP, 2019

Corpus Filtering via Crowdsourcing for Developing a Learner’s Dictionary

Electronic lexicography in the 21st century (eLex 2019): Smart Lexicography, 2019

Ensino de português como língua de acolhimento em Portugal: Análise do material didático Caderno de Formação – propostas de atividades e exercícios

Livro de Resumos VI Jornadas Pedagógicas de Língua Portuguesa, 2019

Crowdsourcing corpus cleaning for language learning - an approach proposal

by Tanara Zingano Kuhn and Rina Zviel-Girshin

3rd enetCollect Annual Meeting, 2019

Introducing CoPEP, the Corpus de Português Escrito em Periódicos (Corpus of Portuguese from Academic Journals)

14th American Association for Corpus Linguistics (AACL) Conference, 2014

Dando corpo às diversas vozes do português: o projeto corpus CPLP

by Tanara Zingano Kuhn and Margarita Correia

II Simpósio Internacional de Ensino de Português Língua Adicional-SINEPLA, 2018

Uma experiência no Curso de Português-Espanhol para Intercâmbio (CEPI): a formação de professoras para contextos on-line e inserção de vídeos explicativos

by Tanara Zingano Kuhn and Kétina Timboni

Salão de Ensino UFRGS, 2018

ABSTRACT: In this report, we present the Curso de Português-Espanhol para Intercâmbio (CEPI) and describe how its latest edition unfolded, highlighting the course's role in training teachers for online contexts and introducing some new contributions to the course. CEPI is a Portuguese course for exchange purposes aimed at Spanish speakers, offered by the Programa de Português para Estrangeiros (PPE) at UFRGS during the holidays to prospective exchange students who will study at UFRGS in the following semester. Because the course precedes the exchange, classes are taught entirely online, with the final meeting held in person so that everyone, teachers and students, can meet face to face. CEPI plays an important role in training teachers for digital contexts, since it is the only PPE extension course in Portuguese as an additional language (PLA) delivered online. For this reason, each edition brings in new undergraduate teachers, supported by more experienced teachers. The course is hosted on Moodle, and its activities take place in other digital environments, such as a Facebook group, Google Drive and Google Hangout. One of the contributions of the latest edition's team was the introduction of videos by the teachers as weekly feedback. This feedback was intended to reorganize the students' activities, give Portuguese language tips and point to study materials and exercises available on Moodle. With this presentation, we seek to reflect on the teaching of PLA in digital contexts, with a view to using technologies that optimize students' studies.

Analisando pacotes lexicais em um corpus multinacional de português acadêmico

by Tanara Zingano Kuhn and Margarita Correia

IX Escola Brasileira de Linguística Computacional (EBRALC2017) e XIV Encontro de Linguística de Corpus (ELC 2017), 2017

The Caderno de Resumos ELC-EBRALC 2017 corresponds to the 1st edition in electronic format. The Caderno de Resumos ELC-EBRALC 2017 is a publication that compiles versions, trimmed by the organizers, of the abstracts of the talks, short courses and workshops, and of the texts of the abstracts of the papers (Portuguese paper short paper reproduced in full. The information in this Caderno de Resumos complements that published on the event website:

Reporting on the development of sketch grammar for academic Portuguese

Caderno de Resumos ELC-EBRALC 2017, 2017

Extended abstract

Experimenting automatic creation of content for a dictionary of academic Portuguese

ELEX 2017. Electronic Lexicography in the 21st Century. Lexicography from Scratch, 2017

Dealing with multiple orthographic standards within a single corpus: the case of Portuguese in the CoPEP corpus

by Tanara Zingano Kuhn and Margarita Correia

9èmes Journées Internationales de la Linguistique de Corpus, 2017. Livret., 2017

Extended abstract

Princípios e parâmetros para o desenho de um dicionário on-line de português para estudantes universitários

CLUL-LINGME - Linguistic Meeting for Young Researchers, 2016

Building a corpus of written academic texts in Portuguese

12th Teaching and Language Corpora Conference (TALC12), 2016

Investigating the use of cohesive devices by advanced learners of German through contrastive inte...

Usando o Sketch Engine para a obtenção de evidências lexicográficas de um corpus de português

Colóquio Comemorativo dos 40 Anos do Centro de Linguística da Universidade do Porto, 2016

O uso de dicionários de português por estudantes universitários

X Fórum de Partilha Linguística, 2015

Português língua pluricêntrica: das políticas às práticas

Português língua pluricêntrica: das políticas às práticas, 2022

At a time when the growing economic value of Portuguese is being assessed and when policymakers have enshrined the use of the term “Portuguese as a pluricentric language”, it is necessary to discuss what pluricentrism consists of, what it means for its speakers, and what implications it has for linguistic and literary research, teacher training, teaching practices and assessment systems. The III Simpósio Internacional sobre o Ensino de Português como Língua Adicional (SINEPLA), with the theme “Português língua pluricêntrica: das políticas às práticas”, held virtually from 16 to 18 June 2021 and organized by CELGA-ILTEC/University of Coimbra, the University of Westminster and the Instituto de Letras of UFRGS, sought to provide an opportunity for reflection on these themes. The texts published in the book “Português língua pluricêntrica: das políticas às práticas” result from papers presented at the Symposium.
Across 17 chapters, the book offers reflections on policies and practices in Portuguese as a pluricentric language, the teaching of PLA for specific purposes and audiences, linguistic reflection, teacher training, the use of literary texts in PLA classes and the Celpe-Bras exam. It is a work that seeks to help the debate on the use of the concept “Portuguese as a pluricentric language” continue to broaden the understanding of the complexity of factors involved in naming the languages one works with and of the possible implications of its use.

Electronic lexicography in the 21st century. Proceedings of the eLex 2019 conference.

by Tanara Zingano Kuhn, Margarita Correia, Maarten Janssen, and Miloš Jakubíček

Proceedings of the eLex 2019 conference. 1-3 October 2019, Sintra, Portugal, 2019

edited by Iztok Kosem, Tanara Zingano Kuhn, Margarita Correia, José Pedro Ferreira, Maarten Janse...

Dicionário de linguística da enunciação

Equipe Roman Jakobson

Introdução aos estudos de Roman Jakobson sobre afasia

Apresentação. Português como língua pluricêntrica nas práticas de profissionais da linguagem participantes do III SINEPLA

by Tanara Zingano Kuhn and Margarita Correia

Português língua pluricêntrica: das políticas às práticas, 2022

ANÁLISE COMPARATIVA DAS EDIÇÕES PORTUGUESA E BRASILEIRA DA OBRA OS LIVROS QUE DEVORARAM O MEU PAI, DE AFONSO CRUZ

by Isabel Garcez and Tanara Zingano Kuhn

HISTÓRIA, CULTURA E POLÍTICA NO MUNDO LUSÓFONO, 2021

In recent years, editorial revision processes have been benefiting from reflections and guidance from linguistics, as the science of language, but also from natural language processing or computational linguistics tools, which can be used to carry out tasks such as corpus analysis, text generation and summarization, translation and paraphrasing, among others.

Developing pedagogically appropriate language corpora through crowdsourcing and gamification

by Rina Zviel-Girshin, Tanara Zingano Kuhn, and Branislava Šandrih

CALL and professionalisation: short papers from EUROCALL 2021

Despite the unquestionable academic interest in corpus-based approaches to language education, the use of corpora by teachers in their everyday practice is still not very widespread. One way to promote the use of corpora in language teaching is by making pedagogically appropriate corpora, labelled with different types of problems (for instance, sensitive content, offensive language, structural problems), so that teachers can select authentic examples according to their needs. Because manually labelling corpora is extremely time-consuming, we propose to use crowdsourcing for this task. After a first exploratory phase, we are currently developing a multimode, multilanguage game in which players first identify problematic sentences and then classify them.

Vocabulário Ortográfico Comum da Língua Portuguesa (VOC)

by Tanara Zingano Kuhn and Gildaris Pandim

Panorama da contribuição do Brasil para a difusão do português. Fundação Alexandre Gusmão. Ministério das Relações Exteriores, 2021

Panorama da contribuição do Brasil para a difusão do português
Description:
This is a reference publication that brings together 33 entries, written by renowned specialists in various fields of knowledge, and 17 testimonials from acclaimed writers, artists and intellectuals, which reveal the importance of Brazilian culture in their development as craftsmen of the word in the Portuguese language.
Editors: Alexandre Pilati | Nelson Viana

One Book, Two Language Varieties

by Isabel Garcez, Anabela Barreiro, and Tanara Zingano Kuhn

Springer, 2020

This paper presents a comparative study of alignment pairs, either contrasting expressions or stylistic variants of the same expression in the European (EP) and the Brazilian (BP) varieties of Portuguese. The alignments were collected semi-automatically using the CLUE-Aligner tool, which allows all pairs of paraphrastic units resulting from the alignment task to be recorded in a database. The corpus used was a children's literature book, Os Livros Que Devoraram o Meu Pai (The Books that Devoured My Father) by the Portuguese author Afonso Cruz, and the Brazilian adaptation of this book. The main goal of the work presented here is to gather equivalent phrasal expressions and different syntactic constructions, which convey the same meaning in EP and BP, and contribute to the optimisation of editorial processes compulsory in the adaptation of texts, but which are suitable for any type of editorial process. This study provides a scientific basis for future work in the area of editing, proofreading and converting text to and from any variety of Portuguese from a computational point of view, namely to be used in a paraphrasing system with a variety adaptation functionality, even in the case of a literary text. We contemplate "challenging" cases, from a literary point of view, looking for alternatives that do not tamper with the imagery richness of the original version.

Proposta de critérios norteadores para produção de manual didático de português brasileiro língua adicional

Bulla, Gabriela S.; Uflacker, Cristina M.; Schlatter, Margarete. Práticas pedagógicas e materiais didáticos para o ensino de Português como Língua Adicional., 2019

Princípios de análise enunciativa de fatos de língua

MA dissertation, 2009

“Who is afraid of language?” and I understood that I was indeed afraid of this thing (what to call it?), the most familiar and the strangest. At the risk of naivety, I chose wonder, and to try to share it. Claudine Normand ACKNOWLEDGEMENTS To Valdir do Nascimento Flores, not only for his great wisdom and excellent supervision, but also for the companionship we developed over these years together; a care that goes beyond the academic circle and shows that theory and friendship can go hand in hand. To my family, especially my parents Elisabeth and Egídio, for their unconditional love, affection and support, even from a distance, and to my sister Ananda, for a friendship so important in my life.

A Design Proposal of an Online Corpus-Driven Dictionary of Portuguese for University Students

PhD Thesis, 2017

I also thank my mother-in-law, Tila, and father-in-law, Zé António, for their attention, support and care. Special thanks go to Andrew Swearingen for proofreading the manuscript. Last but not least, I would like to express my sincere appreciation to the Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (Capes-Brazil) for the PhD scholarship, to CELGA-ILTEC (University of Coimbra) for funding essential elements of my PhD research, such as scripts, to the European Cooperation for Science and Technology (COST) through the European Network of e-Lexicography (ENeL) Action for a Short-Term Scientific Mission, and to the University of Ljubljana for granting me licence to use the tools required for developing my PhD research.