Skip to main content
The availability of partially overlapping parallel corpora for a language pair opens up opportunities for automatically comparing, evaluating and improving them. We compare and evaluate the alignment quality of two English-Estonian... more
    • by 
    •   2  
      Corpus LinguisticsParallel Corpora
Resumen: Los corpus de textos son herramientas de larga tradición y numerosas aplicaciones. De todos los tipos existentes, este trabajo se centra en uno en concreto: el corpus paralelo alineado. Tomando como punto de partida un corpus... more
    • by 
    •   6  
      Translation StudiesMuseum StudiesCorpus LinguisticsParallel Corpora
Rule Based Machine Translation (RBMT) and Statistical Machine translation (SMT) have different approach in performing translation task. RBMT uses linguistic rule between two languages which is built manually by human in general, whereas... more
    • by 
    •   4  
      Machine TranslationStatistical Machine TranslationParallel CorporaAdvanced
The aim of this paper is to investigate Polish equivalents of English phrasal verbs as found in an English-Polish (E-P) parallel corpus PHRAVERB. Given the semantic idiosyncrasy exhibited by phrasal verbs, it is assumed that the... more
    • by 
    •   4  
      LexicographyPhrasal VerbsParallel CorporaEquivalence in Translation
In this paper we present a method for term extraction that can be used in classroom with translation students. The terms are extracted from a multilingual parallel corpus with the aid of a parallel concordancer, AntPConc. Our work is... more
    • by 
    •   4  
      Translation StudiesTerminologyParallel CorporaMedical Terminology
Rafael guzmán tiRado iRina a. VotyakoVa (ed.) gRanada 2013 tipología léxica cualquier forma de reproducción, distribución, comunicación pública o transformación de esta obra sólo puede ser realizada con la autorización de sus titulares,... more
    • by 
    •   4  
      Corpus LinguisticsLinguistic TypologyParallel CorporaLexical Typology
5η Συνάντηση Ελληνόφωνων Μεταφρασεολόγων, ΑΠΘ, 21-23/5/2015 In this paper we present a method for term extraction that can be used in classroom with translation students. The terms are extracted from a multilingual parallel corpus with... more
    • by  and +1
    •   4  
      TerminologyParallel CorporaMedical TerminologyMedical translation
The paper discusses the main trends in the development of the parallel corpora within the RNC since 2015. The New languages section deals with seven new language pairs that emerged during this period, their architecture and tagging.... more
    • by 
    •   4  
      Parallel CorporaAnnotation of corporaPluperfectBilingual Corpora and Translation
The article presents the analysis of etiquette formulas (forms of address, greetings and farewells) used between teachers of Russian as a foreign language and students studying Russian outside Russia. the survey was conducted among 100... more
    • by 
    •   6  
      Russian LanguageForms of addressParallel CorporaLacunarity
    • by 
    •   8  
      Spanish LinguisticsItalian (Languages And Linguistics)Modern Greek LanguageParallel Corpora
"This article presents a corpus-based study of the metaphorical and metonymical use of the words "head" and "heart," together with the Norwegian correspondents "hode" and "hjerte." The continuum between metaphor and metonymy is explored,... more
    • by 
    •   4  
      MetaphorMetonymyParallel CorporaENPC
摘要: 最近几十年,语料库语言学已成为现代应用语言学的支柱。因此,本文的宗旨是更深入地探讨语料库建设的一些认知性和操作性的步骤,以便把语料库观念向广大的研究人员推广。本文主要分为三个部分: 1. 语料库建设:理论与实践 2. 语料文本的加工层面 3. 语料格式属性的标注... more
    • by 
    •   7  
      Corpus LinguisticsArabic Corpus LinguisticsCorpus Linguistics and Translation StudiesCorpus-Based Translation Studies
В статье рассматривается на корпусном материале русская конструкция типа пошёл было в сопоставлении с белорусским плюсквамперфектом (форма типа пайшоў быў). Выявлены некоторые особенности менее изученной белорусской формы -прежде всего,... more
    • by 
    •   6  
      Slavic LanguagesBelarusian StudiesRussian LanguageTense and Aspect Systems
    • by 
    •   2  
      Corpus LinguisticsParallel Corpora
This paper presents a bilingual corpus-based study of the use of several nouns meaning ‘time’ or time units (‘hour’, ‘minute’, ‘moment’) in Bulgarian and Ukrainian. All matching instances of these words in a collection of parallel texts... more
    • by 
    •   5  
      Translation StudiesTranslationBulgarian LanguageParallel Corpora
    • by 
    •   5  
      Word alignmentParallel CorporaHybrid ApproachEdit Distance
Translation is a profession highly connected to technology, and for this reason, most of today's translators are in contact with a variety of tools, services and programs, such as word processors, e-mail, electronic dictionaries, among... more
    • by  and +1
    •   5  
      UsabilityErgonomicsCorpus Linguistics and Translation StudiesCorpus-Based Translation Studies
In this thesis we describe and evaluate a tool for automatic generation of translations for multiword English terms into Spanish from a monolingual specialized Spanish corpus, compiled by means of web crawling. The resulting translations... more
    • by 
    •   15  
      Machine TranslationTerminologyLexical SemanticsLexicography
The paper reports on a study based on the data drawn from such a corpus. The aim of the study was to find and examine the closest Polish translation equivalents of two semantically related verbs in Czech. The author starts with the... more
    • by 
    •   3  
      Parallel CorporaPolish LanguageCzech language
This paper concentrates on the verbal moods used after Spanish adverbs expressing potentiality (quizá(s), tal vez, probablemente, posiblemente). With the use of the corpus CREA, we sought to determine whether there is a preference for... more
    • by 
    •   10  
      Translation StudiesSpanishModalityCorpus Linguistics
We report on a project to annotate biblical texts in order to create an aligned multilingual Bible corpus for linguistic research, particularly computational linguistics, including automatically creating and evaluating translation... more
    • by 
    •   5  
      Cognitive ScienceComputational linguistic phylogeneticsParallel CorporaCorpus Annotation
    • by 
    •   2  
      Machine TranslationParallel Corpora
We present HindEnCorp, a parallel corpus of Hindi and English, and HindMonoCorp, a monolingual corpus of Hindi in their release version 0.5. Both corpora were collected from web sources and preprocessed primarily for the training of... more
    • by 
    •   4  
      Computational LinguisticsMachine TranslationCorporaParallel Corpora
La lingüística histórica, en su camino hacia la consagración como disciplina autónoma, no ha podido, o no ha querido, distanciarse de las corrientes anejas que transitan y evolucionan en el seno de una lingüística más general y... more
    • by 
    •   7  
      Translation StudiesHistorical LinguisticsCorpus LinguisticsTraducción
We discuss the elative adjectival prefix _pre-_ in Bulgarian and Ukrainian, variously treated as derivative or inflexional by grammarians and lexicographers. Our investigation, performed on a bilingual corpus of parallel texts, shows that... more
    • by  and +1
    •   8  
      ReduplicationBulgarian LanguageParallel CorporaEvaluative morphology
Canonical question tags feature prominently in spoken English, where they display great versatility. At face value they are meant to elicit a response from a co-participant in the form of (dis)agreement with the proposition to which the... more
    • by 
    •   4  
      PragmaticsTag QuestionsParallel CorporaContrastive Linguistics
يُعْتَبَر علم الذخائر اللغوية من العلوم اللغوية التأسيسية التي تُرَسِّخْ مفهوم دراسة اللغة في بيئتها الطبيعية، بعيدًا عن القياس اللغوي المنطقي الذي ساد في حقل الدراسات اللغوية قرونًا عدة. إن علم الذخائر اللغوية، الذي أَسَّسَ له عالم اللغة... more
    • by 
    •   5  
      Natural Language ProcessingApplied LinguisticsChinese Language and CultureCorpus Linguistics
We report on a project to annotate biblical texts in order to create an aligned multilingual Bible corpus for linguistic research, particularly computational linguistics, including automatically creating and evaluating translation... more
    • by 
    •   5  
      Cognitive ScienceComputational linguistic phylogeneticsParallel CorporaCorpus Annotation
The Algerian Arabic dialects are under-resourced languages, which lack both corpora and Natural Language Processing (NLP) tools, although they are increasingly used in written form, especially on social media and forums. We aim through... more
    • by  and +3
    •   3  
      Statistical Machine TranslationArabic DialectsParallel Corpora
Accessing historical texts is often a challenge because readers either do not know the historical language, or they are challenged by the technological hurdle when such texts are available digitally. Merging corpus linguistic methods and... more
    • by 
    •   9  
      Digital HumanitiesHistorical LinguisticsVisualizationComputational Linguistics
The sentences in the RNC are aligned sentence -by -sentence. The texts kindly offered for the use in the RNC by Adrian Barentsen and included into the Amsterdam Slavic Parallel Aligned Corpus multilingual corpus are already aligned... more
    • by 
    •   4  
      Slavic LanguagesCorpus LinguisticsRussian LanguageParallel Corpora
Contrastive methods have long been employed in lexicography, in particular in bi-and multilingual dictionary projects. The main rationale for this is the necessity to comprehensively study, i.e. compare and contrast, two or more... more
    • by 
    •   31  
      LexicologyVocabularyTerminologyConceptual Modelling
    • by 
    •   2  
      Machine TranslationParallel Corpora
iii
    • by  and +1
    •   4  
      Bulgarian LanguageParallel CorporaLithuanian languagePolish Language
The ACTRES Parallel Corpus (P-ACTRES 2.0) is a bidirectional English-Spanish corpus developed by ACTRES research group. P-ACTRES 2.0 contains over 4 million words both directions. From original English texts to their Spanish translations,... more
    • by 
    •   4  
      SpanishEnglishComparable CorporaParallel Corpora
Automatic extraction of bilingual lexicons from parallel corpora has been recently exploited to overcome the knowledge acquisition bottleneck in a number of research areas in natural language processing, such as machine translation (MT)... more
    • by 
    •   4  
      Natural Language ProcessingComputational LinguisticsParallel CorporaArabic Machine Translation
This paper describes the first phase of the CEXI project at the University of Bologna in Forlì, involving the selection of the texts to be included in the corpus and decisions about the processing of these texts. The aim of the project is... more
    • by 
    •   3  
      TranslationParallel CorporaCorpus-Based Translation
The paper describes semantic properties of Perfect forms in European languages exemplified by a massive parallel corpus. A NeighbourNet distance graph for European Perfects is built. In a separate section, the English Perfect in the... more
    • by 
    •   3  
      Perfect TenseParallel CorporaEnglish Perfect Tenses
In this study we examine the occurrences and correspondences of terms for blood kinship in a Bulgarian–Ukrainian parallel corpus of fiction. All instances of the terms selected for study, matching and non-matching, were located and... more
    • by  and +1
    •   11  
      Translation StudiesSemanticsSlavic LanguagesCorpus Linguistics
reading and commenting on a draft of this paper. 2 There is no published account on this corpus; for an example of work with it, see 3 See
    • by 
    •   4  
      Translation StudiesCorpus LinguisticsSlavic LinguisticsParallel Corpora
У збірнику вміщені дослідження з актуальних проблем комп'ютерної лінгвістики. Для викладачів, науковців, учителів, студентів. This volume presents investigations on topical issues in Computational Linguistics. It is intended for... more
    • by  and +1
    •   5  
      Corpus LinguisticsLinguistic TypologyUkrainian LingusticsParallel Corpora
The present paper is about the project of Russian Learner Translator Corpus, which is currently under development. The paper discusses the feasibility of such a corpus and existing analogues, describes the current status of corpus... more
    • by  and +1
    •   6  
      Translation StudiesCorpus LinguisticsLearner corporaParallel Corpora
This paper presents a comparative bilingual corpus-based study of the use of several frequent temporal adverbs and adverbial expressions (‘always’, ‘sometimes’, ‘never’ and their synonyms) in Bulgarian and Ukrainian. The Ukrainian items... more
    • by  and +1
    •   11  
      Translation StudiesSemanticsSlavic LanguagesCorpus Linguistics
This study will examine the prefixed derivates from the verb of motion (VoM) ходить and analyse their translations to German by focusing on the problem of determining the correct meaning of individual forms and possible irregularities in... more
    • by 
    •   7  
      Translation StudiesCorpus LinguisticsRussian LanguageTense and Aspect Systems
This paper presents a comparison between Russian prefixed verbs of memory and their Italian equivalent. In particular, analysing a Russian-Italian parallel corpus, we observed the strategies used for the translation of these verbs from... more
    • by 
    •   3  
      Russian LanguageParallel CorporaContrastive Linguistics
Research in the Humanities is predominantly text-based. For centuries scholars have studied documents such as historical manuscripts, literary works, legal contracts, diaries of important personalities, old tax records etc. Manual... more
    • by 
    •   2  
      Parallel CorporaCorpus Annotation
The article provides information on the development of corpus linguistics in Belarus and Poland. The importance of creating parallel Belarusian-Polish and Polish-Belarusian parallel corpora is noted, the possible algorithm for building... more
    • by  and +1
    •   6  
      Parallel CorporaPolish LanguageBelarusian languageParallel Corpus
The article deals with diminutive adjectives, numerals, pronouns and adverbs in a parallel bilingual corpus of Bulgarian and Ukrainian texts. We address some theoretical questions regarding the category of diminutivity in both languages.... more
    • by  and +1
    •   7  
      Slavic LanguagesBulgarian LanguageDiminutivesParallel Corpora
In this paper we describe an alignment system that aligns English-Hindi texts at the sentence and word level in parallel corpora. We describe a simple sentence length approach to sentence alignment and a hybrid, multi-feature approach to... more
    • by 
    •   5  
      Word alignmentParallel CorporaHybrid ApproachEdit Distance
The present Ph. D. thesis deals with the so-called grey areas that can be found within the Spanish modal system. In these areas, two different types of modality (modal meanings) can occur. We study the relationship that can be found... more
    • by 
    •   12  
      SpanishRomance philologyModalityCorpus Linguistics