Academia.edu no longer supports Internet Explorer.
To browse Academia.edu and the wider internet faster and more securely, please take a few seconds to upgrade your browser.
2021
…
7 pages
1 file
English. The paper describes the aim and structure of a new freely accessible resource-ListTyp: A typological database of listing patterns-with a focus on methodological aspects, encoded information and search functions.
2000
We present the goals and architecture of the Typological Database System, a project for the creation of a unified interface to numerous independently developed typological databases. The aim of the project is to develop a software system that allows a user to simultaneously query different databases through a single interface. The challenge of the project lies in the variability of
2022
HAL is a multidisciplinary open access archive for the deposit and dissemination of scientific research documents, whether they are published or not. The documents may come from teaching and research institutions in France or abroad, or from public or private research centers. L'archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des établissements d'enseignement et de recherche français ou étrangers, des laboratoires publics ou privés.
PROCEEDINGS OF THE NATIONAL CONFERENCE ON ARTIFICIAL …
Using language technology for text analysis and light-weight ontologies as a content-mediating level, we acquire indexing patterns from vast amounts of indexing data for Englishlanguage medical documents. This is achieved by statistically relating interlingual representations of these documents (based on text token bigrams) to their associated index terms. From these 'English' indexing patterns, we then induce the associated index terms for German and Portuguese documents when their interlingual representations match those of English documents. Thus, we learn from past English indexing experience and transfer it in an unsupervised way to non-English texts, without ever having seen concrete indexing data for languages other than English.
2016
Abstract: Indexing languages have traditionally been an essential tool for organizing and retrieving documental information. The inclusion of indexing languages into the digital environment leads to new frontiers, but also new opportunities. This study shows the historical evolution of the indexing languages and its application in document management field. We analyze diverse trends for their digital use from two perspectives: their integration with other digital and linguistic resources, and the adjustment of them into the Web environment. Finally, there is an analysis of how these languages are used in the Web 2.0 and the incorporation of ontologies in the Semantic Web.
Lexicographica, 2015
Theoretical lexicographers have developed a range of elaborate structures to describe the arrangement of data inside dictionaries, in particular in dictionary articles. However, most of these structures have been developed on the basis of detailed analyses of print dictionaries and relatively little has been said about the arrangement of data in e-dictionaries. The relevant data types are lexicographical data providing help concerning the function(s) and use of dictionaries on search results pages. In order to create a visual hierarchy on screen that makes the most important search result data stand out, lexicographers should prioritize functional data that are directly related to and support the function(s) of dictionaries on a need-to-have/niceto- have basis, because data presentation structures with functional focus may better help users achieve their intended goals, i.e. finding answers to problems in communicative situations. One result is that lexicographers can analyse and de...
2007
Information overloading is today a serious concern that may hinder the potential of modern web-based information systems. A promising approach to deal with this problem is represented by knowledge extraction methods able to produce artifacts (also called patterns) that concisely represent data. Patterns are usually quite heterogeneous and voluminous. So far, little emphasis has been posed on developing an overall integrated environment for uniformly representing and querying different types of patterns.
letras.ufmg.br
The poster will show the IPIC XML Database, which aims to represent spontaneous spoken language transcripts with three levels of annotation: prosodic boundaries, information structure and morphosyntactic tagging (PoS).
Classification theory is divided into two areas: analysis of conceptual structure and file organization, and the primacy of the first is stressed, A model for conceptual structure in terms of concept coordination and poly-hierarchy is sketched, Some problems of file organization, namely post-coordination vs. pre-coordination and synthetic vs. enumerative schemes are discussed in relation to this model. A model for a classification scheme for different kinds of file organization is then proposed. The scheme would consist of a "core classification scheme" made up of elemental concepts and an "extended classification scheme" made up of combinations of elemental concepts. While the core scheme would be universal, extended schemes would be developed as needed in a specific application. This would make for flexibility while maintaining inter-system compatibility.
2014
This is the author’s version of a work that was submitted/accepted for publication in the following source:
Loading Preview
Sorry, preview is currently unavailable. You can download the paper by clicking the button above.
Electronic Communications of the EASST, 2010
The Open Handbook of Linguistic Data Management
Data & Knowledge Engineering, 2005
… of the Workshop on Resources and …, 2002
Proceedings of the LREC04, 2004
Information Research an International Electronic Journal, 2009