
Kai Eckert
Computer and Information Scientist
Address: Mannheim, Germany
Address: Mannheim, Germany
less
Related Authors
Beat Signer
Vrije Universiteit Brussel
Paul Levinson
Fordham University
Iman Attarzadeh
Islamic Azad University of Tehran, Central branch
Katarzyna Kopecka-Piech
Maria Curie - Skłodowska University
Pedro Pereira Neto
Instituto Politécnico de Lisboa
Professor Dr Christopher Preece
Abu Dhabi University
Pirkka Åman
Aalto University
Nancy Law
The University of Hong Kong
Ahmad Salman Khan
University of Lahore, Lahore Pakistan
Hatice ŞİRİN
Ege University
InterestsView All (7)
Uploads
Research by Kai Eckert
The central contribution is the ICE-Map Visualization, a treemap-based visualization on top of a generalized statistical framework that is able to visualize almost arbitrary usage information. The proper selection of an existing KOS for available documents and the evaluation of a KOS for different indexing techniques by means of the ICE-Map Visualization is demonstrated.
For the creation of a new KOS, an approach based on crowdsourcing is presented that uses feedback from Amazon Mechanical Turk to relate terms hierarchically. The extension of an existing KOS with new terms derived from the documents to be indexed is performed with a machine-learning approach that relates the terms to existing concepts in the hierarchy. The features are derived from text snippets in the result list of a web search engine. For the splitting of overpopulated
concepts into new subconcepts, an interactive clustering approach is presented that is able to propose names for the new subconcepts. The implementation of a framework is described that integrates all approaches of this thesis and contains the reference implementation of the ICE-Map Visual-
ization. It is extendable and supports the implementation of evaluation methods that build on other evaluations. Additionally, it supports the visualization of the results and the implementation of new visualizations. An important building block for practical applications is the simple linguistic indexer that is presented as minor contribution. It is knowledge-poor and works without any training.
This thesis applies computer science approaches in the domain of information science. The introduction describes the foundations in information science; in the conclusion, the focus is set on the relevance for practical applications, especially
regarding the handling of different qualities of KOSs due to automatic and semi-automatic maintenance.
Linked Data, the openness and flexibility is a mixed blessing. For them, data validation according to predefined constraints is a much sought-after feature, particularly as this
is taken for granted in the XML world. Based on our work
in the DCMI RDF Application Profiles Task Group
and in cooperation with the W3C Data Shapes Working Group, we published by today 81 types of constraints that are required
by various stakeholders for data applications. These constraint types form the basis to investigate the role that reasoning and different semantics play in practical data validation, why reasoning is beneficial for RDF validation, and
how to overcome the major shortcomings when validating
RDF data by performing reasoning prior to validation. For
each constraint type, we examine (1) if reasoning may im-
prove data quality, (2) how efficient in terms of runtime val-
idation is performed with and without reasoning, and (3) if
validation results depend on underlying semantics which differs between reasoning and validation. Using these findings,
we determine for the most common constraint languages
which constraint types they enable to express and give di-
rections for the further development of constraint languages.
alyze the distribution of indexing results within a given Knowledge Organization System (KOS) hierarchy and allows the user to explore the document sets and the KOSs at the same time. In this paper, we demonstrate the use of the ICE-Map Visualization in combination with a simple
automatic indexer to visualize the semantic overlap between a KOS and a set of documents.
Papers by Kai Eckert
The central contribution is the ICE-Map Visualization, a treemap-based visualization on top of a generalized statistical framework that is able to visualize almost arbitrary usage information. The proper selection of an existing KOS for available documents and the evaluation of a KOS for different indexing techniques by means of the ICE-Map Visualization is demonstrated.
For the creation of a new KOS, an approach based on crowdsourcing is presented that uses feedback from Amazon Mechanical Turk to relate terms hierarchically. The extension of an existing KOS with new terms derived from the documents to be indexed is performed with a machine-learning approach that relates the terms to existing concepts in the hierarchy. The features are derived from text snippets in the result list of a web search engine. For the splitting of overpopulated
concepts into new subconcepts, an interactive clustering approach is presented that is able to propose names for the new subconcepts. The implementation of a framework is described that integrates all approaches of this thesis and contains the reference implementation of the ICE-Map Visual-
ization. It is extendable and supports the implementation of evaluation methods that build on other evaluations. Additionally, it supports the visualization of the results and the implementation of new visualizations. An important building block for practical applications is the simple linguistic indexer that is presented as minor contribution. It is knowledge-poor and works without any training.
This thesis applies computer science approaches in the domain of information science. The introduction describes the foundations in information science; in the conclusion, the focus is set on the relevance for practical applications, especially
regarding the handling of different qualities of KOSs due to automatic and semi-automatic maintenance.
Linked Data, the openness and flexibility is a mixed blessing. For them, data validation according to predefined constraints is a much sought-after feature, particularly as this
is taken for granted in the XML world. Based on our work
in the DCMI RDF Application Profiles Task Group
and in cooperation with the W3C Data Shapes Working Group, we published by today 81 types of constraints that are required
by various stakeholders for data applications. These constraint types form the basis to investigate the role that reasoning and different semantics play in practical data validation, why reasoning is beneficial for RDF validation, and
how to overcome the major shortcomings when validating
RDF data by performing reasoning prior to validation. For
each constraint type, we examine (1) if reasoning may im-
prove data quality, (2) how efficient in terms of runtime val-
idation is performed with and without reasoning, and (3) if
validation results depend on underlying semantics which differs between reasoning and validation. Using these findings,
we determine for the most common constraint languages
which constraint types they enable to express and give di-
rections for the further development of constraint languages.
alyze the distribution of indexing results within a given Knowledge Organization System (KOS) hierarchy and allows the user to explore the document sets and the KOSs at the same time. In this paper, we demonstrate the use of the ICE-Map Visualization in combination with a simple
automatic indexer to visualize the semantic overlap between a KOS and a set of documents.