Interoperability in peer data management systems

Kai-Uwe  Sattler

Interoperability in peer data management systems

Kai-Uwe Sattler

2007

visibility

…

description

12 pages

link

1 file

Sign up for access to the world's latest research

checkGet notified about relevant papers

checkSave papers to use in your research

checkJoin the discussion with peers

checkTrack your impact

Abstract

Interoperability plays an important role for a variety of applications. One of them are Peer Data Management Systems, where autonomous data sources (peers) interact with each other based on semantic mappings between their schemas. The building blocks that enable interoperability and thus the main challenges in such systems are mapping representation, query rewriting, and efficient query processing. While most approaches regard these aspects in separate this paper presents a comprehensive study of the interactions between these blocks. Our considerations try to provide a holistic view on semantic interoperability in distributed environments such as PDMS. We discuss techniques for distributed query processing and rewriting that consider high-level query operators such as top-N and skyline. Furthermore, we discuss how to increase efficiency by applying routing indexes and relaxation of result completeness/correctness.

Armin Roth

2007

Peer data management systems (PDMS) are a highly dynamic, decentralized infrastructure for large-scale data integration. They consist of a dynamic set of autonomous peers interconnected with a network of schema mappings. Queries submitted at a peer are answered with local data and by data that is reached along paths of mappings. Due to redundancies in the mapping network, query answering in PDMS can be very inefficient if the complete query result is to be computed. System P, a fully functional PDMS, compromises the completeness of the query result and reduces cost by pruning the query plan at mappings that are estimated to yield only few result tuples. The demo illustrates the following main components of System P: (1) adaptive estimation of result cardinalities of intermediate queries using histograms, (2) completeness-driven query planning under limited resources using specialized heuristics, and (3) the automatic generation of heterogeneous PDMS test instances, controlled by a rich set of parameters.

Log In

Interoperability in peer data management systems

Sign up for access to the world's latest research

Abstract

Related papers

Related papers