Academia.eduAcademia.edu

The reliability of queries (extended abstract)

1995

Abstract

We consider an unreliabable database as a random variable defined from a relational database with various probabilistic models. For a given query Q, we define its reliability on a database D15, pQ (lIll), as the probability that the answer to Q on an unreliable random instance coincides with the answer to Q on DB. We investigate the computational complexity of computing PQ (.DB), when Q is defined in various logic-based languages. We show that pQ (DB) is computable in polynomial time when Q is defined in first-order logic and that PQ (Ill?) is P#p computable when Q is defined in Datalog. We then discuss possible ways of estimating the reliability y for natural distributions. 1 Introduction Consider a relational database built by many transactions. As with most other physical processes, transactions succeed with high probabilities only, and after enough time there is a discrepancy between the expected database DB and the real one DB'. Most users know that their database contains wrong data and that some correct data is missing. This situation may not influence the answer to some query Q but answers to some other query Q' may be wrong. We address the question on what is an error model for databases and what is the reliability of a query.

Loading...

Loading Preview

Sorry, preview is currently unavailable. You can download the paper by clicking the button above.