Join processing in relational databases

Priti Mishra; Margaret H. Eich

Join processing in relational databases

Priti Mishra

1992, ACM Computing Surveys

Sign up for access to the world's latest research

checkGet notified about relevant papers

checkSave papers to use in your research

checkJoin the discussion with peers

checkTrack your impact

Abstract

The join operation is one of the fundamental relational database query operations. It facilitates the retrieval of information from two different relations based on a Cartesian product of the two relations. The join is one of the most diffidult operations to implement efficiently, as no predefined links between relations are required to exist (as they are with network and hierarchical systems). The join is the only relational algebra operation that allows the combining of related tuples from relations on different attribute schemes. Since it is executed frequently and is expensive, much research effort has been applied to the optimization of join processing. In this paper, the different kinds of joins and the various implementation techniques are surveyed. These different methods are classified based on how they partition tuples from different relations. Some require that all tuples from one be compared to all tuples from another; other algorithms only compare some tuples from each....

Marc H. Scholl

Proc. of the Int'l Conference on Information …

Most join algorithms can be extended to reduce wasted work when several tuples contain the same value of the join attribute. We show that separating detection of duplicates from their exploitation improves modularity and makes it easier to implement whole families of hierarchy-exploiting join algorithms that avoid duplication. The technique is also used to provide an execution technique for star-like patterns of joins around a central relation. It dominates Ingres-like substitution for the central relation, in both performance and ease of including in a conventional optimizer. Its performance dominates a cascade of conventional binary joins, and performance estimates are more accurate. We then argue that such techniques make it undesirable to implement physical-level multiway join operations within a query processor.

Log In

Join processing in relational databases

Sign up for access to the world's latest research

Abstract

Related papers

Related topics

Related papers