Academia.eduAcademia.edu

A Proposal Of Index For High-Dimensional Static Databases

Abstract

As a member of R-tree family, R*-tree is widely used in multimedia databases and spatial databases, in which NN (Nearest Neighbor) search is very popular. According to our investigations, (1) the degree of objects clustering in the leaf nodes is a very important factor on performance of NN search; (2) Normally, in R*-tree, its objects are not well-clustered in their leaf nodes. This paper proposes a new index structure, called Clustering-Based R*-tree (denoted CBR*-tree), for static databases by introducing clustering technology to R*-tree. Although some packing algorithms for R-trees have been proposed, all of them try to pack the same (or roughly same) number of objects in each leaf node, which often result in that the distribution of objects in leaf nodes can not reflect their actual distribution. The experimental results show that the CBR*-tree has better NN search performance than R*-tree and packed R-trees.