2019
Among the similarity queries in metric spaces, there is one that obtains the k-nearest neighbors of all the elements in the database (All-k-NN). One way to solve it is the naïve one: comparing each object in the database with all the other ones and returning the k elements nearest to it (k-NN). Another way is to preprocess the database to build an index, and then to search this index for the k-NN of each element of the dataset. Answering the All-k-NN problem allows building the k-Nearest Neighbor Graph (kNNG). Given an object collection of a metric space, the Nearest Neighbor Graph (NNG) associates each node with its closest neighbor under the given metric. If we link each object to its k nearest neighbors, we obtain the k Nearest Neighbor Graph (kNNG). The kNNG can be considered an index for a database, which is quite efficient and allows improvements. In this work, we propose a new technique to solve the All-k-NN problem which does not use any index to obta...
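A minimal sketch in Python of the naive All-k-NN baseline described above: every object is compared against all others and its k closest are kept, which directly yields the kNNG adjacency lists. The Euclidean metric and the toy points are assumptions for illustration only.

import heapq
import math

def euclidean(a, b):
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def naive_all_knn(db, k, dist=euclidean):
    # Returns {i: [(distance, j), ...]}: the kNNG adjacency lists.
    graph = {}
    for i, u in enumerate(db):
        # k smallest distances to every other object: O(n) distance evaluations per object
        graph[i] = heapq.nsmallest(
            k, ((dist(u, v), j) for j, v in enumerate(db) if j != i)
        )
    return graph

if __name__ == "__main__":
    points = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0), (5.0, 5.0)]
    print(naive_all_knn(points, k=2))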
Lecture Notes in Computer Science, 2005
Proximity searching consists in retrieving from a database those objects that are close to a query. For this type of searching problem, the most general model is the metric space, where proximity is defined in terms of a distance function. A solution to this problem consists in building an offline index to quickly satisfy online queries. The ultimate goal is to use as few distance computations as possible to satisfy queries, since the distance is considered expensive to compute. Proximity searching is central to several applications, ranging from multimedia indexing and querying to data compression and clustering. In this paper we present a new approach to solve the proximity searching problem. Our solution is based on indexing the database with the k-nearest neighbor graph (knng), which is a directed graph connecting each element to its k closest neighbors. We present two search algorithms for both range and nearest neighbor queries which use navigational and metric features of the knng. We show that our approach is competitive against current ones. For instance, in the document metric space our nearest neighbor search algorithms perform 30% more distance evaluations than AESA while using only 0.25% of its space requirement. In the same space, the pivot-based technique is completely useless.
Nearest neighbor queries can be satisfied, in principle, with a greedy algorithm over a proximity graph. Each object in the database is represented by a node, and proximal nodes share an edge. The idea for finding the nearest neighbor is quite simple: we start at a random node and iteratively get closer to the nearest neighbor by following only edges of the proximity graph. Every node reachable from the current vertex is reviewed, and only the node closest to the query is expanded in the next round. The algorithm stops when no neighbor of the current node is closer to the query. The number of reviewed objects will be proportional to the diameter of the graph times the average degree of the nodes. Unfortunately, the degree of a proximity graph is unbounded for a general metric space [1], and hence the number of inspected objects can be linear in the size of the database, which is the same as no indexing at all. In this paper we introduce a quasi-proximity graph induced by the all-k-nearest neighbor graph. The degree of this graph is bounded, but the greedy algorithm above may get trapped in local minima, which boils down to having false positives in the queries. We show experimental results for high dimensional spaces. We report a recall greater than 90% for most configurations, which is very good for many proximity searching applications, while reviewing just a tiny portion of the database. The space requirement of the index is linear in the database size, and the construction time is quadratic in the worst case. Relaxations of our method are sketched to obtain practical subquadratic implementations.
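A minimal sketch of the greedy search just described, over the kNNG adjacency lists produced by the earlier sketch: start at a random node, repeatedly move to the neighbor closest to the query, and stop at a local minimum. Those local minima are exactly where the quasi-proximity graph may return a wrong answer.

import random

def greedy_nn(graph, db, query, dist, start=None):
    current = start if start is not None else random.randrange(len(db))
    current_d = dist(db[current], query)
    while True:
        # review every node reachable from the current vertex
        best, best_d = current, current_d
        for _, j in graph[current]:
            d = dist(db[j], query)
            if d < best_d:
                best, best_d = j, d
        if best == current:   # no neighbor is closer to the query: local minimum
            return current, current_d
        current, current_d = best, best_d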
2006
The reverse k-nearest neighbor (RkNN) problem, i.e. finding all objects in a data set whose k-nearest neighbors include a specified query object, is a generalization of the reverse 1-nearest neighbor problem, which has received increasing attention recently. Many industrial and scientific applications call for solutions of the RkNN problem in arbitrary metric spaces, where the data objects are not Euclidean and only a metric distance function is given for specifying object similarity. Usually, these applications need a solution for the generalized problem where the value of k is not known in advance and may change from query to query. However, existing approaches, except one, are designed for the specific R1NN problem. In addition, to the best of our knowledge, all previously proposed methods, especially the one for generalized RkNN search, are only applicable to Euclidean vector data, not to general metric objects. In this paper, we propose the first approach for efficient RkNN search in arbitrary metric spaces where the value of k is specified at query time. Our approach uses the advantages of existing metric index structures and proposes conservative and progressive distance approximations in order to filter out true drops and identify true hits. In particular, we approximate the k-nearest neighbor distance of each data object by upper and lower bounds using two functions of only two parameters each. Thus, our method does not generate any considerable storage overhead. We show the scalability and usability of our novel approach in a broad experimental evaluation on real-world data.
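A hedged sketch of the filter step outlined above: each object o carries a lower and an upper bound on its k-NN distance; if dist(q, o) exceeds the upper bound, o is a true drop, if it is within the lower bound, o is a true hit, and everything else goes to an exact refinement step. The bound functions here are placeholders, not the two-parameter approximations proposed in the paper.

def rknn_filter(db, query, dist, lower_knn_dist, upper_knn_dist):
    hits, candidates = [], []
    for o in db:
        d = dist(query, o)
        if d > upper_knn_dist(o):       # q cannot be among o's k nearest: true drop
            continue
        if d <= lower_knn_dist(o):      # q is certainly among o's k nearest: true hit
            hits.append(o)
        else:                           # undecided: needs o's exact k-NN distance
            candidates.append(o)
    return hits, candidates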
XXII Congreso Argentino de Ciencias de la Computación (CACIC 2016)., 2016
Given a collection of objects in a metric space, the Nearest Neighbor Graph (NNG) associates each node with its closest neighbor under the given metric. It can be obtained trivially by computing the nearest neighbor of every object. To avoid computing every pairwise distance, an index could be constructed. Unfortunately, due to the curse of dimensionality, the indexed and brute-force methods are almost equally inefficient. This brings attention to algorithms computing approximate versions of the NNG. The DiSAT is a hierarchical proximity searching tree. Its root computes the distances to all objects, and each child node of the root does the same for its subtree, recursively. Top levels will have an accurate computation of the nearest neighbor, and as we descend the tree this information becomes less accurate. If we perform a few rebuilds of the index, taking deep nodes in each iteration and keeping score of the closest known neighbor, it is possible to compute an Approximate NNG (ANNG). Accordingly, in this work we propose to obtain the ANNG by this approach, without performing any search, and we tested the proposal on both synthetic and real-world databases with good results in both cost and response quality.
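A small sketch of the bookkeeping pattern behind this proposal: every distance d(u, v) evaluated while (re)building the index is also used to update the closest known neighbor of both u and v, so successive rebuilds tighten an approximate NNG as a side effect. This is only the generic pattern, not the DiSAT construction itself.

import math

class ApproxNNG:
    def __init__(self, n):
        self.best = [(math.inf, None)] * n      # per object: (distance, neighbor id)

    def observe(self, i, j, d):
        # Record a distance d(i, j) computed as a side effect of index construction.
        if d < self.best[i][0]:
            self.best[i] = (d, j)
        if d < self.best[j][0]:
            self.best[j] = (d, i)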
Proceedings of the 12th International Conference on Extending Database Technology Advances in Database Technology - EDBT '09, 2009
In this paper, we propose an original solution for the general reverse k-nearest neighbor (RkNN) search problem. In contrast to the limitations of existing methods for RkNN search, our approach works on top of any hierarchically organized tree-like index structure and, thus, is applicable to any type of data as long as a metric distance function is defined on the data objects. As examples, we show how our approach works on top of the most prevalent index structures for Euclidean and metric data, the R-Tree and the M-Tree, respectively. Our solution is applicable for arbitrary values of k and can also be applied in dynamic environments where updates of the database occur frequently. Despite being the most general solution to the RkNN problem, our approach outperforms existing methods in terms of query execution times because it exploits different strategies for pruning false drops and identifying true hits as soon as possible.
2009 Second International Workshop on Similarity Search and Applications, 2009
Retrieving the k-nearest neighbors of a query object is a basic primitive in similarity searching. A related, far less explored primitive is to obtain the dataset elements which would have the query object within their own k-nearest neighbors, known as the reverse k-nearest neighbor query. We already have indices and algorithms to solve k-nearest neighbors queries in general metric spaces; yet, in many cases of practical interest they degenerate to sequential scanning. The naive algorithm for reverse k-nearest neighbor queries has quadratic complexity, because the k-nearest neighbors of all the dataset objects must be found; this is too expensive. Hence, when solving these primitives we can tolerate trading correctness in the solution for searching time. In this paper we propose an efficient approximate approach to solve these similarity queries with high retrieval rate. Then, we show how to use our approximate k-nearest neighbor queries to construct (an approximation of) the k-nearest neighbor graph when we have a fixed dataset. Finally, combining both primitives we show how to dynamically maintain the approximate k-nearest neighbor graph of the objects currently stored within the metric dataset, that is, considering both object insertions and deletions.
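A hedged sketch of answering a reverse k-NN query directly from a (possibly approximate) kNNG in the adjacency-list format used earlier: an object reports the query among its k nearest neighbors exactly when its distance to the query does not exceed the distance to its current k-th neighbor in the graph; with an approximate graph the answer is approximate.

def reverse_knn_from_graph(graph, db, query, dist):
    result = []
    for i, neighbors in graph.items():
        kth_dist = max(d for d, _ in neighbors)   # distance from object i to its k-th neighbor
        if dist(query, db[i]) <= kth_dist:        # q would enter i's k-NN list
            result.append(i)
    return result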
British National Conference on Databases, 1998
Building an index tree is a common approach to speed up the k nearest neighbour search in large databases of many-dimensional records. Many applications require varying distance metrics by putting a weight on different dimensions. The main problem with k nearest neighbour searches using weighted Euclidean metrics in a high dimensional space is whether the searches can be done efficiently. We present a solution to this
Proceedings of the fourth annual ACM-SIAM …, 1993
We consider the computational problem of finding nearest neighbors in general metric spaces. Of particular interest are spaces that may not be conveniently embedded or approximated in Euclidean space, or where the dimensionality of a Euclidean representation is ...
Lecture Notes in Computer Science, 1998
Building an index tree is a common approach to speed up the k nearest neighbour search in large databases of many-dimensional records. Many applications require varying distance metrics by putting a weight on different dimensions. The main problem with k nearest neighbour searches using weighted Euclidean metrics in a high dimensional space is whether the searches can be done efficiently. We present a solution to this problem which uses the bounding rectangle of the nearest-neighbour disk instead of using the disk directly. The algorithm is able to perform nearest-neighbour searches using distance metrics different from the metric used to build the search tree without having to rebuild the tree. It is efficient for the weighted Euclidean distance and extensible to higher dimensions.
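A minimal sketch of the two ingredients above: a weighted Euclidean metric and the axis-aligned bounding rectangle of the ball of radius r around a query under that metric (the half-extent along dimension i is r / sqrt(w_i)). The tree traversal that consumes the rectangle is omitted.

import math

def weighted_euclidean(x, y, w):
    return math.sqrt(sum(wi * (xi - yi) ** 2 for xi, yi, wi in zip(x, y, w)))

def bounding_rectangle(query, radius, w):
    # Per-dimension [low, high] interval enclosing the weighted ball around the query.
    return [(qi - radius / math.sqrt(wi), qi + radius / math.sqrt(wi))
            for qi, wi in zip(query, w)]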
2010
Numerous techniques have been proposed in the past for supporting efficient k-nearest neighbor (k-NN) queries in continuous data spaces. Limited work has been reported in the literature for k-NN queries in a non-ordered discrete data space (NDDS). Performing k-NN queries in an NDDS raises new challenges. The Hamming distance is usually used to measure the distance between two vectors (objects) in an NDDS. Due to the coarse granularity of the Hamming distance, a k-NN query in an NDDS may lead to a high degree of non-determinism for the query result. We propose a new distance measure, called Granularity-Enhanced Hamming (GEH) distance, that effectively reduces the number of candidate solutions for a query. We have also implemented k-NN queries using multidimensional database indexing in NDDSs. Further, we use the properties of our multidimensional NDDS index to derive the probability of encountering new neighbors within specific regions of the index. This probability is used to develop a new search ordering heuristic. Our experiments on synthetic and genomic data sets demonstrate that our index-based k-NN algorithm is efficient in finding k-NNs in both uniform and non-uniform data sets in NDDSs and that our heuristics are effective in improving the performance of such queries.
2007 IEEE 23rd International Conference on Data Engineering, 2007
A k-nearest neighbor (k-NN) query retrieves k objects from a database that are considered to be the closest to a given query point. Numerous techniques have been proposed in the past for supporting efficient k-NN searches in continuous data spaces. No such work has been reported in the literature for k-NN searches in a non-ordered discrete data space (NDDS). Performing k-NN searches in an NDDS raises new challenges. The Hamming distance is usually used to measure the distance between two vectors (objects) in an NDDS. Due to the coarse granularity of the Hamming distance, a k-NN query in an NDDS may lead to a large set of candidate solutions, creating a high degree of nondeterminism for the query result. We propose a new distance measure, called Granularity-Enhanced Hamming (GEH) distance, that effectively reduces the number of candidate solutions for a query. We have also considered using multidimensional database indexing for implementing k-NN searches in NDDSs. Our experiments on synthetic and genomic data sets demonstrate that our index-based k-NN algorithm is effective and efficient in finding k-NNs in NDDSs.
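An illustration of the granularity problem described above: under plain Hamming distance many objects tie at the same integer value, inflating the candidate set. The frequency-based secondary key below is only a hypothetical stand-in showing how a finer-grained ranking shrinks the ties; it is not the paper's GEH formula.

from collections import Counter

def hamming(x, y):
    return sum(a != b for a, b in zip(x, y))

def rank_key(x, y, freq):
    # Secondary key: matches on common symbols count as "farther" than matches on rare ones.
    tie_break = sum(freq[i][a] for i, (a, b) in enumerate(zip(x, y)) if a == b)
    return (hamming(x, y), tie_break)

if __name__ == "__main__":
    data = ["acgt", "acga", "tcga", "gggt"]
    freq = [Counter(s[i] for s in data) for i in range(4)]
    print(sorted(data, key=lambda s: rank_key("acgt", s, freq)))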
Lecture Notes in Computer Science, 2006
Let U be a set of elements and d a distance function defined among them. Let NN_k(u) be the k elements in U − {u} having the smallest distance to u. The k-nearest neighbor graph (knng) is a weighted
Lecture Notes in Computer Science, 2007
Similarity search algorithms that directly rely on index structures and require a lot of distance computations are usually not applicable to databases containing complex objects and defining costly distance functions on spatial, temporal and multimedia data. Rather, the use of an adequate multi-step query processing strategy is crucial for the performance of a similarity search routine that deals with complex distance functions. Reducing the number of candidates returned from the filter step, which then have to be exactly evaluated in the refinement step, is fundamental for the efficiency of the query process. The state-of-the-art multi-step k-nearest neighbor (kNN) search algorithms are designed to use only a lower-bounding distance estimation for candidate pruning. However, in many applications an upper-bounding distance approximation is also available that can additionally be used for reducing the number of candidates. In this paper, we generalize the traditional concept of R-optimality and introduce the notion of R_I-optimality, depending on the distance information I available in the filter step. We propose a new multi-step kNN search algorithm that utilizes lower- and upper-bounding distance information (I_lu) in the filter step. Furthermore, we show that, in contrast to existing approaches, our proposed solution is R_{I_lu}-optimal. In an experimental evaluation, we demonstrate the significant performance gain over existing methods.
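A hedged sketch of a multi-step k-NN filter using both bounds, in the spirit of the approach above: the k-th smallest upper bound is itself an upper bound on the true k-NN distance, so any candidate whose lower bound exceeds it can be dropped before the expensive exact refinement. The bound and exact-distance functions (all with respect to the query) are assumed to be supplied by the filter step.

import heapq

def multistep_knn(db, k, lower, upper, exact):
    # Pruning threshold derived from upper bounds alone.
    threshold = heapq.nsmallest(k, (upper(o) for o in db))[-1]
    candidates = [o for o in db if lower(o) <= threshold]
    # Refinement: exact (expensive) distances only for the surviving candidates.
    return heapq.nsmallest(k, candidates, key=exact)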
SIAM Journal on Computing, 2013
We study the Approximate Nearest Neighbor problem for metric spaces where the query points are constrained to lie on a subspace of low doubling dimension, while the data is high-dimensional. We show that this problem can be solved efficiently despite the high dimensionality of the data.
Lecture Notes in Computer Science, 2006
Proximity searching consists in retrieving from a database those elements that are similar to a query. As the distance is usually expensive to compute, the goal is to use as few distance computations as possible to satisfy queries. Indexes use precomputed distances among database elements to speed up queries. As such, a baseline is AESA, which stores all the distances among database objects, but has been unbeaten in query performance for 20 years. In this paper we show that it is possible to improve upon AESA by using a radically different method to select promising database elements to compare against the query. Our experiments show improvements of up to 75% in document databases. We also explore the usage of our method as a probabilistic algorithm that may lose relevant answers. On a database of faces where any exact algorithm must examine virtually all elements, our probabilistic version obtains 85% of the correct answers by scanning only 10% of the database.
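A rough sketch of the AESA baseline referred to above (not the paper's new selection method): with all pairwise distances D precomputed, every object already compared with the query acts as a pivot, and the lower bound max_p |d(q, p) - D[p][x]| is used both to prune objects and to pick the next, most promising candidate.

def aesa_nn(db, D, query, dist):
    lb = [0.0] * len(db)                          # current lower bounds w.r.t. the query
    alive = set(range(len(db)))
    best, best_d = None, float("inf")
    while alive:
        nxt = min(alive, key=lambda i: lb[i])     # smallest lower bound: most promising
        alive.discard(nxt)
        d = dist(query, db[nxt])                  # one expensive distance evaluation
        if d < best_d:
            best, best_d = nxt, d
        for i in list(alive):
            lb[i] = max(lb[i], abs(d - D[nxt][i]))
            if lb[i] >= best_d:                   # cannot improve on the current best
                alive.discard(i)
    return best, best_d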
Multimedia Tools and Applications, 2003
In order to speed up retrieval in large collections of data, index structures partition the data into subsets so that query requests can be evaluated without examining the entire collection. As the complexity of modern data types grows, metric spaces have become a popular paradigm for similarity retrieval. We propose a new index structure, called D-Index, that combines a novel clustering technique and the pivot-based distance searching strategy to speed up execution of similarity range and nearest neighbor queries for large files with objects stored in disk memories. We have qualitatively analyzed D-Index and verified its properties on an actual implementation. We have also compared D-Index with other index structures and demonstrated its superiority on several real-life data sets. Contrary to tree organizations, the D-Index structure is suitable for dynamic environments with a high rate of delete/insert operations.
1999
We investigate the problem of approximate similarity (nearest neighbor) search in high-dimensional metric spaces, and describe how the distance distribution of the query object can be exploited so as to provide probabilistic guarantees on the quality of the result. This leads to a new paradigm for similarity search, called PAC-NN (probably approximately correct nearest neighbor) queries, aiming to break the "dimensionality curse". PAC-NN queries return, with probability at least 1 − δ, a (1 + ε)-approximate NN: an object whose distance from the query q is less than (1 + ε) times the distance between q and its NN. Analytical and experimental results obtained for sequential and index-based algorithms show that PAC-NN queries can be efficiently processed even on very high-dimensional spaces and that control can be exerted in order to trade off the accuracy of the result against the cost.
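A hedged sketch of a PAC-NN style early stop during a sequential scan, assuming an estimate F(r) of the query's distance distribution (the fraction of objects within distance r of q) is available: the scan halts as soon as the probability that some unseen object improves the current best by more than a factor (1 + eps) drops below delta.

def pac_nn_scan(db, query, dist, F, eps, delta):
    best, best_d = None, float("inf")
    for seen, o in enumerate(db, start=1):
        d = dist(query, o)
        if d < best_d:
            best, best_d = o, d
        unseen = len(db) - seen
        # Probability that at least one unseen object lies within best_d / (1 + eps).
        p_improve = 1.0 - (1.0 - F(best_d / (1.0 + eps))) ** unseen
        if p_improve <= delta:
            break
    return best, best_d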
Lecture Notes in Computer Science, 2012
We propose a novel approach for solving the approximate nearest neighbor search problem in arbitrary metric spaces. The distinctive feature of our approach is that we can incrementally build a non-hierarchical distributed structure for given metric space data, with logarithmic complexity scaling in the size of the structure, and answer probabilistic nearest neighbor queries with adjustable accuracy. The structure is based on a small world graph with vertices corresponding to the stored elements, edges as links between them, and the greedy algorithm as the base algorithm for searching. Both the search and the addition algorithm require only local information from the structure. Simulations for data in Euclidean space show that the structure built using the proposed algorithm has navigable small world properties with logarithmic search complexity at fixed accuracy, and has weak (power law) scalability with the dimensionality of the stored data.
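A minimal sketch of the incremental construction described above, reusing the greedy_nn routine and the adjacency format from the earlier sketches: each new element is linked to the (approximately) closest elements already stored, found by running the same greedy search from a few random entry points. The parameters n_links and n_restarts are illustrative, not the paper's settings.

def nsw_insert(graph, db, new_point, dist, n_links=5, n_restarts=3):
    new_id = len(db)
    db.append(new_point)
    found = set()
    for _ in range(n_restarts):                   # several random entry points
        if graph:
            node, _ = greedy_nn(graph, db[:-1], new_point, dist)
            found.add(node)
    neighbors = sorted(found, key=lambda i: dist(db[i], new_point))[:n_links]
    graph[new_id] = [(dist(db[i], new_point), i) for i in neighbors]
    for i in neighbors:                           # links are kept in both directions
        graph[i].append((dist(db[i], new_point), new_id))
    return new_id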
Lecture Notes in Computer Science, 2004
A number of problems in computer science can be solved efficiently with so-called memory-based or kernel methods. Among these problems (relevant to the AI community) are multimedia indexing, clustering, unsupervised learning and recommendation systems. The common ground of these problems is satisfying proximity queries over an abstract metric database. In this paper we introduce a new technique for building practical indexes for metric range queries. This technique improves existing algorithms based on pivots and signatures, and introduces a new data structure, the Fixed Queries Trie, to speed up metric range queries. The result is an index with O(n) construction time and query complexity O(n^α), α ≤ 1. The indexing algorithm uses only a few bits of storage for each database element.
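A small sketch of the pivot-based filtering that the Fixed Queries Trie builds on: for a range query (q, r), any object x with |d(p, x) - d(p, q)| > r for some pivot p is discarded by the triangle inequality, and only the survivors are compared directly with q. The trie layout and the signatures themselves are omitted; pivot_table holds the precomputed pivot distances.

def pivot_range_query(db, pivots, pivot_table, query, radius, dist):
    # pivot_table[i][j] is the precomputed distance d(pivots[j], db[i]).
    qp = [dist(query, p) for p in pivots]
    result = []
    for i, x in enumerate(db):
        if any(abs(pivot_table[i][j] - qp[j]) > radius for j in range(len(pivots))):
            continue                        # filtered out without computing d(q, x)
        if dist(query, x) <= radius:        # verification with the real distance
            result.append(x)
    return result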
Given a set P of N points in a d-dimensional space, along with a query point q, it is often desirable to find k points of P that are with high probability close to q. This is the Approximate k-Nearest-Neighbors (AkNN) problem. We present two algorithms for AkNN. Both require O(N^2 d) preprocessing time. The first algorithm has a query time cost of O(d + log N), while the second has a query time cost of O(d). Both algorithms create an undirected graph on the points of P by adding edges to a linked list storing P in Hilbert order. To find approximate nearest neighbors of a query point, both algorithms perform best-first search on this graph. The first algorithm uses standard one-dimensional indexing structures to find starting points on the graph for this search, whereas the second algorithm uses random starting points. Despite the quadratic preprocessing time, our algorithms have the potential to be useful in machine learning applications where the number of query points that need to be processed is large compared to the number of points in P. The linear dependence in d of the preprocessing and query time costs of our algorithms allows them to remain effective even when dealing with high-dimensional data.
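A hedged sketch of the best-first graph search both algorithms rely on: starting from one or more seed nodes, a priority queue always expands the node currently closest to the query, and the k best objects seen so far are returned once a fixed expansion budget is spent. The graph is assumed to be a plain adjacency mapping from node id to neighbor ids; the Hilbert-order edge construction and the two seeding strategies are omitted.

import heapq

def best_first_knn(graph, db, query, dist, seeds, k, budget=200):
    frontier = [(dist(db[s], query), s) for s in set(seeds)]
    heapq.heapify(frontier)
    seen = {s for _, s in frontier}
    results = list(frontier)
    expansions = 0
    while frontier and expansions < budget:
        _, node = heapq.heappop(frontier)     # expand the node closest to the query
        expansions += 1
        for j in graph[node]:
            if j not in seen:
                seen.add(j)
                dj = dist(db[j], query)
                heapq.heappush(frontier, (dj, j))
                results.append((dj, j))
    return heapq.nsmallest(k, results)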