Reverse k nearest neighbors query processing: experiments and analysis
Given a set of users, a set of facilities and a query facility q, a reverse k nearest neighbors (R
k NN) query returns every user u for which the query is one of its k closest facilities. R k NN
queries have been extensively studied under a variety of settings and many sophisticated
algorithms have been proposed to answer these queries. However, the existing
experimental studies suffer from a few limitations. For example, some studies estimate the
I/O cost by charging a fixed penalty per I/O and we show that this may be misleading. Also …
k NN) query returns every user u for which the query is one of its k closest facilities. R k NN
queries have been extensively studied under a variety of settings and many sophisticated
algorithms have been proposed to answer these queries. However, the existing
experimental studies suffer from a few limitations. For example, some studies estimate the
I/O cost by charging a fixed penalty per I/O and we show that this may be misleading. Also …
Given a set of users, a set of facilities and a query facility q, a reverse k nearest neighbors (RkNN) query returns every user u for which the query is one of its k closest facilities. RkNN queries have been extensively studied under a variety of settings and many sophisticated algorithms have been proposed to answer these queries. However, the existing experimental studies suffer from a few limitations. For example, some studies estimate the I/O cost by charging a fixed penalty per I/O and we show that this may be misleading. Also, the existing studies either use an extremely small buffer or no buffer at all which puts some algorithms at serious disadvantage. We show that the performance of these algorithms is significantly improved even when a small buffer (containing 100 pages) is used. Finally, in each of the existing studies, the proposed algorithm is mainly compared only with its predecessor assuming that it was the best algorithm at the time which is not necessarily true as shown in our experimental study. Motivated by these limitations, we present a comprehensive experimental study that addresses these limitations and compares some of the most notable algorithms under a wide variety of settings. Furthermore, we also present a carefully developed filtering strategy that significantly improves TPL which is one of the most popular RkNN algorithms. Specifically, the optimized version is up to 20 times faster than the original version and reduces its I/O cost up to two times.
ACM Digital Library