首页 | 官方网站   微博 | 高级检索  
     


Distributed computation of the knn graph for large high-dimensional point sets
Authors:Erion Plaku  Lydia E Kavraki
Affiliation:Department of Computer Science, Rice University, 6100 Main Street MS132, Houston, TX 77005-1892, USA
Abstract:High-dimensional problems arising from robot motion planning, biology, data mining, and geographic information systems often require the computation of k nearest neighbor (knn) graphs. The knn graph of a data set is obtained by connecting each point to its k closest points. As the research in the above-mentioned fields progressively addresses problems of unprecedented complexity, the demand for computing knn graphs based on arbitrary distance metrics and large high-dimensional data sets increases, exceeding resources available to a single machine. In this work we efficiently distribute the computation of knn graphs for clusters of processors with message passing. Extensions to our distributed framework include the computation of graphs based on other proximity queries, such as approximate knn or range queries. Our experiments show nearly linear speedup with over 100 processors and indicate that similar speedup can be obtained with several hundred processors.
Keywords:Nearest neighbors  Approximate nearest neighbors  knn graphs  Range queries  Metric spaces  Robotics  Distributed and parallel algorithms
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号