首页 | 官方网站   微博 | 高级检索  
     

面向高维数据集的近邻顺序查询方法
引用本文:崔江涛,肖斌,詹海生.面向高维数据集的近邻顺序查询方法[J].计算机科学与探索,2010,4(9):840-849.
作者姓名:崔江涛  肖斌  詹海生
作者单位:1. 西安电子科技大学,计算机学院,西安,710071
2. 西安电子科技大学,网络教育学院,西安,710071
基金项目:中央高校基本科研业务费专项资金
摘    要:对顺序索引方法进行了研究,提出一种基于向量近似的高维顺序索引结构,该结构顺序访问部分文件就能完成k近邻查询。在查询过程中依据投影值来终止查询过程,依据距离来排除不匹配的数据。为进一步降低数据访问率,采用椭圆体聚类算法对数据集进行划分。新索引结构支持以多个顺序访问过程完成k近邻查询,能够同时降低查询过程中的I/O开销和CPU开销。在大型高维图像特征库上的实验表明,新的高维索引结构的查询性能优于其他高维索引方法。

关 键 词:高维索引  k近邻查询  椭圆体聚类  顺序查找
修稿时间: 

Sequential Search Method for Nearest Neighbor Query in High-Dimensional Dataset
CUI Jiangtao,XIAO Bin,ZHAN Haisheng.Sequential Search Method for Nearest Neighbor Query in High-Dimensional Dataset[J].Journal of Frontier of Computer Science and Technology,2010,4(9):840-849.
Authors:CUI Jiangtao  XIAO Bin  ZHAN Haisheng
Affiliation:1. School of Computer Science and Technology, Xidian University, Xi’an 710071, China 2. School of Network Education, Xidian University, Xi’an 710071, China
Abstract:The sequential index method is studied. A new high-dimensional sequential indexing structure based on vector ap-proximation is presented, in which only a small set of approximate vectors are sequentially accessed during the query. Two one-dimensional mapping values, projection value used for terminating the searching process and the distance used to reject impossible candidate points, are presented to improve the searching speed. To reduce the data points need to be accessed, the dataset is partitioned into some ellipsoid shaped clusters. The k-nearest neighbor search is composed of several sequentially scanning in the new index structure, which can reduce both the computa-tional CPU cost and I/O cost. The experimental results on large image database are indicative of the effectiveness of the approach.
Keywords:high-dimensional indexing  k-nearest neighbor search  ellipsoid shaped clustering  sequential scan
本文献已被 维普 万方数据 等数据库收录!
点击此处可从《计算机科学与探索》浏览原始摘要信息
点击此处可从《计算机科学与探索》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号