
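A Statistics-Based Study of the Curse of Dimensionality in Nearest Neighbor Queries
Cite this article:BO Shukui (薄树奎), LI Shengyang (李盛阳), ZHU Chongguang (朱重光). A Statistics-Based Study of the Curse of Dimensionality in Nearest Neighbor Queries [J]. Computer Engineering (计算机工程), 2006, 32(21): 6-8.
Authors:BO Shukui (薄树奎)  LI Shengyang (李盛阳)  ZHU Chongguang (朱重光)
Affiliation:Institute of Remote Sensing Applications, Chinese Academy of Sciences, Beijing 100101
Abstract:This paper studies how dimensionality affects the results of nearest neighbor queries in high-dimensional data spaces and proposes a method for evaluating that effect. Using statistical arguments, it proves that similarity queries become unstable under certain conditions and characterizes the distribution of how this instability worsens as dimensionality increases. The distributions of two distance-based statistics are given, which allow nearest neighbor query problems to be estimated theoretically, and experimental results verify the theory.

Keywords:Instability  Statistics  Curse of dimensionality  Similarity  Nearest neighbor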

Study on Dimensionality Curse in the Nearest Neighbor Queries Based on Statistics
BO Shukui,LI Shengyang,ZHU Chongguang.Study on Dimensionality Curse in the Nearest Neighbor Queries Based on Statistics[J].Computer Engineering,2006,32(21):6-8.
Authors:BO Shukui  LI Shengyang  ZHU Chongguang
Affiliation:(Institute of Remote Sensing Applications, Chinese Academy of Sciences, Beijing 100101)
Abstract:This paper explores the effect of dimensionality on the “nearest neighbor” problem. Using statistical arguments, it shows that under certain conditions, as dimensionality increases, the distances from the query point to the data points converge toward one another, so the notion of a “nearest neighbor” becomes less meaningful. A method for evaluating this dimensionality effect is presented: the effect is assessed from the distributions of two distance-based statistics. Empirical results are presented to support the two distributions.
Keywords:Instability  Statistics  Dimensionality curse  Similarity  Nearest neighbor
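
The distance-concentration effect described in the abstract can be illustrated with a small simulation. The sketch below is not the authors' method and does not reproduce their two statistics, which the abstract does not specify. It assumes data drawn uniformly from the unit hypercube, Euclidean distance, and a commonly used "relative contrast" measure, (d_max - d_min) / d_min; the function name `relative_contrast` and its parameters are hypothetical.

```python
import math
import random


def relative_contrast(dim, n_points=500, seed=0):
    """Return (d_max - d_min) / d_min for one random query point.

    Assumption for illustration: the query and the n_points data points
    are drawn uniformly from the unit hypercube [0, 1]^dim.
    """
    rng = random.Random(seed)
    query = [rng.random() for _ in range(dim)]
    d_min, d_max = math.inf, 0.0
    for _ in range(n_points):
        point = [rng.random() for _ in range(dim)]
        dist = math.dist(query, point)  # Euclidean distance
        d_min = min(d_min, dist)
        d_max = max(d_max, dist)
    return (d_max - d_min) / d_min


if __name__ == "__main__":
    # The contrast between farthest and nearest neighbor should shrink
    # as the dimension grows.
    for dim in (2, 10, 100, 1000):
        print(dim, round(relative_contrast(dim), 3))
```

Under these assumptions the printed contrast drops sharply with dimension: the farthest point ends up only slightly farther from the query than the nearest one, which is the sense in which a nearest neighbor query becomes unstable.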
This article is indexed in Wanfang Data (万方数据) and other databases.