首页 | 官方网站   微博 | 高级检索  
     


A novel neighborhood based document smoothing model for information retrieval
Authors:Pawan Goyal  Laxmidhar Behera  T M McGinnity
Affiliation:1. Intelligent Systems Research Centre, School of Computing and Intelligent Systems, University of Ulster, Ulster, UK
2. INRIA-Rocquencourt, Le Chesnay, France
3. Department of Electrical Engineering, Indian Institute of Technology, Kanpur, India
Abstract:In this paper, a novel neighborhood based document smoothing model for information retrieval has been proposed. Lexical association between terms is used to provide a context sensitive indexing weight to the document terms, i.e. the term weights are redistributed based on the lexical association with the context words. A generalized retrieval framework has been presented and it has been shown that the vector space model (VSM), divergence from randomness (DFR), Okapi Best Matching 25 (BM25) and the language model (LM) based retrieval frameworks are special cases of this generalized framework. Being proposed in the generalized retrieval framework, the neighborhood based document smoothing model is applicable to all the indexing models that use the term-document frequency scheme. The proposed smoothing model is as efficient as the baseline retrieval frameworks at runtime. Experiments over the TREC datasets show that the neighborhood based document smoothing model consistently improves the retrieval performance of VSM, DFR, BM25 and LM and the improvements are statistically significant.
Keywords:
本文献已被 SpringerLink 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号