首页 | 官方网站   微博 | 高级检索  
     


From Retrieval Status Values to Probabilities of Relevance for Advanced IR Applications
Authors:Henrik Nottelmann  Norbert Fuhr
Affiliation:(1) Institute of Informatics and Interactive Systems, University of Duisburg-Essen, 47048 Duisburg, Germany
Abstract:Information Retrieval systems typically sort the result with respect to document retrieval status values (RSV). According to the Probability Ranking Principle, this ranking ensures optimum retrieval quality if the RSVs are monotonously increasing with the probabilities of relevance (as e.g. for probabilistic IR models). However, advanced applications like filtering or distributed retrieval require estimates of the actual probability of relevance. The relationship between the RSV of a document and its probability of relevance can be described by a ldquonormalisationrdquo function which maps the retrieval status value onto the probability of relevance (ldquomapping functionsrdquo). In this paper, we explore the use of linear and logistic mapping functions for different retrieval methods. In a series of upper-bound experiments, we compare the approximation quality of the different mapping functions. We also investigate the effect on the resulting retrieval quality in distributed retrieval (only merging, without resource selection). These experiments show that good estimates of the actual probability of relevance can be achieved, and that the logistic model outperforms the linear one. Retrieval quality for distributed retrieval is only slightly improved by using the logistic function.
Keywords:formal models  retrieval status value  probability of inference  parameter learning  evaluation
本文献已被 SpringerLink 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号