Strength Pareto fitness assignment for pseudo-relevance feedback: application to MEDLINE |
| |
Authors: | Ilyes Khennak Habiba Drias |
| |
Affiliation: | Laboratory for Research in Artificial Intelligence, Computer Science Department, University of Sciences and Technology Houari Boumediene (USTHB), Algiers 16111, Algeria |
| |
Abstract: | Because of users’ growing utilization of unclear and imprecise keywords when characterizing their information need, it has become necessary to expand their original search queries with additional words that best capture their actual intent. The selection of the terms that are suitable for use as additional words is in general dependent on the degree of relatedness between each candidate expansion term and the query keywords. In this paper, we propose two criteria for evaluating the degree of relatedness between a candidate expansion word and the query keywords: (1) co-occurrence frequency, where more importance is attributed to terms occurring in the largest possible number of documents where the query keywords appear; (2) proximity, where more importance is assigned to terms having a short distance from the query terms within documents. We also employ the strength Pareto fitness assignment in order to satisfy both criteria simultaneously. The results of our numerical experiments on MEDLINE, the online medical information database, show that the proposed approach significantly enhances the retrieval performance as compared to the baseline. |
| |
Keywords: | information retrieval query expansion pseudorelevance feedback proximity multi-objective optimization Pareto dominance MEDLINE |
本文献已被 SpringerLink 等数据库收录! |
| 点击此处可从《Frontiers of Computer Science》浏览原始摘要信息 |
|
点击此处可从《Frontiers of Computer Science》下载全文 |