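An Ontology-Based Cross-Language Information Retrieval Model
Cite this article: Wang Jin, Chen Enhong, Zhang Zhenya, Wang Xufa. An Ontology-Based Cross-Language Information Retrieval Model [J]. Journal of Chinese Information Processing, 2004, 18(3): 2-1.
Authors: Wang Jin, Chen Enhong, Zhang Zhenya, Wang Xufa
Affiliation: Department of Computer Science and Technology, University of Science and Technology of China
Funding: National Natural Science Foundation of China; Natural Science Foundation of Anhui Province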
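Abstract: As online information grows richer and user needs rise, people are no longer satisfied with searching within a single language, so cross-language information retrieval (CLIR) is receiving increasing attention. This paper therefore proposes a new semantics-based cross-language information retrieval model, Onto-CLIR. Building on traditional information retrieval techniques, the model uses ontologies to describe the corresponding domain knowledge in different languages. This addresses the semantic loss and distortion that arise when converting from the query language to the retrieval language, so that retrieval follows the user's query intent and returns the expected information. Taking sports news retrieval as the setting, the paper uses English queries to retrieve sports news from Sina (新浪网). The results show that the ontology-based cross-language retrieval method improves recall and precision by about 10 percentage points on average, effectively improving retrieval performance.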

Keywords: computer applications; Chinese information processing; ontology; cross-language information retrieval; semantics
Article ID: 1003-0077(2004)03-0001-08

An Ontology-Based Cross Language Information Retrieval Model
WANG Jin, CHEN En-hong, ZHANG Zhen-ya, WANG Xu-fa. An Ontology-Based Cross Language Information Retrieval Model[J]. Journal of Chinese Information Processing, 2004, 18(3): 2-1.
Authors: WANG Jin, CHEN En-hong, ZHANG Zhen-ya, WANG Xu-fa
Affiliation: Department of Computer Science, USTC
Abstract: As online information grows richer and users' needs rise, people are no longer satisfied with retrieval within a single language, so Cross-Language Information Retrieval (CLIR) is attracting more and more attention. One of the core problems of CLIR is how to overcome the communication barriers between different languages. This paper proposes a novel semantics-based CLIR model, Onto-CLIR. Building on traditional information retrieval techniques, the model uses ontologies to describe the corresponding domain knowledge in different languages, which addresses the semantic loss and distortion that arise when translating between the query language and the retrieval language. In this way the model can follow the user's query intention and return the expected results. To validate the approach, we ran experiments that retrieve Chinese sports news from the Sina website with English queries. The results show that the ontology-based CLIR approach raises both recall and precision by more than 10 percent, demonstrating that it effectively improves retrieval performance.
Keywords:computer application  Chinese information processing  ontology  CLIR  semantic