
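Research on the Dynamic Extending of Story in Story Link Detection
Cite this article: ZHANG Xiao-yan, WANG Ting. Research on the Dynamic Extending of Story in Story Link Detection[J]. Computer Science, 2009, 36(11): 200-203.
Authors: ZHANG Xiao-yan, WANG Ting
Affiliation: School of Computer, National University of Defense Technology, Changsha 410073
Funding: National Natural Science Foundation of China; New Century Excellent Talents Support Program
Abstract: Story link detection determines, for each pair in a stream of news story pairs, whether the two stories describe the same topic. To address the short length of stories, severe data sparseness, and drift in story content, a dynamic information extending technique is proposed to improve the story representation model. The technique extends the current story with the most recent previous topic-related story, dynamically updating the original model. The refinement of the extending information is also studied: the weights of some important features are selectively increased to reduce the impact of noise introduced during extending. Experiments on the Chinese corpus of TDT4 show that dynamic information extending improves story link detection performance by a considerable margin, and that the refinement applied to several kinds of features also has a considerable effect on performance.

Keywords: story link detection; dynamic information extending; story model
Received: 2008/12/8
Revised: 2009/2/25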

Research on the Dynamic Extending of Story in Story Link Detection
ZHANG Xiao-yan, WANG Ting. Research on the Dynamic Extending of Story in Story Link Detection[J]. Computer Science, 2009, 36(11): 200-203.
Authors: ZHANG Xiao-yan, WANG Ting
Affiliation: Department of Computer, National University of Defense Technology, Changsha 410073, China
Abstract: Story link detection determines whether two stories are about the same topic. To overcome the short length of stories, data sparseness, and drift in story content, this paper proposes a dynamic information extending technique that improves the story representation model by extending the current story with its latest previous topic-related story. The refinement of the extending information is also studied; it aims to reduce the influence of noise introduced during extending by increasing the weights of some important features in the extending story. The method was evaluated for story link detection on the TDT4 Chinese corpus. The experimental results indicate that both dynamic extending and the refinement of the extending information markedly affect the performance of story link detection systems.
Keywords: Topic detection and tracking; Dynamic information extending; Story representation model
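
The abstract gives no formulas, so the following Python is only a rough sketch of the idea it describes: extend the current story with the most recent earlier story judged topic-related, boost the weights of selected features, and then compare the extended stories. The term-frequency vectors, the cosine measure, and every name and parameter here (`extend`, `link_threshold`, `mix`, `boost`, `important`) are illustrative assumptions, not the authors' model or settings.

```python
import math
from collections import Counter


def vectorize(tokens):
    """Build a plain term-frequency vector (assumed representation)."""
    return {term: float(count) for term, count in Counter(tokens).items()}


def cosine(a, b):
    """Cosine similarity between two sparse term-weight dicts."""
    dot = sum(w * b.get(term, 0.0) for term, w in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def extend(story, history, link_threshold=0.2, mix=0.5, boost=2.0,
           important=frozenset()):
    """Extend `story` with the newest earlier story that looks topic-related.

    `history` is ordered oldest to newest. All parameters are hypothetical:
    `mix` scales the borrowed weights, and `boost` further scales terms in
    `important` as a stand-in for the refinement step.
    """
    for past in reversed(history):  # newest first
        if cosine(story, past) >= link_threshold:
            extended = dict(story)
            for term, w in past.items():
                factor = boost if term in important else 1.0
                extended[term] = extended.get(term, 0.0) + mix * factor * w
            return extended
    return dict(story)  # nothing related found: leave the story unchanged


def linked(story_a, story_b, history, decision_threshold=0.3, **kwargs):
    """Decide whether two stories share a topic after extending both."""
    ext_a = extend(story_a, history, **kwargs)
    ext_b = extend(story_b, history, **kwargs)
    return cosine(ext_a, ext_b) >= decision_threshold
```

Two short stories on the same event can share no terms at all, which gives a raw similarity of zero; borrowing terms from an earlier related story gives them common vocabulary to match on. In a streaming setting, each processed story would be appended to `history` so that later stories are extended with the newest related content.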
This article is indexed in Wanfang Data and other databases.