首页 | 官方网站   微博 | 高级检索  
     

基于时空分析的线索性事件的抽取与集成系统研究
引用本文:吴平博,陈群秀,马亮.基于时空分析的线索性事件的抽取与集成系统研究[J].中文信息学报,2006,20(1):23-30.
作者姓名:吴平博  陈群秀  马亮
作者单位:智能技术与系统国家重点实验室,清华大学计算机科学与技术系
摘    要:信息抽取技术能够提供高质量的检索服务。本文面向网络新闻事件,对人们感兴趣的事件关键信息进行了抽取和集成。系统中采用了如下的方法、策略: (1) 利用句型模板构造抽取规则,然后直接从经过时间短语和空间短语识别和规范化处理的文本中抽取事件信息,从而跳过了深层句法分析,降低了实现系统的难度; (2) 利用事件的规范化的时空信息关联不同文档中的同一事件,进行事件合并; (3) 文档发生事件转移时对文档进行事件切分,从而解决了文档内不同事件信息的归并问题。初步实验结果表明:本文采用的方法和策略是有效的。

关 键 词:计算机应用  中文信息处理  信息抽取  句型模板  线索性事件  时空信息  事件合并  
文章编号:1003-0077(2006)01-0021-08
收稿时间:2005-05-15
修稿时间:2005-10-25

Research on Extraction and Integration of Developing Event Based on Analysis of Space-time Information
WU Ping-bo,CHEN Qun-xiu,MA Liang.Research on Extraction and Integration of Developing Event Based on Analysis of Space-time Information[J].Journal of Chinese Information Processing,2006,20(1):23-30.
Authors:WU Ping-bo  CHEN Qun-xiu  MA Liang
Affiliation:The State Key Laboratory of Intelligent Technology and System , Department of Computer Science and Technology , Tsinghua University
Abstract:Technology of information extraction(IE) can provide high-quality service for retrieval.Targeting at events in web news,this paper conducts a system that can extract and integrate key information of event that interests people.Methodologies and strategies of the system are as follows:(1) Extraction rules are built in terms of sentence patterns,then event information is directly extracted from the text in which temporal phrases(TP) and space phrases(SP) are recognized and normalized.The extraction system can thus be easily implemented owing to skipping complex syntax parsing.(2) The same event in different documents is linked by normalized TP and SP of event,and the information associated with an event is merged.(3) When new event appears in a text,the text is segmented.So isolative information for an event in same segment can be merged into its owner.Preliminary experiments show that methodologies and strategies in this paper are feasible.
Keywords:computer application  Chinese information processing  information extraction  sentence pattern  developing event  space-time information  event merge
本文献已被 CNKI 维普 万方数据 等数据库收录!
点击此处可从《中文信息学报》浏览原始摘要信息
点击此处可从《中文信息学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号