首页 | 官方网站   微博 | 高级检索  
     

一个基于语境框架的文本特征提取算法
引用本文:晋耀红,苗传江.一个基于语境框架的文本特征提取算法[J].计算机研究与发展,2004,41(4):582-586.
作者姓名:晋耀红  苗传江
作者单位:1. 中国科学院声学研究所,北京,100080
2. 北京语言大学语言信息处理研究所,北京,100083
基金项目:国家“九七三”重点基础研究发展规划基金项目 (G19980 3 0 5 0 6)
摘    要:介绍了一种新的文本语义形式化模型——语境框架。语境框架是一个三维的语义描述,它把文本内容抽象成领域(静态范畴)、情景(动态描述)、背景(褒贬、参照等)3个框架。在语境框架的基础上,设计实现了文本特征提取算法。算法从语义入手,实现了4元组表示的领域提取算法、以领域句类为核心的情景提取算法和以对象语义立场网络图为基础的褒贬判断。算法可以有效地处理语言中的褒贬倾向、同义、多义等现象,实际应用中表明具有很好的信息抽取能力。

关 键 词:文本特征提取  语境框架模型  领域  情景  背景  领域句类  对象语义立场网络  褒贬

An Algorithm of Extracting Text Character Based on a Model of Context Framework
JIN Yao Hong,and MIAO Chuan Jiang.An Algorithm of Extracting Text Character Based on a Model of Context Framework[J].Journal of Computer Research and Development,2004,41(4):582-586.
Authors:JIN Yao Hong  and MIAO Chuan Jiang
Affiliation:JIN Yao Hong 1 and MIAO Chuan Jiang 2 1
Abstract:A model of semantic based text formalization, the context framework model(CFM) is presented in this paper, which is three coordinate and describes the text as domain, situation and background Based on the context framework, a text character extracting algorithm is developed The algorithm includes domain extracting which uses 4 element array, situation extracting which is triggered by domain sentence category, and background extracting which focuses on the confusion of the commendatory and derogatory based on object semantic stand net As a result, the CFM is a very good model for text retrieval, and the algorithm can remarkably improve the efficiency of text retrieval
Keywords:text character extracting  context framework model  domain  situation  background  domain sentence category  object semantic stand net  commendatory and derogatory
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号