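Quantitative Identification of Dependency Relations Between Words (词语间依存关系的定量识别)
Cite this article: WANG Jian-hui, WANG Lei, HU Yun-fa. Quantitative Identification of Dependency Relations Between Words[J]. Journal of Chinese Information Processing (中文信息学报), 2005, 19(4): 32-39.
Authors: WANG Jian-hui (王建会), WANG Lei (王雷), HU Yun-fa (胡运发)
Affiliation: Department of Computing and Information Technology, Fudan University, Shanghai 200433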
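Abstract: This paper extends and improves existing algorithms for the quantitative identification of dependency relations between words, taking full account of the probability distribution of terms. It explicitly distinguishes collocation, coordinate, and subordinate relations between terms and, based on their different characteristics, proposes a separate identification algorithm for each. It proposes a string matching model. Taking full account of the discrete distribution of the relative positions of two terms, the effect of the distance between them, and their probability distribution characteristics, it proposes a model of dependency strength between terms and uses that model to build a dependency relation tree between words. It also proposes an update strategy that prunes an already constructed dependency relation tree and mines latent dependency relations. Experimental results from applications show that the proposed algorithms can effectively identify dependency relations between words.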

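Keywords: computer applications; Chinese information processing; word collocation; dependency relations; quantitative identification
Article ID: 1003-0077(2005)04-0031-08
Revised manuscript received: June 21, 2004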

To Identify the Dependent Relationship Between Words Quantificationally
WANG Jian-hui, WANG Lei, HU Yun-fa. To Identify the Dependent Relationship Between Words Quantificationally[J]. Journal of Chinese Information Processing, 2005, 19(4): 32-39.
Authors: WANG Jian-hui, WANG Lei, HU Yun-fa
Affiliation: The Department of Computing and Information Technology, Fudan University, Shanghai 200433, China
Abstract: To identify dependency relations between words statistically, efficiently, and accurately, this paper addresses several shortcomings of existing algorithms. It makes full use of the distributional characteristics of words; distinguishes the collocation, coordinate, and affiliation (subordinate) relations between words and identifies each with a different strategy; presents a new string matching model and a new model of dependency strength between words; constructs a dependency relation tree; prunes the constructed tree; and identifies latent dependency relations. Experiments confirm that the new algorithm identifies dependency relations between words accurately.
Keywords: computer application; Chinese information processing; collocation; dependent relationship; quantificational identification
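
The abstract names a dependency strength model that accounts for the relative position of two terms, the distance between them, and their probability distributions, but it gives no formulas. The sketch below is therefore not the authors' model. It is a minimal illustration that swaps in a well-known stand-in, a distance-weighted pointwise mutual information (PMI) score over ordered word pairs. The function name `association_strength`, the window parameter `max_distance`, and the 1/d distance decay are all assumptions made for this example.

```python
import math
from collections import Counter


def association_strength(sentences, max_distance=5):
    """Score ordered word pairs with a distance-weighted PMI-style measure.

    Illustrative assumption only, not the model from the paper:
    each co-occurrence of (left, right) at distance d contributes 1/d,
    and the weighted pair probability is compared with the product of
    the two unigram probabilities.
    """
    word_counts = Counter()
    pair_weights = Counter()
    total_tokens = 0

    for tokens in sentences:
        total_tokens += len(tokens)
        word_counts.update(tokens)
        for i, left in enumerate(tokens):
            # Only look right of position i, so pair order is preserved.
            upper = min(i + 1 + max_distance, len(tokens))
            for j in range(i + 1, upper):
                pair_weights[(left, tokens[j])] += 1.0 / (j - i)

    total_weight = sum(pair_weights.values())
    scores = {}
    for (left, right), weight in pair_weights.items():
        p_pair = weight / total_weight
        p_left = word_counts[left] / total_tokens
        p_right = word_counts[right] / total_tokens
        scores[(left, right)] = math.log(p_pair / (p_left * p_right))
    return scores


if __name__ == "__main__":
    # Toy, hand-made corpus of pre-segmented sentences (hypothetical data).
    corpus = [
        ["strong", "tea"],
        ["strong", "tea"],
        ["strong", "coffee"],
        ["weak", "coffee"],
    ]
    for pair, score in sorted(association_strength(corpus).items()):
        print(pair, round(score, 3))
```

Keeping pairs ordered means the score can differ for (a, b) and (b, a), which is one simple way to reflect relative position. A real system for Chinese text would also need word segmentation before this step, which the sketch assumes has already been done.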
This article is indexed in CNKI, VIP (维普), Wanfang Data (万方数据), and other databases.