首页 | 官方网站   微博 | 高级检索  
     

基于《知网》的中文信息结构消歧研究
引用本文:张瑞霞,庄晋林,杨国增.基于《知网》的中文信息结构消歧研究[J].中文信息学报,2012,26(4):43-50.
作者姓名:张瑞霞  庄晋林  杨国增
作者单位:1. 华北水利水电学院 信息工程学院,河南 郑州 450011;2.郑州师范学院 数学系,河南 郑州 450044
基金项目:河南省科技厅基础研究项目
摘    要:《中文信息结构库》是《知网》的重要组成部分之一,可以作为中文语义分析的规则库,对其进行消歧是实际应用的基础之一。因此,该文首先对中文信息结构进行了形式化描述;接着对其进行优先级划分;然后根据其构成形式提出了四种不同的消歧方法 即词性序列消歧法、图相容匹配消歧法、图相容度计算消歧法、基于实例的语义相似度计算消歧法;最后针对不同优先级的中文信息结构集设计了不同消歧流程。实验结果证明消歧正确率达到了90% 以上。

关 键 词:知网  中文信息结构  消歧  图相容度  语义相似度  

Chinese Message Structures Disambiguation Based on HowNet
ZHANG Ruixia , ZHUANG Jinlin , YANG Guozeng.Chinese Message Structures Disambiguation Based on HowNet[J].Journal of Chinese Information Processing,2012,26(4):43-50.
Authors:ZHANG Ruixia  ZHUANG Jinlin  YANG Guozeng
Affiliation:1.Department of Information Engineering, North China University of Water Conservancy and Electric Power,
Zhengzhou,Henan 450011, China;
2.Department of Mathematics, Zhengzhou Teachers College, Zhengzhou,Henan 450044, China
Abstract:The Chinese Message Structure Database, as an important component in HowNet, can be treated as a rule base for Chinese semantic analysis. The disambiguation of Chinese message structures is the first step in bring the base into practical application. In this paper, the Chinese message structures are firstly formalized and then divided into different priority levels. Afterwards,, four diverse disambiguation approaches are proposed, including the syntax list judgment, the graph compatibility matching, the graph compatibility computation and the semantic similarity computation based on examples. Finally, different disambiguation processes are designed according to the different priority levels. Experimental results prove the accuracy rate of the disambiguation yields more than 90%.
Key wordsHowNet; Chinese message structure; disambiguation; graph compatibility; semantic similarity
Keywords:HowNet  Chinese message structure  disambiguation  graph compatibility  semantic similarity  
本文献已被 CNKI 万方数据 等数据库收录!
点击此处可从《中文信息学报》浏览原始摘要信息
点击此处可从《中文信息学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号