首页 | 官方网站   微博 | 高级检索  
     

一种基于概率上下文无关文法的汉语句法分析
引用本文:林颖,史晓东,郭锋.一种基于概率上下文无关文法的汉语句法分析[J].中文信息学报,2006,20(2):3-7,32.
作者姓名:林颖  史晓东  郭锋
作者单位:厦门大学计算机系
基金项目:国家高科技研究发展计划(863)资助项目(2002AA117010)
摘    要:本文研究了PCFG独立性假设的局限性,并针对这一局限性提出了句法结构共现的概念以引入上下文信息,给出了计算方法;为了打破中文树库规模过小的局限性,对于句法规则参数的获取,本文利用Inside-Outside算法进行迭代,最后提出了一个基于统计模型的自顶向下的汉语句法分析器。在封闭测试下,其标记精确率和标记召回率分别为88.1%和86.8%。实验结果表明,这种方法确实能够提高标记的精确率和召回率,值得深入研究。

关 键 词:人工智能  自然语言处理  统计句法分析  概率上下文无关文法  汉语自动分析  
文章编号:1003-0077(2006)02-0001-07
收稿时间:2005-03-13
修稿时间:2005-03-132005-07-11

A Chinese Parser Based on Probabilistic Context Free Grammar
LIN Ying,SHI Xiao-dong,GUO Feng.A Chinese Parser Based on Probabilistic Context Free Grammar[J].Journal of Chinese Information Processing,2006,20(2):3-7,32.
Authors:LIN Ying  SHI Xiao-dong  GUO Feng
Affiliation:Computer Science Department of Xiamen University
Abstract:This paper studies the limitations of probabilistic context free grammar,and proposes a concept of co-occurrence in syntax structure so as to use the context information.To address the limitation of the Chinese Treebank's small scale,an Inside-Outside algorithm to obtain the parameters of syntactic rules is given.At last,we present a probabilistic top-down Chinese parser.In the closed test,we get the result that label precision and label recall are 88.1% and 86.8%, showing that this method has potential to get a better performance in parsing and deserves further research.
Keywords:artificial intelligence  natural language processing  statistical paring  probabilistic context-free grammar  Chinese NLP
本文献已被 CNKI 维普 万方数据 等数据库收录!
点击此处可从《中文信息学报》浏览原始摘要信息
点击此处可从《中文信息学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号