首页 | 官方网站   微博 | 高级检索  
     

一种改进Viterbi算法的应用研究
引用本文:李荣,郑家恒.一种改进Viterbi算法的应用研究[J].计算机工程与设计,2007,28(3):530-531,571.
作者姓名:李荣  郑家恒
作者单位:1. 忻州师范学院,计算机系,山西,忻州,034000
2. 山西大学,计算机与信息技术学院,山西,太原,030006
基金项目:山西省忻州师范学院科研基金
摘    要:为降低现代汉语句法分析的难度,以北大和哈工大语料为基础,利用改进的Viterbi算法对汉语真实文本进行了短语识别研究.提出了在隐马尔可夫模型(HMM)框架下,训练阶段依据统计概率信息,以极大似然法获取HMM参数,识别阶段用一种改进的Viterbi算法进行动态规划,识别同层短语;在此基础上,运用逐层扫描算法和改进Viterbi算法相结合的方法来识别汉语嵌套短语.实验结果表明,识别正确率在封闭测试中可达93.52%,在开放测试中达到77.529%,证明该算法对短语识别问题具有良好的适应性和实用性.

关 键 词:隐马尔可夫模型  Viterbi算法  层次分析  短语识别  句法分析  改进  Viterbi  扫描算法  应用  研究  viterbi  algorithm  improved  study  适应性  识别问题  测试  封闭  识别正确率  结果  实验  嵌套  方法  结合  运用  短语
文章编号:1000-7024(2007)03-0530-02
修稿时间:2006-01-09

Application study of improved viterbi algorithm
LI Rong,ZHENG Jia-heng.Application study of improved viterbi algorithm[J].Computer Engineering and Design,2007,28(3):530-531,571.
Authors:LI Rong  ZHENG Jia-heng
Abstract:To decrease the difficulty of syntax parsing,an improved Viterbi algorithm to recognize phrases in Chinese texts based on the corpus from Peking university and Harbin institute of technology is adopted.An efficient scheme for Chinese phrase recognition is pro-posed in the framework of hidden Markov model.In the tagging system,statistics probability information and maximum likelihood es-timation are used to get HMM parameters for training phase.An improved Viterbi algorithm for dynamic programming is presented to identify thesame hierarchy phrase for identifyingphase.Then the combination method of hierarchical syntax parsing and Viterbi algorithm is brought forward to identify those recursive phrases.The experimental results show that the precision rates of the phrase recognition in the closed test and the open test are 93.52 % and 77.529 % respectively,which proves that the algorithm has a better adaptability and practicability for phrase identification.
Keywords:hidden markov model  viterbi algorithm  hierarchical analysis  phrase recognition  syntax parsing
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号