首页 | 官方网站   微博 | 高级检索  
     

规则与统计相结合的兼类词处理机制
引用本文:黄德根,张丽静,张艳丽,杨元生. 规则与统计相结合的兼类词处理机制[J]. 小型微型计算机系统, 2003, 24(7): 1252-1255
作者姓名:黄德根  张丽静  张艳丽  杨元生
作者单位:大连理工大学,计算机系,辽宁,大连,116023
基金项目:国家自然科学基金 ( 60 14 3 0 0 2 )资助
摘    要:兼类词处理是词性标注的关键所在,本文对兼类词排岐进行了研究,介绍了规则和统计相结合的排岐策略.按照上述策略,实现了一个兼类词处理系统.实验测试结果表明,利用规则与统计相结合的兼类词处理机制可以有效地提高排岐正确率和词性标注正确率,在封闭测试和开放测试中兼类词的排歧正确率分别达到了93.91%和91.16%,标注正确率分别达到了97.85%和96.71%.

关 键 词:词性标注 兼类词 规则 n-元模型
文章编号:1000-1220(2003)07-1252-04

Disambiguation Mechanism Using Rule Techniques and Statistics Techniques
HUANG De gen,ZHANG Li jing,ZHANG Yan li,YANG Yuan sheng. Disambiguation Mechanism Using Rule Techniques and Statistics Techniques[J]. Mini-micro Systems, 2003, 24(7): 1252-1255
Authors:HUANG De gen  ZHANG Li jing  ZHANG Yan li  YANG Yuan sheng
Abstract:Syntactic category disambiguation is the key to part of speech tagging .In this paper, westudy the syntactic category disambiguation and introduce the disambiguation strategy using rule techniques and statistics techniques. With the above method, a system of disambiguation is materialized. The experimental results show the tagging accuracy is raised by using rule techniques and statistics techniques .The disambiguation accuracy of close test and open test is 93.91% and 91.16% respectively, and the overall accuracy is 97.85% and 96.71% respectively.
Keywords:Part of speech tagging  syntactic category  rule  N gram
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号