首页 | 官方网站   微博 | 高级检索  
     

归纳学习与规则结合的分词方法的有效性考察
引用本文:王忠建,王悦.归纳学习与规则结合的分词方法的有效性考察[J].哈尔滨师范大学自然科学学报,2010,26(1):40-43.
作者姓名:王忠建  王悦
作者单位:哈尔滨商业大学
基金项目:2007人事部留学人员科技活动项目,黑龙江省自然科学基金 
摘    要:随着互联网的普及和网上电子文本信息的爆炸式的增加,自然语言处理技术面向动态的、变化的文本显得越来越必要.针对无切分语言的分词处理的主要难点是切分歧义和未知词的处理.基于归纳学习的分词方法,仅利用文本的表层信息,因此具有完全不依赖于某特定语言的优点.通过引入包含上下文信息的消歧处理规则,对基于归纳学习的分词方法进行改进.以归纳学习方法对未知词进行推测,抽出的规则用于歧义切分的消歧处理,提高了对切分歧义的处理精度.通过实验对规则的有效性进行了考察,并给出了改进方法的分词效果.

关 键 词:自然语言处理  分词  归纳学习  规则

Evaluation of Word Segmentation Method Based on Inductive Learning and Rules
Wang Zhongjian,Wang Yue.Evaluation of Word Segmentation Method Based on Inductive Learning and Rules[J].Natural Science Journal of Harbin Normal University,2010,26(1):40-43.
Authors:Wang Zhongjian  Wang Yue
Affiliation:(Harbin University of Commerce)
Abstract:With the development of the Internet and increasing of on-line electronic text,it is necessary that natural language processing technology could deal with those dynamic,open texts.The difficult problem of word segmentation is processing of segmentation ambiguity and identifying of unknown words.The method based on Inductive Learning use only surface information of a text,so that it has an advantage that is entirely not dependent on any specific language.The Inductive Learning method is improved by using segmentation rules that contain information of context.The method predicts unknown words with Inductive Learning and process segment ambiguity by rules,segmentation rate is improved,usefulness of rules is evaluated and results of word segmentation are demonstrated.
Keywords:Natural Language Process  Word Segmentation  Inductive Learning  Rules
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号