首页 | 官方网站   微博 | 高级检索  
     

一种短语结构规则的自动获取方法
引用本文:朱靖波,张玥杰,姚天顺.一种短语结构规则的自动获取方法[J].计算机研究与发展,1999,36(5):601-607.
作者姓名:朱靖波  张玥杰  姚天顺
作者单位:东北大学信息科学与工程学院计算机科学系
基金项目:国家自然科学基金,国家教委博士点基金
摘    要:文中提出一种新的知识获取方法,即从完全没有任何标注的生语料库中,采用NA假设自动构造带标训练数据,利用基于多特征的相似评估技术自动获取名词短语结构规则,该方法具有两个特点:(1)由于从没有任何标注的生语料库中自动获取带标训练数据,促使带标数据规模可以很大,且容易构造不同领域的带标语料库;(2)所获取的短语结构规则具有概率属性,可用于分类检索等应用中的名词短语抽取,为论证方法有效性,采用美国Beri

关 键 词:短语结构规则  自然语言处理  自动获取

A NEW APPROACH TO PHRASE STRUCTURE RULE ACQUISITION
ZHU Jing\|Bo,ZHANG Yue\|Jie,and YAO Tian\|Shun.A NEW APPROACH TO PHRASE STRUCTURE RULE ACQUISITION[J].Journal of Computer Research and Development,1999,36(5):601-607.
Authors:ZHU Jing\|Bo  ZHANG Yue\|Jie  and YAO Tian\|Shun
Abstract:Here presented is a new approach to NP phrase structure rule acquisition based on multi\|feature similarity estimation from corpora without bracketed and nonterminal labels. By computing the distance between a rule and all feature rules based on their local contextual information, the system could sort all rules by their distances. The smaller the distance, the larger the similarity. Experiments using Berlitz corpus show that the approach presented achieves a relatively high accuracy: 80% in the first 50 rules. This result demonstrates that training data acquisition based on NA assumption is effective for rule acquisition and parsing.
Keywords:noun phrase structure rule  distance function  multifeaturebased similarity estimation  
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号