A New Approach to Phrase Structure Rule Acquisition


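Cite this article:	ZHU Jing-Bo, ZHANG Yue-Jie, YAO Tian-Shun. A New Approach to Phrase Structure Rule Acquisition [J]. Journal of Computer Research and Development (计算机研究与发展), 1999, 36(5): 601-607.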

Authors:	ZHU Jing-Bo, ZHANG Yue-Jie, YAO Tian-Shun

Affiliation:	Department of Computer Science, School of Information Science and Engineering, Northeastern University

Funding:	National Natural Science Foundation of China; Doctoral Program Foundation of the State Education Commission

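Abstract:	This paper proposes a new knowledge acquisition method. Starting from a raw corpus with no annotation at all, it uses the NA assumption to construct labeled training data automatically, then applies multi-feature similarity estimation to acquire noun phrase structure rules automatically. The method has two characteristics: (1) because the labeled training data are derived automatically from a completely unannotated raw corpus, the labeled data can be very large, and labeled corpora for different domains are easy to build; (2) the acquired phrase structure rules carry probabilities and can be used for noun phrase extraction in applications such as classification and retrieval. To demonstrate the method's effectiveness, the paper uses the American Beri…
Keywords:	phrase structure rule; natural language processing; automatic acquisition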
A NEW APPROACH TO PHRASE STRUCTURE RULE ACQUISITION

ZHU Jing-Bo, ZHANG Yue-Jie, and YAO Tian-Shun. A NEW APPROACH TO PHRASE STRUCTURE RULE ACQUISITION [J]. Journal of Computer Research and Development, 1999, 36(5): 601-607.

Authors:	ZHU Jing-Bo, ZHANG Yue-Jie, and YAO Tian-Shun

Abstract:	This paper presents a new approach to acquiring noun phrase (NP) structure rules from corpora that carry neither bracketing nor nonterminal labels, based on multi-feature similarity estimation. The system computes the distance between each candidate rule and all feature rules from their local contextual information, then sorts the candidates by distance: the smaller the distance, the greater the similarity. Experiments on the Berlitz corpus show that the approach achieves relatively high accuracy, 80% among the first 50 rules. This result demonstrates that training data acquisition based on the NA assumption is effective for rule acquisition and parsing.

Keywords:	noun phrase structure rule; distance function; multi-feature-based similarity estimation
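
The ranking step described in the abstract (measure each candidate rule's distance to the feature rules from local context, then sort ascending) can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and not taken from the paper: the context representation (left and right neighbouring tags), the Euclidean distance, the averaging over feature rules, and the names `context_profile`, `distance`, and `rank_rules`. The paper's actual distance function and features are not given in this record.

```python
from collections import Counter
from math import sqrt


def context_profile(occurrences):
    """Turn (left_tag, right_tag) contexts into relative frequencies.

    Hypothetical feature set: only the immediately neighbouring tags.
    """
    counts = Counter()
    for left, right in occurrences:
        counts[("L", left)] += 1
        counts[("R", right)] += 1
    total = sum(counts.values())
    return {feature: n / total for feature, n in counts.items()}


def distance(p, q):
    """Euclidean distance between two sparse profiles (an assumed choice)."""
    keys = set(p) | set(q)
    return sqrt(sum((p.get(k, 0.0) - q.get(k, 0.0)) ** 2 for k in keys))


def rank_rules(candidates, feature_rules):
    """Sort candidate rules by mean distance to the feature rules.

    Smaller distance means greater similarity, so the best candidates
    come first.
    """
    feature_profiles = [context_profile(ctx) for ctx in feature_rules.values()]
    scored = []
    for rule, ctx in candidates.items():
        profile = context_profile(ctx)
        mean_dist = sum(distance(profile, f) for f in feature_profiles) / len(
            feature_profiles
        )
        scored.append((mean_dist, rule))
    return [rule for _, rule in sorted(scored)]


# Toy data (invented): each rule maps to the contexts in which it was seen.
feature_rules = {"DT NN": [("VB", "IN"), ("IN", "VB"), ("VB", ".")]}
candidates = {
    "DT JJ NN": [("VB", "IN"), ("IN", "VB")],
    "VB RB": [("NN", "."), ("NN", "DT")],
}

ranked = rank_rules(candidates, feature_rules)
print(ranked)
```

In this toy run the candidate whose contexts resemble those of the known noun phrase rule is ranked first; a real system would take the top-ranked rules as acquired NP rules, as the abstract's "first 50 rules" evaluation suggests.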
This article is indexed in CNKI, VIP, Wanfang Data, and other databases.
