首页 | 官方网站   微博 | 高级检索  
     

基于抽象语义表示的汉语构式标注与分析
引用本文:黄彤,李斌,闫培艺,戴玉玲,曲维光.基于抽象语义表示的汉语构式标注与分析[J].中文信息学报,1986,34(10):1.
作者姓名:黄彤  李斌  闫培艺  戴玉玲  曲维光
作者单位:1.南京师范大学 文学院,江苏 南京 210097;
2.南京师范大学 计算机科学与技术学院,江苏 南京 210023
基金项目:国家社会科学基金(18BYY127);国家自然科学基金(61772278);江苏省高校哲学社会科学优秀创新团队建设项目
摘    要:构式作为组成成分与实际意义不能完全对应的结构,与常规句子差异较大,对句法和语义分析器的影响较大,构式的自动分析则更是困难。因此,亟需研究构式的结构标注方法及构建相应语料库。由于构式的语义结构与句法结构有较大差异,该文使用中文抽象语义表示(CAMR)来直接标注构式的语义结构。目前收录最全的构式库是北京大学现代汉语构式知识库,通过对该构式库1 057条构式进行人工标注并统计后,发现CAMR可以表示出61.2%的基本符合组合原则的构式;而38.8%不符合组合原则的构式需要修改或添加概念,存在缺少概念、组成成分难以拆分、修辞意义难以表示等情况。该文给出的策略是将其整体作为一个谓词标注或只标注其表层义。汉语构式库的标注可以为构式语义的自动分析提供理论与数据基础。

关 键 词:抽象语义表示  构式  形式化表示  构式语料库  中文信息处理  

Abstract Meaning Representation Based Annotationand Analysis of Chinese Construction
HUANG Tong,LI Bin,YAN Peiyi,DAI Yuling,QU Weiguang.Abstract Meaning Representation Based Annotationand Analysis of Chinese Construction[J].Journal of Chinese Information Processing,1986,34(10):1.
Authors:HUANG Tong  LI Bin  YAN Peiyi  DAI Yuling  QU Weiguang
Affiliation:1.School of Chinese Language and Literature, Nanjing Normal University, Nanjing, Jiangsu 210097, China;
2.School of Computer Science and Technology, Nanjing Normal University, Nanjing, Jiangsu 210023, China
Abstract:As a structure without direct correspondence to its literal meaning, the construction is quite different from the regular sentences yet pose a great influence on the accuracy of parser. To facilitate the automatic analysis of construction, it is necessary to build a corpus for construction for the study of its internal structure. In this paper, Abstract Meaning Representation (AMR) is used to annotate the semantic structure of constructions. According to 1,057 construction with annotation, it is found that 61.2% of constructions can be described by the principle of compositionality in Chinese AMR. As for the remaining 38.8% of the constructions beyond the principle of compositionality (lack of concepts, difficult to separate components, and difficult to express rhetorical meaning), this paper proposes to label the whole structure as word or only annotate its surface meaning. The completed Chinese construction corpus provide data for both theoretical study and automatic analysis of the meaning of construction.
Keywords:abstract meaning representation  construction  formal representation  construction corpus  Chinese information processing  
点击此处可从《中文信息学报》浏览原始摘要信息
点击此处可从《中文信息学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号