首页 | 官方网站   微博 | 高级检索  
     

基于组块分析的汉语块依存语法
引用本文:钱青青,王诚文,王贵荣,饶高琦,荀恩东.基于组块分析的汉语块依存语法[J].中文信息学报,2022,36(8):20-28.
作者姓名:钱青青  王诚文  王贵荣  饶高琦  荀恩东
作者单位:1.北京语言大学 信息科学学院,北京100083;
2.北京大学 计算语言学教育部重点实验室,北京100871;
3.北京语言大学 汉语国际教育研究院,北京100083
基金项目:国家自然科学基金(62076038)
摘    要:该文提出汉语的块依存语法,以谓词为核心,以组块为研究对象,在句内和句间寻找谓词所支配的组块,构建句群级别的句法分析框架。这一操作可提升叶子节点的语言单位,并针对汉语语义特点进行了分析方式和分析规则上的创新,能够较好地解决微观层次的逻辑结构知识,并为中观论元知识和宏观篇章知识打好基础。该文主要介绍了块依存语法理念、表示、分析方法及特点,并简要介绍了块依存树库的构建情况。截至2020年8月,树库规模为187万字符(4万复句、10万小句),其中包含67%新闻文本和32%百科文本。

关 键 词:组块  依存  依存语法  谓词  

Chinese Chunk-Based Dependency Grammar
QIAN Qingqing,WANG Chengwen,WANG Guirong,RAO Gaoqi,XUN Endong.Chinese Chunk-Based Dependency Grammar[J].Journal of Chinese Information Processing,2022,36(8):20-28.
Authors:QIAN Qingqing  WANG Chengwen  WANG Guirong  RAO Gaoqi  XUN Endong
Affiliation:1.School of Information Science, Beijing Language and Culture University, Beijing 100083, China;
2.MOE Key Loboratory of Computational Linguistics, Peking University, Beijing 100871, China;
3.Research Institute of International Chinese Language Education, Beijing Language and Culture University, Beijing 100083, China
Abstract:This paper proposes a Chinese chunk-based dependency grammar (CCDG), which is focused on the chunks governed by the predicates within and between sentences. As an effort in establishing a syntactic analysis framework at the level of sentence group, the CCDG propose a novel idea to enlarge the linguistic granularity of leaf nodes. It can solve the logical structure knowledge at the micro level and pave a foundation for the meso argument knowledge and macro textual knowledge. This paper presents the concept, representation, analysis method and characteristics of CCDG, as well as the development of corresponding tree-bank. By August, 2020, the treebank was scaled up to 1.87 million tokens (including 40,000 complex sentences and 100,000 sub-sentences), consisting of 67% news texts and 32% encyclopedia texts.
Keywords:chunk  dependency  dependency grammar  predicate  
点击此处可从《中文信息学报》浏览原始摘要信息
点击此处可从《中文信息学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号