Construction of a Dependency Treebank of Learner Chinese
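Cite this article:SHI Jialu, LUO Xinyu, YANG Liner, XIAO Dan, HU Zhengsheng, WANG Yijun, YUAN Jiaxin, YU Jingsi, YANG Erhong. Construction of a Dependency Treebank of Learner Chinese [J]. Journal of Chinese Information Processing, 2022, 36(1): 39-46.
Authors:SHI Jialu  LUO Xinyu  YANG Liner  XIAO Dan  HU Zhengsheng  WANG Yijun  YUAN Jiaxin  YU Jingsi  YANG Erhong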
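Affiliation:1.National Language Monitoring and Research Center (CNLR) Print Media Language Branch, Beijing Language and Culture University, Beijing 100083;
2.School of Information Science, Beijing Language and Culture University, Beijing 100083;
3.Advanced Innovation Center for Language Resources, Beijing Language and Culture University, Beijing 100083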
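Funding:Projects of the State Language Commission (ZDI135-131, ZDI145-24); Project of the Advanced Innovation Center for Language Resources, Beijing Language and Culture University (TYZ19005); Graduate Innovation Fund of Beijing Language and Culture University (Fundamental Research Funds for the Central Universities) (20YCX141)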
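Abstract:A dependency treebank of learner Chinese provides dependency parses for text written by non-native speakers. It is valuable for second-language teaching and research, and for related work such as syntactic parsing of second-language text and grammatical error correction. However, few dependency treebanks of learner Chinese exist, and those that do still have problems in their annotation. To address this, this paper proposes a dependency annotation guideline, builds an online annotation platform, and carries out dependency annotation of learner Chinese. It focuses on data selection and the annotation workflow, and analyzes the quality of the annotation results to explore how second-language errors affect annotation quality and syntactic parsing.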

Keywords:Chinese learners  dependency treebank  corpus annotation  error analysis  dependency parsing

Construction of a Treebank of Learners Chinese
SHI Jialu, LUO Xinyu, YANG Liner, XIAO Dan, HU Zhengsheng, WANG Yijun, YUAN Jiaxin, YU Jingsi, YANG Erhong. Construction of a Treebank of Learners Chinese [J]. Journal of Chinese Information Processing, 2022, 36(1): 39-46.
Authors:SHI Jialu  LUO Xinyu  YANG Liner  XIAO Dan  HU Zhengsheng  WANG Yijun  YUAN Jiaxin  YU Jingsi  YANG Erhong
Affiliation:1.National Language Monitoring and Research Center (CNLR) Print Media Language Branch, Beijing Language and Culture University, Beijing 100083, China;
2.School of Information Science, Beijing Language and Culture University, Beijing 100083, China;
3.Advanced Innovation Center for Language Resources, Beijing Language and Culture University, Beijing 100083, China
Abstract:A dependency treebank of learner Chinese provides dependency parses for non-native sentences. Such a resource can support teaching and research on Chinese as a second language, as well as related research such as syntactic analysis of learner language and grammatical error correction. However, few dependency treebanks of learner Chinese are available, and their annotation guidelines still have some problems. In this paper, we develop an annotation guideline, establish an online annotation platform, and build the Treebank of Learner Chinese. We also describe data selection and the annotation workflow, evaluate annotation quality, and explore the impact of learner errors on annotation quality and syntactic analysis.
Keywords:Chinese learners  dependency treebanks  data annotation  error analysis  dependency analysis  