首页 | 官方网站   微博 | 高级检索  
     

基于多层次语义特征的英文作文自动评分方法
引用本文:周险兵,樊小超,任鸽,杨勇.基于多层次语义特征的英文作文自动评分方法[J].计算机应用,2021,41(8):2205-2211.
作者姓名:周险兵  樊小超  任鸽  杨勇
作者单位:1. 新疆师范大学 计算机科学技术学院, 乌鲁木齐 830054;2. 大连理工大学 计算机科学与技术学院, 辽宁 大连 116024
基金项目:国家自然科学基金资助项目(62066044);新疆维吾尔自治区高等学校科研计划项目(XJEDU2016S066)。
摘    要:作文自动评分(AES)技术能够自动地对作文进行分析和评分,其已成为自然语言处理技术在教育领域应用的热点研究问题之一。针对目前AES方法割裂了深层和浅层语义特征,忽视了多层次语义融合对作文评分影响的问题,提出了一种基于多层次语义特征的神经网络(MLSF)模型进行AES。首先,采用卷积神经网络(CNN)捕获局部语义特征,并采用混合神经网络捕获全局语义特征,以从深层次获取作文的语义特征;其次,利用篇章级的作文主题向量来获取主题层特征,同时针对深度学习模型难以挖掘的语法错误和语言丰富程度特征,构造了少量人工特征以从浅层获取作文的语言学特征;最后,通过特征融合对作文进行自动评分。实验结果表明,所提出模型在Kaggle ASAP竞赛公开数据集的所有子集上性能均有显著提升,该模型的平均二次加权的卡帕值(QWK)达到79.17%,验证了该模型在AES任务中的有效性。

关 键 词:英文作文  作文自动评分  多层语义特征  深层语义理解  特征融合  自然语言处理  
收稿时间:2020-10-12
修稿时间:2021-01-22

Automated English essay scoring method based on multi-level semantic features
ZHOU Xianbing,FAN Xiaochao,REN Ge,YANG Yong.Automated English essay scoring method based on multi-level semantic features[J].journal of Computer Applications,2021,41(8):2205-2211.
Authors:ZHOU Xianbing  FAN Xiaochao  REN Ge  YANG Yong
Affiliation:1. College of Computer Science and Technology, Xinjiang Normal University, Urumqi Xinjiang 830054, China;2. School of Computer Science and Technology, Dalian University of Technology, Dalian Liaoning 116024, China
Abstract:The Automated Essay Scoring (AES) technology can automatically analyze and score the essay, and has become one of the hot research problems in the application of natural language processing technology in the education field. Aiming at the current AES methods that separate deep and shallow semantic features, and ignore the impact of multi-level semantic fusion on essay scoring, a neural network model based on Multi-Level Semantic Features (MLSF) was proposed for AES. Firstly, Convolutional Neural Network (CNN) was used to capture local semantic features, and the hybrid neural network was used to capture global semantic features, so that the essay semantic features were obtained from a deep level. Secondly, the feature of the topic layer was obtained by using the essay topic vector of text level. At the same time, aiming at the grammatical errors and language richness features that are difficult to mine by deep learning model, a small number of artificial features were constructed to obtain the linguistic features of the essay from the shallow level. Finally, the essay was automatically scored through the feature fusion. Experimental results show that the proposed model improves the performance significantly on all subsets of the public dataset of the Kaggle ASAP (Automated Student Assessment Prize) champion, with the average Quadratic Weighted Kappa (QWK) of 79.17%, validating the effectiveness of the model in the AES tasks.
Keywords:English essay  Automated Essay Scoring (AES)  Multi-Level Semantic Feature (MLSF)  deep semantic understanding  feature fusion  natural language processing  
本文献已被 万方数据 等数据库收录!
点击此处可从《计算机应用》浏览原始摘要信息
点击此处可从《计算机应用》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号