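Research on a Software Defect Prediction Technique Based on Topic Models
Cite this article: ZHANG Ze-tao, YE Li-jun, CHENG Wei, GU Jun. Research on a Software Defect Prediction Technique Based on Topic Models[J]. Computer Engineering & Science, 2016, 38(5): 932-937
Authors: ZHANG Ze-tao  YE Li-jun  CHENG Wei  GU Jun
Affiliations: 1. Shanghai Key Laboratory of Aerospace Intelligent Control Technology; 2. Shanghai Institute of Spaceflight Control Technology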
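Abstract (translated from the Chinese): Software defect prediction typically trains a prediction model on surface features of the code and applies it to new samples, ignoring the different technical aspects and topics hidden behind the code, which leads to inaccurate predictions. To address this problem, a topic-model-based software defect prediction method is proposed. A software code base is regarded as a collection of different technical aspects and topics, and different topics or technical aspects have different defect tendencies. The LDA topic model is used to model the topics and their defect tendencies, topic metrics are computed from the modeling results, and traditional metrics are combined with the topic metrics for model training and prediction. Experimental results show that the method is more accurate than traditional software defect prediction techniques, keeps the model relatively stable as the software evolves, and is applicable to a variety of defect prediction tasks.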

Keywords: topic model  defect prediction  software engineering
Received: 2015-02-02
Revised: 2016-05-25

A software defect prediction method based on topic model
ZHANG Ze-tao, YE Li-jun, CHENG Wei, GU Jun. A software defect prediction method based on topic model[J]. Computer Engineering & Science, 2016, 38(5): 932-937
Authors: ZHANG Ze-tao  YE Li-jun  CHENG Wei  GU Jun
Affiliation: (1. Shanghai Key Laboratory of Aerospace Intelligent Control Technology, Shanghai 201109; 2. Shanghai Institute of Spaceflight Control Technology, Shanghai 201109, China)
Abstract: Traditional defect prediction models typically consider only the textual features of source code, comments, and so on, ignoring hidden topics such as technical aspects and business logic. To address this problem, we present a new topic-based defect prediction model. The software corpus is assumed to be composed of a collection of different topics and technical aspects, which lead to different defect tendencies. A set of topic-based metrics is proposed. The LDA topic model is then adopted to generate the topics and the corresponding parameters, and the prediction model is trained on both the topic metrics and some traditional metrics. Experimental results show that the proposed method outperforms traditional defect prediction methods and keeps the model stable as the software evolves, which means the new method can be used efficiently for defect prediction tasks in software engineering.
Keywords: topic model  software defect prediction  software engineering
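
The abstract describes the pipeline only in outline (fit LDA, derive topic metrics, combine them with traditional metrics) and gives no formulas. The sketch below illustrates one plausible reading and is not the authors' definition: it assumes a per-topic "defect density" (defects weighted by each file's topic share) and a per-file topic metric (the topic-share-weighted sum of those densities). The LDA fit itself is left out; `theta` stands for a document-topic matrix that any LDA implementation could supply. All function and variable names are hypothetical.

```python
from typing import List, Sequence


def topic_defect_density(theta: Sequence[Sequence[float]],
                         defects: Sequence[int]) -> List[float]:
    """Assumed metric, not taken from the paper.

    theta[i][k] is the share of topic k in file i (e.g. from an LDA fit).
    defects[i] is the known defect count of file i.
    Returns, for each topic, the topic-share-weighted defect count
    divided by the total share of that topic across all files.
    """
    n_topics = len(theta[0])
    density = []
    for k in range(n_topics):
        mass = sum(row[k] for row in theta)
        weighted = sum(row[k] * d for row, d in zip(theta, defects))
        density.append(weighted / mass if mass > 0 else 0.0)
    return density


def topic_metric(row: Sequence[float], density: Sequence[float]) -> float:
    """Per-file topic metric: topic shares weighted by topic defect density."""
    return sum(p * d for p, d in zip(row, density))


def combined_features(traditional: Sequence[Sequence[float]],
                      theta: Sequence[Sequence[float]],
                      density: Sequence[float]) -> List[list]:
    """Append the topic metric to each file's traditional metric vector."""
    return [list(t) + [topic_metric(row, density)]
            for t, row in zip(traditional, theta)]


# Toy data: two training files, each made up entirely of one topic.
density = topic_defect_density([[1.0, 0.0], [0.0, 1.0]], [4, 0])
# A new file that is 25% topic 0 and 75% topic 1, with two
# placeholder traditional metrics (e.g. lines of code, complexity).
features = combined_features([[120, 7]], [[0.25, 0.75]], density)
print(features)  # [[120, 7, 1.0]]
```

The resulting feature rows could then be passed to any standard classifier. Which traditional metrics and which learner the paper uses is not stated in the abstract.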