首页 | 官方网站   微博 | 高级检索  
     


Bug localization using latent Dirichlet allocation
Authors:Stacy K Lukins  Nicholas A Kraft  Letha H Etzkorn
Affiliation:1. State Key Laboratory of Software Development Environment, Beihang University, Beijing, China;2. University of Bordeaux, LaBRI, UMR 5800, F-33400, Talence, France;1. Key Laboratory of Dependable Service Computing in Cyber Physical Society, Ministry of Education, Chongqing 400044, PR China;2. School of Software Engineering, Chongqing University, Huxi Town, Shapingba, Chongqing 401331, PR China;3. State Key laboratory of Coal Mine Disaster Dynamics and Control, Chongqing 400044, PR China;1. Center of Exact and Technological Sciences, Federal University of Acre, Rio Branco  AC, Brazil;2. Institute of Computing, Fluminense Federal University, Niterói  RJ, Brazil
Abstract:ContextSome recent static techniques for automatic bug localization have been built around modern information retrieval (IR) models such as latent semantic indexing (LSI). Latent Dirichlet allocation (LDA) is a generative statistical model that has significant advantages, in modularity and extensibility, over both LSI and probabilistic LSI (pLSI). Moreover, LDA has been shown effective in topic model based information retrieval. In this paper, we present a static LDA-based technique for automatic bug localization and evaluate its effectiveness.ObjectiveWe evaluate the accuracy and scalability of the LDA-based technique and investigate whether it is suitable for use with open-source software systems of varying size, including those developed using agile methods.MethodWe present five case studies designed to determine the accuracy and scalability of the LDA-based technique, as well as its relationships to software system size and to source code stability. The studies examine over 300 bugs across more than 25 iterations of three software systems.ResultsThe results of the studies show that the LDA-based technique maintains sufficient accuracy across all bugs in a single iteration of a software system and is scalable to a large number of bugs across multiple revisions of two software systems. The results of the studies also indicate that the accuracy of the LDA-based technique is not affected by the size of the subject software system or by the stability of its source code base.ConclusionWe conclude that an effective static technique for automatic bug localization can be built around LDA. We also conclude that there is no significant relationship between the accuracy of the LDA-based technique and the size of the subject software system or the stability of its source code base. Thus, the LDA-based technique is widely applicable.
Keywords:
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号