Comparing fine-grained source code changes and code churn for bug prediction
Author: | Emanuel Giger, Harald C. Gall, Martin Pinzger |
---|---|
Contributors: | University of Zurich, Giger, Emanuel |
Year of publication: | 2011 |
Subject: |
Source code; 10009 Department of Informatics; Computer science; Code coverage; Static program analysis; 000 Computer science knowledge & systems; Machine learning; Software bugs; Code (cryptography); code churn; Software system; prediction models; 1712 Software; nonlinear regression; Software bug; source code changes; KPI-driven code analysis; Data mining; Artificial intelligence; Predictive modelling |
Source: | MSR Technical Report Series TUD-SERG-2011-007 |
ISSN: | 1872-5392 |
DOI: | 10.1145/1985441.1985456 |
Description: | A significant amount of research effort has been dedicated to learning prediction models that allow project managers to efficiently allocate resources to those parts of a software system that are most likely bug-prone and therefore critical. Prominent measures for building bug prediction models are product measures, e.g., complexity, or process measures, such as code churn. Code churn in terms of lines modified (LM) and past changes turned out to be significant indicators of bugs. However, these measures are rather imprecise and do not reflect all the detailed changes of particular source code entities during maintenance activities. In this paper, we explore the advantage of using fine-grained source code changes (SCC) for bug prediction. SCC captures the exact code changes and their semantics down to statement level. We present a series of experiments using different machine learning algorithms with a dataset from the Eclipse platform to empirically evaluate the performance of SCC and LM. The results show that SCC outperforms LM for learning bug prediction models. Accepted for publication in the Proceedings of the Working Conference on Mining Software Repositories, 2011, ACM Press. |
Database: | OpenAIRE |