DebtHunter: A Machine Learning-based Approach for Detecting Self-Admitted Technical Debt
Autor: | Francesca Arcelli Fontana, Antonela Tommasel, Irene Sala |
---|---|
Přispěvatelé: | Sala, I, Tommasel, A, Arcelli Fontana, F |
Rok vydání: | 2021 |
Předmět: |
Self-admitted technical debt
Source code business.industry Process (engineering) Computer science Natural language processing media_common.quotation_subject Software development Technical debt Machine learning computer.software_genre ING-INF/05 - SISTEMI DI ELABORAZIONE DELLE INFORMAZIONI Open source Software Debt Code (cryptography) Artificial intelligence business computer media_common |
Zdroj: | EASE |
DOI: | 10.1145/3463274.3464455 |
Popis: | Due to limited time, budget or resources, a team is prone to introduce code that does not follow the best software development practices. This code that introduces instability in the software projects is known as Technical Debt (TD). Often, TD intentionally manifests in source code, which is known as Self-Admitted Technical Debt (SATD). This paper presents DebtHunter, a natural language processing (NLP)- and machine learning (ML)- based approach for identifying and classifying SATD in source code comments. The proposed classification approach combines two classification phases for differentiating between the multiple debt types. Evaluations over 10 open source systems, containing more than 259k comments, showed that the approach was able to improve the performance of others in the literature. The presented approach is supported by a tool that can help developers to effectively manage SATD. The tool complements the analysis over Java source code by allowing developers to also examine the associated issue tracker. DebtHunter can be used in a continuous evolution environment to monitor the development process and make developers aware of how and where SATD is introduced, thus helping them to manage and resolve it. |
Databáze: | OpenAIRE |
Externí odkaz: |