Seizing Requirements Engineering Issues through Supervised Learning Techniques
Autor: | Luciana C. Ballejos, Mariel Alejandra Ale, Maria Guadalupe Gramajo |
---|---|
Rok vydání: | 2020 |
Předmět: |
General Computer Science
Requirements engineering Business rule Computer science business.industry Supervised learning Decision tree 020207 software engineering 02 engineering and technology Machine learning computer.software_genre Support vector machine Naive Bayes classifier 0202 electrical engineering electronic engineering information engineering 020201 artificial intelligence & image processing Artificial intelligence Software requirements Electrical and Electronic Engineering business computer Requirements analysis |
Zdroj: | IEEE Latin America Transactions. 18:1164-1184 |
ISSN: | 1548-0992 2002-2018 |
DOI: | 10.1109/tla.2020.9099757 |
Popis: | In recent years, the popularity of machine learning techniques has grown due to the availability of larges volumes of data and the increased processing capacity of computers. Despite the inherent value of these techniques, few studies have attempted to summarize how machine learning algorithms, especially supervised learning have contributed to task automation and resolving challenges in Requirements Engineering. This paper proposes a systematic mapping of the literature to identify and analyze proposals which employ supervised learning in Requirements Engineering between 2002-2018. The goal of this research is to identify trends, datasets, and methods used. Thirty-three studies were selected based on defined inclusion and exclusion criteria. The results show that researches using these techniques focuses on eight broad categories: detection of linguistic problems in requirements documents and artifacts written in natural language, classification of document content, traceability, effort estimation, requirements analysis, failures prediction, quality and detection of business rules. The most used supervised learning algorithms were Support Vector Machine, Naive Bayes, Decision Tree, K-Nearest Neighbour, and Random Forest. Twenty-five public and twenty -eight private data sources were identified. Among the most used public data sources are Predictor Models in Software Engineering, iTrust Electronic Health Care System and Metric Data Program from NASA. |
Databáze: | OpenAIRE |
Externí odkaz: |