A new benchmark for evaluating pattern mining methods based on the automatic generation of testbeds
Autor: | Amir Mohamad Ebrahimi, Abbas Rasoolzadegan, Bahareh Bafandeh Mayvan |
---|---|
Rok vydání: | 2019 |
Předmět: |
Source code
Computer science media_common.quotation_subject 020207 software engineering Context (language use) 02 engineering and technology Benchmarking computer.software_genre Field (computer science) Computer Science Applications 0202 electrical engineering electronic engineering information engineering Benchmark (computing) Software design 020201 artificial intelligence & image processing Class diagram Software system Data mining computer Software Information Systems media_common |
Zdroj: | Information and Software Technology. 109:60-79 |
ISSN: | 0950-5849 |
DOI: | 10.1016/j.infsof.2019.01.007 |
Popis: | Context Mining patterns is one of the most attractive topics in the field of software design. Knowledge about the number, type, and location of pattern instances is crucial to understand the original design decisions. Several techniques and tools have been presented in the literature for mining patterns in a software system. However, evaluating the quality of the detection results is usually done manually or subjectively. This can significantly affect the evaluation results. Therefore, a fair comparison of the quality of the various mining methods is not possible. Objective This paper describes a new benchmark to evaluate pattern mining methods in source code or design. Our work aims at overcoming the challenges faced in benchmarking in pattern detection. The proposed benchmark is comprehensive, fair, and objective, with a repeatable evaluation process. Method Our proposed benchmark is based on automatic generation of testbeds using graph theory. The generated testbeds are Java source codes and their corresponding class diagrams in which various types of patterns and their variants are inserted in different locations. The generated testbeds differ in their levels of complexity and full information is available on the utilized patterns. Results The results show that our proposed benchmark is able to evaluate the pattern mining methods quantitatively and objectively. Also, it can be used to compare pattern mining methods in a fair and repeatable manner. Conclusions Based on our findings, it can be argued that benchmarking in the pattern mining field is significantly less mature than topics such as presenting a new detection method. Therefore, special attention is needed in the pattern evaluation topic. Our proposed benchmark is a step towards achieving a comparative understanding of the effectiveness of detection methods and demonstrating their strengths and weaknesses. |
Databáze: | OpenAIRE |
Externí odkaz: |