Automated N-way Program Merging for Facilitating Family-based Analyses of Variant-rich Software

Autor:	Udo Kelter, Johannes Bürdek, Dennis Reuling, Malte Lochau
Rok vydání:	2019
Předmět:	Theoretical computer science business.industry Computer science 020207 software engineering Context (language use) 02 engineering and technology Base (topology) Automaton Set (abstract data type) Software Factor (programming language) 0202 electrical engineering electronic engineering information engineering 020201 artificial intelligence & image processing business Representation (mathematics) computer computer.programming_language Abstraction (linguistics)
Zdroj:	ACM Transactions on Software Engineering and Methodology. 28:1-59
ISSN:	1557-7392 1049-331X
DOI:	10.1145/3313789
Popis:	Nowadays software tends to come in many different, yet similar variants, often derived from a common code base via clone-and-own. Family-based-analysis strategies have recently shown very promising potential for improving efficiency in applying quality-assurance techniques to such variant-rich programs, as compared to variant-by-variant approaches. Unfortunately, these strategies require a single program representation superimposing all program variants in a syntactically well-formed, semantically sound, and variant-preserving manner, which is usually not available and manually hard to obtain in practice. In this article, we present a novel methodology, called S i MPOSE, for automatically generating superimpositions of existing program variants to facilitate family-based analyses of variant-rich software. To this end, we propose a novel N-way model-merging methodology to integrate the control-flow automaton (CFA) representations of N given variants of a C program into one unified CFA representation. CFA constitute a unified program abstraction used by many recent software-analysis tools for automated quality assurance. To cope with the inherent complexity of N-way model-merging, our approach (1) utilizes principles of similarity-propagation to reduce the number of potential N-way matches, and (2) enables us to decompose a set of N variants into arbitrary subsets and to incrementally derive an N-way superimposition from partial superimpositions. We apply our tool implementation of S i MPOSE to a selection of realistic C programs, frequently considered for experimental evaluation of program-analysis techniques. In particular, we investigate applicability and efficiency/effectiveness trade-offs of our approach by applying S i MPOSE in the context of family-based unit-test generation as well as model-checking as sample program-analysis techniques. Our experimental results reveal very impressive efficiency improvements by an average factor of up to 2.6 for test-generation and up to 2.4 for model-checking under stable effectiveness, as compared to variant-by-variant approaches, thus amortizing the additional effort required for merging. In addition, our results show that merging all N variants at once produces, in almost all cases, clearly more precise results than incremental step-wise 2-way merging. Finally, our comparison with major existing N-way merging techniques shows that S i MPOSE constitutes, in most cases, the best efficiency/effectiveness trade-off.
Databáze:	OpenAIRE
Externí odkaz:	https://explore.openaire.eu/search/publication?articleId=doi_________::47c131b86fdc7f1daa2b4dff00b9946c https://doi.org/10.1145/3313789 Zobrazit plný text záznamu