Exploring Regular Expression Evolution
Autor: | Kathryn T. Stolee, Peipei Wang, Gina R. Bai |
---|---|
Rok vydání: | 2019 |
Předmět: |
business.industry
Computer science String (computer science) 020207 software engineering 02 engineering and technology Commit computer.software_genre Semantics Software bug 0202 electrical engineering electronic engineering information engineering Test suite Feature (machine learning) 020201 artificial intelligence & image processing Artificial intelligence Regular expression business computer Natural language processing Scope (computer science) |
Zdroj: | SANER |
DOI: | 10.1109/saner.2019.8667972 |
Popis: | Although there are tools to help developers understand the matching behaviors between a regular expression and a string, regular-expression related faults are still common. Learning developers’ behavior through the change history of regular expressions can identify common edit patterns, which can inform the creation of mutation and repair operators to assist with testing and fixing regular expressions. In this work, we explore how regular expressions evolve over time, focusing on the characteristics of regular expression edits, the syntactic and semantic difference of the edits, and the feature changes of edits. Our exploration uses two datasets. First, we look at GitHub projects that have a regular expression in their current version and look back through the commit logs to collect the regular expressions’ edit history. Second, we collect regular expressions composed by study participants during problem-solving tasks. Our results show that 1) 95% of the regular expressions from GitHub are not edited, 2) most edited regular expressions have a syntactic distance of 4-6 characters from their predecessors, 3) over 50% of the edits in GitHub tend to expand the scope of regular expression, and 4) the number of features used indicates the regular expression language usage increases over time. This work has implications for supporting regular expression repair and mutation to ensure test suite quality. |
Databáze: | OpenAIRE |
Externí odkaz: |