Automated Discovery of Valid Test Strings from the Web using Dynamic Regular Expressions Collation and Natural Language Processing

by Muzammil Shahbaz, Phil McMinn and Mark Stevenson

International Conference on Quality Software (QSIC 2012)


A more recent and expanded journal version of this paper is available — see "Automatic Generation of Valid and Invalid Test Data for String Validation Routines Using Web Searches and Regular Expressions".


Classic approaches to test input generation – such as dynamic symbolic execution and search-based testing – are commonly driven by a test adequacy criterion such as branch coverage. However, there is no guarantee that these techniques will generate meaningful and realistic inputs, particularly in the case of string test data. Also, these techniques have trouble handling path conditions involving string operations that are inherently complex in nature. This paper presents a novel approach of finding valid values by collating suitable regular expressions dynamically that validate the format of the string values, such as an email address. The regular expressions are ... [more]


Reference

Muzammil Shahbaz, Phil McMinn and Mark Stevenson. Automated Discovery of Valid Test Strings from the Web using Dynamic Regular Expressions Collation and Natural Language Processing. International Conference on Quality Software (QSIC 2012), pp. 79–88, 2012


Bibtex Entry
@inproceedings{Shahbaz2012,
  author    = "Shahbaz, Muzammil and McMinn, Phil and Stevenson, Mark",
  title     = "Automated Discovery of Valid Test Strings from the Web using Dynamic Regular Expressions Collation and Natural Language Processing",
  booktitle = "International Conference on Quality Software (QSIC 2012)",
  pages     = "79--88",
  year      = "2012",
  publisher = "IEEE"
}