Automatic Generation of Valid and Invalid Test Data for String Validation Routines Using Web Searches and Regular Expressions

by Muzammil Shahbaz, Phil McMinn and Mark Stevenson

Science of Computer Programming, 2015



Classic approaches to automatic input data generation are usually driven by the goal of obtaining program coverage and the need to solve or find solutions to path constraints to achieve this. As inputs are generated with respect to the structure of the code, they can be ineffective, difficult for humans to read, and unsuitable for testing missing implementation. Furthermore, these approaches have known limitations when handling constraints that involve operations with string data types. This paper presents a novel approach for generating string test data for string validation routines, by harnessing the Internet. The technique uses program identifiers to construct ...


Reference

Muzammil Shahbaz, Phil McMinn and Mark Stevenson. Automatic Generation of Valid and Invalid Test Data for String Validation Routines Using Web Searches and Regular Expressions. Science of Computer Programming, vol. 97, no. 4, pp. 405–425, 2015.


BibTeX Entry
@article{Shahbaz2015,
  author  = "Shahbaz, Muzammil and McMinn, Phil and Stevenson, Mark",
  title   = "Automatic Generation of Valid and Invalid Test Data for String Validation Routines Using Web Searches and Regular Expressions",
  journal = "Science of Computer Programming",
  volume  = "97",
  number  = "4",
  pages   = "405--425",
  year    = "2015"
}