mutation_test 1.3.1 mutation_test: ^1.3.1 copied to clipboard
A command line application to measure the quality of your test cases by mutating your code and checking if your tests detect the changes.
Mutation testing #
When writing test cases for software, you often rely on metrics like code coverage to verify that your test cases actually test your program. However, this cannot be determined with a simple metric. It is possible to reach high code coverage, while you are only asserting a fraction of the observable behaviour of your units. You can evaluate the quality of your tests by modifying your program in a small way and then verify that your tests are sensitive to that change. This process is called Mutation testing.
This repository contains a simple command line program that automates these tests for code in any programming language. It can be customized to your needs, because all rules how the source code is modified and how to run the tests can be defined in XML documents. The program is fully self contained, so you can just grab the binary and start testing!
Quick start #
If you are working on a dart project, you can simply run the binary without any arguments at the root of your project. The application will the assume that "dart test" is the test command and that all files ending with ".dart" in the directory lib/ are input files.
# Adds the package
dart pub add --dev mutation_test
dart pub run mutation_test
Running this command may take a long time (hours depending on the size of your code). The output will be written to the directory ./mutation-test-report. The default report format is html. A top-level report will be generated listing all input files:
From there, you can follow the links to the reports for the individual input files. These reports show all lines of the source files, and undetected mutations are marked as red lines. You can view the undetected changes by clicking on the respective line:
The application also supports serveral command line options:
# Prints a summary of all command line options:
dart pub run mutation_test --help
# Run the tests defined in "example/config.xml":
dart pub run mutation_test example/config.xml
# Or a fully customized test run with a rules file and 3 input files:
# The rules contained in mutation-rules.xml are always used when testing files.
# inputset1.xml may define special rules for some files that
# are also listed in the same xml document.
# The input files source1.cpp and source2.cpp
# are just tested with the rules from mutation-rules.xml (--rules).
# The output is written to directory output (-o) and the
# report is generated as markdown file (-f md).
dart pub run mutation_test -f md -o output --rules mutation-rules.xml inputset1.xml \
source1.cpp source2.cpp
The first command in the section above would produce the following report. Check also the examples folder, as it contains the inputs to produce this report. The API documentation generated by dart can be found on pub.
Features #
- Fully configurable mutation rules via XML documents and regular expressions
- Sections of files can be whitelisted on a per file basis
- You can add global exclusion rules for e.g. comments, loop conditions via regular expressions
- Different report formats are supported: html, markdown and XML
A brief description of the program #
mutation_test is a program that mutates your source code and verifies that the test commands specified in the input xml files are sensitive to those changes. Mutations are done as simple text replacements with regular expressions, so any text file can be mutated. Once one of the files has been mutated, all provided test commands are run as a separate process. The exit code of these commands is used to verify that the mutation was detected. If all tests return the expected return value, then the mutation was undetected and is added to the results. After all mutations were done, the results will be written to the terminal and a report file is generated. mutation_test is free software, as in "free beer" and "free speech".
mutation_test contains a set of builtin rules, that allow you to start testing right away. However, all rules defining the behaviour of this program can be customized. They are defined in XML documents, and you can change:
- input files and whitelist lines for mutations
- compile/test commands, expected return codes and timeouts
- provide exclusion zones via regular expressions
- mutation rules as simple text replacement or via regular expressions including capture groups
- the quality gate and quality ratings You can view a complete example with every possible XML element parsed by this program by invoking "mutation_test -s". This will print a XML document to the standard output. The displayed document also contains comments explaining the syntax of the XML file. You can provide multiple input documents for a single program start. The inputs are split into three categories:
- xml rules documents: The mutation rules for all other files are parsed from these documents and added globally. Rules are specified via "--rules".
- xml documents: These files will be parsed like the rules documents, but anything defined in them applies only inside this document.
- all other input files If a rules file is provided via the command line flag "--rules", then the builtin rules are disabled, unless you specifically add them by passing "-b". You can provide as many rule sets as you like, and all of them will be added globally. The rest of the input files is processed individually. If the file extension is ".xml", then the file will be parsed like an additional rules file. However, this document must have a
The rules documents and the input xml files use the same syntax, so both files may define mutation rules, inputs, exclusions or test commands. However, a quality threshold may only be defined once.
Reports #
After a input file is processed, a report is generated. You can choose multiple output formats for the reports. As default, a html file is generated, but you can also choose markdown or XML. You can see examples for the produced outputs in the example folder.
Input XML documents #
This chapter explains the structure of the input XML documents. They must use the following structure:
<?xml version="1.0" encoding="UTF-8"?>
<mutations version="1.0">
<files>
...
</files>
<directories>
...
</directories>
<commands>
...
</commands>
<exclude>
...
</exclude>
<rules>
...
</rules>
<threshold failure="80">
...
</threshold>
</mutations>
You can see an example for an input document in the example folder, or the application can generate one by running one of these commands:
# Shows a XML document with the complete syntax:
mutation_test -s
# Shows the builtin mutation rules and exclusions:
mutation_test -g
The generated documents also contain some helpful comments on how to create your own rules. You should usually provide two different documents: one with the mutation rules given as argument to "-r" and another one with the input files. The reason why mutation_test always loads two files (unless you disable the builtin ruleset via "--no-builtin" and don't provide your own rules file) is that you can reuse the same set of rules for many different input files.
Files #
The children of "files" elements are individual files:
<files>
<file>example/source.dart</file>
<file>example/source2.dart
<!-- lines can be whitelisted -->
<!-- if there is no whitelist, the whole file is used -->
<!-- line index starts at 1 -->
<lines begin="13" end="24"/>
<lines begin="29" end="35"/>
</file>
</files>
The application will perform the mutation tests in sequence on the listed files. All mutations that are not in an exclusion or inside a whitelisted area will be applied.
Directories #
The children of "directories" elements are directories where files are searched:
<directories>
<!-- adds all files in the directory -->
<directory>src</directory>
<!-- adds files matching one of the patterns. -->
<directory>lib
<!-- matching tokens need the attribute pattern, which holds a regular expression -->
<matching pattern="\.cpp$"/>
<matching pattern="\.cxx$"/>
<matching pattern="\.c$"/>
</directories>
</directories>
The application will perform the mutation tests on all files found in the directories.
Commands #
The commands block lets you specify the command line programs to verify that a mutation is detected. The commands are run in document sequence and must be each a single command line call.
<!-- Specify the test commands here with the command element -->
<!-- The text of the command element will be executed as shell process -->
<!-- The return value of the command will used to check for success -->
<!-- If all commands execute successfully, a mutation counts as undetected -->
<commands>
<!-- All attributes here are optional -->
<!-- group: is used to show statistics for the commands -->
<!-- expected-return: this value is compared to the return value of the
command. Must be an integer -->
<!-- working-directory: Where the program is executed. Defaults to . -->
<!-- timeout: Timeout in seconds. Must be an integer. If not present,
the commands will run until they are finished. -->
<command group="compile" expected-return="0"
working-directory=".">make -j8</command>
<command group="test" expected-return="0" working-directory="."
timeout="10">ctest -j8</command>
</commands>
Exclude #
You can create rules to exclude protions of the source files from mutations:
<exclude>
<!-- excludes anything between two tokens -->
<token begin="//" end="\n"/>
<token begin="#" end="\n"/>
<!-- excludes anything that matches a pattern -->
<regex pattern="/[*].*?[*]/" dotAll="true"/>
<!-- exclude loops to prevent infinte tests -->
<regex pattern="[\s]for[\s]*\(.*?\)[\s]*{" dotAll="true"/>
<regex pattern="[\s]while[\s]*\(.*?\)[\s]*{.*?}" dotAll="true"/>
<!-- lines can also be globally excluded -->
<!-- line index starts at 1 -->
<!-- lines begin="1" end="2"/-->
</exclude>
Rules #
This element is the most important part of the document. It defines what is mutated, and how it is changed.
<!-- The rules element describes all mutations done during a mutation test -->
<!-- The following children are parsed: literal and regex -->
<!-- A literal element matches the literal text -->
<!-- A regex element mutates source code if the regular expression matches -->
<!-- Each of them must have at least one mutation child -->
<rules>
<!-- A literal element matches the literal text and replaces it with the
list of mutations. This will replace any "+" with "-" or "*". -->
<literal text="+">
<mutation text="-"/>
<mutation text="*"/>
</literal>
<!-- It is also possible to match a regular expression with capture groups. -->
<!-- If the optional attribute dotAll is set to true,
then the . will also match newlines. -->
<!-- If not present, the default value for dotAll is false. -->
<!-- Here, we capture everything inside of the braces of "if ()" -->
<regex pattern="[\s]if[\s]*\((.*?)\)[\s]*{" dotAll="true">
<!-- You can access groups via $1. -->
<!-- If your string contains a $ followed by a number that should not be
replaced, escape the dollar \$ -->
<!-- If your string contains a \$ followed by a number that should not be
replaced, escape the slash \\$ -->
<!-- Tabs and newlines should also be escaped. -->
<mutation text=" if (!($1)) {"/>
</regex>
</rules>
Threshold #
The threshold element allows you to configure the limit for a successful analysis and the quality ratings. Below is the built-in configuration:
<!-- Configures the reporting thresholds as percentage of detected mutations -->
<!-- Attribute failure is required and must be a floating point number. -->
<!-- Note: There can only be one threshold element in all input files! -->
<!-- If no threshold element is found, these values will be used. -->
<threshold failure="80">
<!-- Provides reliability rating levels. Attributes are required. -->
<rating over="100" name="A"/>
<rating over="80" name="B"/>
<rating over="60" name="C"/>
<rating over="40" name="D"/>
<rating over="20" name="E"/>
<rating over="0" name="F"/>
</threshold>
When setting a failure limit, remember that some mutations may be impossible to detect (e.g. converting "0" to "-0").
Table of XML elements #
Here is a table of all XML elements that are parsed by this program:
Element | Children | Attributes | Description |
---|---|---|---|
mutations | files, rules, exclude, commands | version | Top level element |
files | file | Holds the list of files to mutate | |
directories | directory | recusive | Holds the list of directories to search for files |
exclude | token, regex, lines | Holds the list of exclusions from mutations. | |
commands | command | Holds the list of commands to run | |
rules | literal, regex | Holds the list of mutation rules | |
file | lines | Contains the path the to file as text. If there are lines children present, only the given lines are mutated. | |
lines | begin, end | Specifies an interval of lines [begin,end] in the source file. | |
matching | pattern | Specifies the pattern for the file names in the directory. | |
command | name, group, expected-return, timeout | Contains the command to execute as text. All attributes are optional. | |
token | begin, end | A range in the source file delimited by the begin and end tokens. | |
literal | mutation | text | Matches the string in attribute text and replaces it with its children. |
regex | mutation | pattern, dotAll | A pattern for a regular expression. The expression is always multiline and processes the complete file. You can use "." to match newlines if the optional attribute dotAll is set to true. |
mutation | text | A replacement for a match. If this element is a child of a regex node, then capture groups can be used in the text via $i. | |
threshold | rating | failure | Configures the limit for a failed analysis and the quality ratings |
rating | over, name | A quality rating. Attribute over is the lowest percentage for this rating. |
Command line arguments #
mutation_test <options> <input xml files...>
The program accepts the following command line arguments:
Short | Long | Description |
---|---|---|
-h | --help | Displays the help message |
--version | Prints the version | |
--about | Prints information about the application | |
-b | --(no-)builtin | Adds or removes the builtin ruleset |
-s | --show-example | Prints a XML file to the console with every possible option |
-g | --generate-rules | Prints the builtin ruleset as XML string |
-v | --verbose | Verbose output |
-q | --quiet | Disable output |
-d | --dry | Dry run - loads the configuration and counts the possible mutations in all files, but runs no tests |
-o | --output= | Sets the output directory (defaults to ".") |
-f | --format | Sets the report file format [html (default), md, xml, all, none] |
-r | --rules= | Overrides the builtin ruleset with the rules in the given XML Document |
The rest are excepted to be paths to input XML configuration files.
License #
mutation_test is free software, as in "free beer" and "free speech". All Code is licensed with the BSD-3-Clause license, see file "LICENSE"
Issue tracker #
You can view the issues or request features at the issue tracker.