Parry, O., Kapfhammer, G.M., Hilton, M. et al. (1 more author) (Accepted: 2025) Test flimsiness: Characterizing flakiness induced by mutation to the code under test. In: Proceedings of 2026 IEEE/ACM 48th International Conference on Software Engineering (ICSE ’26). 2026 IEEE/ACM 48th International Conference on Software Engineering (ICSE ’26), 12-18 Apr 2026, Rio de Janeiro, Brazil. ACM. ISBN: 9798400720253/26/04. (In Press)
Abstract
Flaky tests, which fail non-deterministically against the same version of code, pose a well-established challenge to software developers. In this paper, we characterize the overlooked phenomenon of test FLIMsiness: FLakiness Induced by Mutations to the code under test. These mutations are generated by the same operators found in standard mutation testing tools. Flimsiness has profound implications for software testing researchers. Previous studies quantified the impact of pre-existing flaky tests on mutation testing, but we reveal that mutations themselves can induce flakiness, exposing a previously neglected threat. This has serious effects beyond mutation testing, calling into question the reliability of any technique that relies on deterministic test outcomes in response to mutations.
On the other hand, flimsiness presents an opportunity to surface potential flakiness that may otherwise remain hidden. Prior work perturbed the execution environment to augment rerunning-based detection and the test code to support benchmarking. We advance these efforts by perturbing a third major source of flakiness: the code under test. We conducted an empirical study on over half a million test suite executions across 28 Python projects. Our statistical analysis on over 30 million mutant-test pairs unveiled flimsiness in 54% of projects. We found that extending the standard rerunning flaky test detection strategy with code-under-test mutations detects a substantially larger number of flaky tests (median 740 vs. 163) and uncovers many that the standard strategy is unlikely to detect.
Metadata
| Item Type: | Proceedings Paper |
|---|---|
| Authors/Creators: |
|
| Copyright, Publisher and Additional Information: | © 2026 Copyright held by the owner/author(s). |
| Keywords: | Software Testing; Mutation Testing; Flaky Tests |
| Dates: |
|
| Institution: | The University of Sheffield |
| Academic Units: | The University of Sheffield > Faculty of Engineering (Sheffield) > Department of Computer Science (Sheffield) |
| Funding Information: | Funder Grant number ENGINEERING AND PHYSICAL SCIENCE RESEARCH COUNCIL EP/X024539/1 |
| Date Deposited: | 20 Jan 2026 17:26 |
| Last Modified: | 20 Jan 2026 17:26 |
| Status: | In Press |
| Publisher: | ACM |
| Refereed: | Yes |
| Identification Number: | 10.1145/3744916.3773125 |
| Related URLs: | |
| Open Archives Initiative ID (OAI ID): | oai:eprints.whiterose.ac.uk:236716 |
Download
Filename: main.pdf

CORE (COnnecting REpositories)
CORE (COnnecting REpositories)