Picture this. A test fails in CI. It's been flaky all week — fails on push, passes when you rerun. So you add --reruns 2 to the pytest command.
Now the suite passes. Green build. Ship it. A week later, the same test fails in production in a way that only happens under load.
You go back to look at the build that passed, and the report says... "passed." One line. No context. No hint that the test ev
