Add support for marking tests as flaky
This restriction is useful if a package contains tests that are not yet reliable enough for gating CI, but can usefully be run to assess how often they fail, with the intention of promoting the test to non-flaky when it is reliable.
Signed-off-by: Simon McVittie smcv@debian.org Closes: #851558