Right now it's terrifically difficult not to write certain types of tests that are flaky.
An example of this is here. Since the entropy generated is different each run this test will occasionally fail.