Jason Ansel

Results: 199 comments by Jason Ansel

I am skipping the ROCm tests and filing an issue to add AMD support: #139099

Can you check if this is fixed by:
```
torch.backends.cuda.matmul.allow_tf32 = False
torch.backends.cudnn.allow_tf32 = False
```
I suspect this is a result of compile being better at enabling tf32 than...
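
For context on why toggling TF32 can change results, here is a rough, pure-Python sketch of the precision difference. It assumes TF32 keeps 10 explicit mantissa bits (versus 23 for float32) and emulates that by truncating the low 13 bits. Real hardware rounding and accumulation may differ, and the helper names `to_tf32`, `to_fp32`, and `dot` are hypothetical, not part of PyTorch.

```python
import struct


def to_fp32(x: float) -> float:
    """Round a Python float to the nearest float32 value."""
    return struct.unpack("<f", struct.pack("<f", x))[0]


def to_tf32(x: float) -> float:
    """Emulate TF32 input precision (assumption: plain truncation).

    Converts to float32, then clears the lowest 13 of the 23 mantissa
    bits, leaving the 10 explicit mantissa bits that TF32 keeps.
    """
    bits = struct.unpack("<I", struct.pack("<f", x))[0]
    bits &= 0xFFFFE000  # keep sign, exponent, and top 10 mantissa bits
    return struct.unpack("<I".replace("I", "f"), struct.pack("<I", bits))[0]


def dot(a, b, cast):
    """Dot product with each input cast first; accumulation stays in
    Python floats, which is a simplification of what a GPU does."""
    return sum(cast(x) * cast(y) for x, y in zip(a, b))


# 1 + 2**-11 is exactly representable in float32 but not in 10 mantissa bits.
a = [1.0 + 2**-11] * 4
b = [1.0] * 4
fp32_result = dot(a, b, to_fp32)
tf32_result = dot(a, b, to_tf32)
print(fp32_result, tf32_result, fp32_result - tf32_result)
```

If a mismatch between two runs shrinks to this order of magnitude once both `allow_tf32` flags are set to `False`, that is consistent with TF32 being the cause, though it does not prove it.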

@shunting314 would you mind looking at this again?

cc @eellison any ideas on why there would be a difference related to cudagraphs?

When you create your ArgumentParser, you should do
```
argparse.ArgumentParser(parents=opentuner.argparsers())
```
`parents=opentuner.argparsers()` will add all the needed OpenTuner args, including default values. If you don't want the opentuner args on...
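
As a minimal sketch of how `parents=` behaves, the example below uses only the standard library. `tuner_argparsers()` is a hypothetical stand-in for `opentuner.argparsers()`, and the `--budget` and `--input` flags are invented for illustration; they are not real OpenTuner options.

```python
import argparse


def tuner_argparsers():
    """Hypothetical stand-in for opentuner.argparsers().

    Returns a list of parent parsers. Parents need add_help=False so
    their -h/--help option does not conflict with the child's.
    """
    parent = argparse.ArgumentParser(add_help=False)
    parent.add_argument("--budget", type=int, default=100)  # invented flag
    return [parent]


def build_parser():
    """Build a program parser that inherits every parent argument."""
    parser = argparse.ArgumentParser(parents=tuner_argparsers())
    parser.add_argument("--input", default="data.txt")  # program's own flag
    return parser


# Parsing an empty list yields a namespace filled with default values,
# including the defaults declared on the parent parsers.
args = build_parser().parse_args([])
print(args.budget, args.input)
```

Because the parent's arguments are copied into the child parser, the resulting namespace carries both sets of options and their defaults.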

Hrm weird. Seems to help benchmarks on average though.