Jason Ansel
@pytorchbot merge
I am skipping the ROCm tests and opening an issue to add AMD support: #139099
@pytorchbot merge
Can you check if this is fixed by:
```
torch.backends.cuda.matmul.allow_tf32 = False
torch.backends.cudnn.allow_tf32 = False
```
I suspect this is a result of compile being better at enabling tf32 than...
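For context on why toggling TF32 can change results: TF32 keeps float32's 8 exponent bits but only 10 mantissa bits. The sketch below is a rough pure-Python illustration, not how PyTorch or the hardware implements it. The helper names `to_tf32` and `dot` are made up for this example, truncation stands in for the real rounding behaviour, and accumulation happens in Python floats rather than float32.

```python
import struct


def to_tf32(x: float) -> float:
    """Approximate TF32: round x to float32, then zero the low 13 mantissa bits.

    Assumption: plain truncation is used here for simplicity.
    """
    bits = struct.unpack("<I", struct.pack("<f", x))[0]
    bits &= 0xFFFFE000  # keep sign, 8 exponent bits, top 10 of 23 mantissa bits
    return struct.unpack("<f", struct.pack("<I", bits))[0]


def dot(a, b, cast=float):
    """Dot product with each operand passed through `cast` before multiplying."""
    return sum(cast(x) * cast(y) for x, y in zip(a, b))


a = [1.0001, 2.0003, 3.0007]
b = [0.3333, 0.6667, 0.9999]

full = dot(a, b)                   # operands kept at full precision
reduced = dot(a, b, cast=to_tf32)  # operands truncated to TF32-like precision

print("abs difference:", abs(full - reduced))
```

If a mismatch between eager and compiled output is of this size and disappears once both `allow_tf32` flags are set to `False`, reduced-precision matmul is a plausible cause.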
@shunting314 would you mind looking at this again?
cc @eellison any ideas on why there would be a difference related to cudagraphs?
When you create your ArgumentParser, you should do
```
argparse.ArgumentParser(parents=opentuner.argparsers())
```
`parents=opentuner.argparsers()` will add all the needed OpenTuner args, including default values. If you don't want the opentuner args on...
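A minimal sketch of how `parents=` behaves, using only the standard library so it runs without OpenTuner installed. `stand_in_argparsers()` is a hypothetical placeholder for `opentuner.argparsers()`, and the flag names `--example-limit`, `--example-db`, and `--source` are invented for illustration; they are not real OpenTuner options.

```python
import argparse


def stand_in_argparsers():
    """Hypothetical stand-in for opentuner.argparsers(): returns parent parsers.

    Parent parsers are built with add_help=False so that -h/--help is not
    defined twice when they are merged into the child parser.
    """
    search = argparse.ArgumentParser(add_help=False)
    search.add_argument("--example-limit", type=float, default=None)  # invented flag
    storage = argparse.ArgumentParser(add_help=False)
    storage.add_argument("--example-db", default="example.db")  # invented flag
    return [search, storage]


def build_parser():
    # The child parser inherits every argument (and default) from its parents.
    parser = argparse.ArgumentParser(parents=stand_in_argparsers())
    parser.add_argument("--source", default="kernel.c")  # program-specific arg
    return parser


# Only --source is given; the inherited args fall back to their defaults.
args = build_parser().parse_args(["--source", "foo.c"])
print(args)
```

With the real library, the same pattern would presumably be `argparse.ArgumentParser(parents=opentuner.argparsers())` followed by your own `add_argument` calls.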
Hrm weird. Seems to help benchmarks on average though.
@pytorchbot merge
@pytorchbot merge