xla icon indicating copy to clipboard operation
xla copied to clipboard

A machine learning compiler for GPUs, CPUs, and ML accelerators

Results 653 xla issues
Sort by recently updated
recently updated
newest added

PR #32738: [XLA:GPU] Allow cuDNN scaled dot fusions in the gemm autotuner Imported from GitHub PR https://github.com/openxla/xla/pull/32738 📝 Summary of Changes Allow selecting cuDNN gemm configs when autotuning scaled dot...

📝 Summary of Changes This PR adds a functional test checking for a particular failed gemm config on rocm platform. But this might also be useful on cuda. To detect...

We had several failures on ROCM side due to very large gridDim.x values for fusion kernels. Here is the proposal to split gridDim.x into gridDim.x/y in these cases. 1. added...