xla
xla copied to clipboard
A machine learning compiler for GPUs, CPUs, and ML accelerators
Automated Code Change
Allow disabling nvshmem for build targets.
Cleanup unused visibility rules for XProf
Update XNNPACK in XLA
Add JAX Windows presubmit job to XLA repo.
Update layout only when `allow_spmd_sharding_propagation_to_output/parameter` is true. Old GSPMD propagation needs them since they do not have the concept of open/closed sharding. In Shardy with sdy-round-trip, JAX creates the correct...
Include shape info to (Custom)KernelThunk::buffer_uses
[xla:cpu] Remove runtime_topk target
Add a smoke test to PJRT wheel builds. It dlopens the shared object and checks that the PJRT API loads.
Internal refactoring of patch