xla
xla copied to clipboard
A machine learning compiler for GPUs, CPUs, and ML accelerators
Automated Code Change
Delete jax.lib.xla_client.execute_with_python_values. Nothing under jax.lib.xla_client is public, so there's no deprecation period required.
Loop iteration offset should now be handled with the offset array implementation.
internal changes (to be reverted)
Testing CI after force submit
Move profiler plugin functions to a separate pybind11 module
[HLO Componentization] Create hlo/builder sub-component (Phase II). This CL takes care of 1. Migrating external projects dependencies from xla/client --> xla/hlo/builder Phase I takes care of 1. Migrating xla/translate -->...
PR #16520: [ROCM] ResetStream function for GemmAlgorithmPicker (BlasSupport interface) Imported from GitHub PR https://github.com/openxla/xla/pull/16520 Here I added **ResetStream** function which sets the underlying stream for cublas/rocblas libraries to default stream...
Preserve HLO shardings on calls and non-entry functions. XLA doesn't inline call instructions and their functions if the call instruction has backend_config. As such in Shardy we do the same....
The performance for some workloads dropped and git bisect points to this [commit](https://github.com/openxla/xla/commit/c48011a1cf55a3129f72ad2c25a2c138f7710cfd) on XLA to be causing the drop. The comments indicate that LLVM optimizations are being suppressed when...