xla
A machine learning compiler for GPUs, CPUs, and ML accelerators
Add an algebraic simplification pattern for multiply(add(conv(input, filter), bias), broadcast(constant)) -> add(conv(input, multiply(filter, broadcast(constant))), multiply(bias, broadcast(constant)))
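The rewrite is valid because convolution is linear in the filter: a scale applied to the convolution output (here modeled as a per-output-channel constant, the common case when folding a following multiply into the conv) can instead be folded into the filter, with the bias scaled separately. A minimal JAX sketch checking the identity numerically; the tensor names, shapes, and layouts are illustrative assumptions, not taken from the XLA pass itself:

```python
import jax
import jax.numpy as jnp

kx, kw, kb, kc = jax.random.split(jax.random.PRNGKey(0), 4)

# NHWC input, HWIO filter, per-output-channel bias and scale (assumed shapes).
x = jax.random.normal(kx, (1, 8, 8, 3))
w = jax.random.normal(kw, (3, 3, 3, 4))
b = jax.random.normal(kb, (4,))
c = jax.random.normal(kc, (4,))

conv = lambda inp, flt: jax.lax.conv_general_dilated(
    inp, flt, window_strides=(1, 1), padding="SAME",
    dimension_numbers=("NHWC", "HWIO", "NHWC"))

# Original form: multiply(add(conv(x, w), b), broadcast(c)).
before = (conv(x, w) + b) * c
# Rewritten form: add(conv(x, multiply(w, broadcast(c))), multiply(b, c)).
after = conv(x, w * c) + b * c

print(jnp.max(jnp.abs(before - after)))  # agrees up to float32 rounding
```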
Reverts 693ee2e13225331bebc946442af7e2d59355adea
This PR is the 1st step (out of 2) to improve the performance of deterministic scatter. Originally, the scatter op would be expanded into a deterministic form in `xla/service/ScatterExpander.cc`. However, since...
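For context on why the deterministic expansion exists at all: when a scatter-add has duplicate indices, the order in which the floating-point updates are accumulated changes the rounded result, so a parallel scatter (e.g. one using atomics) is not reproducible, and the expander serializes the updates instead. A small JAX sketch of that order sensitivity, with illustrative values and a hypothetical `serial_scatter_add` helper:

```python
import jax.numpy as jnp

operand = jnp.zeros(2, dtype=jnp.float32)
indices = jnp.array([0, 0, 0])

# The same three updates aimed at index 0, in two different orders.
u1 = jnp.array([1e8, -1e8, 1.0], dtype=jnp.float32)
u2 = jnp.array([1e8, 1.0, -1e8], dtype=jnp.float32)

def serial_scatter_add(out, idx, ups):
    # Apply the updates one at a time, mimicking a serialized expansion.
    for i, u in zip(idx.tolist(), ups.tolist()):
        out = out.at[i].add(u)
    return out

print(serial_scatter_add(operand, indices, u1))  # [1. 0.]
print(serial_scatter_add(operand, indices, u2))  # [0. 0.] -- order changed the result
```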
[JAX] Add PyClient::GetAllDevices() and expose it as an internal JAX backend API. The JAX backend forwards `xla::ifrt::Client::GetAllDevices()` to `xla::PyClient::GetAllDevices()`, which is accessible via JAX `backend.get_all_devices()`. This API is an internal JAX...
[IFRT] Add Client::GetAllDevices(). This defines `Client::GetAllDevices()`. It is similar to `Client::devices()`, but it enumerates all devices available on the client, regardless of the type/kind of devices. This multi-device behavior was...
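A hedged sketch of how the Python-visible side of the two entries above might be exercised; `backend.get_all_devices()` is an internal API and only present in sufficiently recent jaxlib builds, and going through `jax.lib.xla_bridge` to obtain the `PyClient` is itself an internal detail that may change:

```python
from jax.lib import xla_bridge

backend = xla_bridge.get_backend()  # the xla::PyClient for the default platform

# devices(): the devices the client normally exposes for running computations.
print([d.id for d in backend.devices()])

# get_all_devices() (per the change above): every device available on the
# client, regardless of device type/kind. Guarded because it is internal.
if hasattr(backend, "get_all_devices"):
    print([d.id for d in backend.get_all_devices()])
```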
Remove more GPU/CUDA/ROCm attribute guards from xla/service/gpu - This removes `if_gpu_is_configured` guards from targets that are only supposed to be built for GPU. (Also tags them as `gpu` so that...
[TEST] Debug linking of mlir_fusion_opt. For whatever reason it sometimes can't find cudnn. Let's find out why.
…ocblas_get_version_string
#sdy add JAX Shardy support for memories.