xla issues

[XLA:CPU] Support limiting LLVM codegen in Aarch64 and other new x86 instructions

PR https://github.com/openxla/xla/pull/17722 supports limiting the CPU ISA that LLVM will codegen. It currently only supports x86 ISAs from SSE4_2 up to AMX_FP16 and imposes a strict ordering (SSE4_2 < AVX...

penpornk

enhancement

CPU

Add more elementwise ops with bf16 support on Hopper.

Add more elementwise ops with bf16 support on Hopper. There are a few more elementwise ops where we don't need to convert to f32 when running on Hopper.

copybara-service[bot]

PR #17636: [NVIDIA GPU] Enhance concurrency handling in cross-rank address sharing

PR #17636: [NVIDIA GPU] Enhance concurrency handling in cross-rank address sharing Imported from GitHub PR https://github.com/openxla/xla/pull/17636 This is a followup PR to https://github.com/openxla/xla/pull/15144. A distributed cache is maintained when device...

copybara-service[bot]

jaxlib.xla_extension.XlaRuntimeError: UNIMPLEMENTED: Support for annotation groups with gaps doesn't exist yet

RIght now, users can't reuse a jitted method that includes scheduling ids more than once in a JAX program. Here is a very stripped down code JAX example that showcases...

chaserileyroberts

bug

xla
xla copied to clipboard

Metadata

[XLA:CPU] Support limiting LLVM codegen in Aarch64 and other new x86 instructions

Add more elementwise ops with bf16 support on Hopper.

PR #17636: [NVIDIA GPU] Enhance concurrency handling in cross-rank address sharing

jaxlib.xla_extension.XlaRuntimeError: UNIMPLEMENTED: Support for annotation groups with gaps doesn't exist yet

Major deps udpate:

Add int2 dtypes to TensorFlow.

[IFRT] Define Layout and common subclasses

Make error messages from dtype conversion more readable

[OpenXLA] Fix color of JAX logo in OpenXLA diagram

Add sharding devices to XlaCompileOptions and plumb them through from JAX.

← Metadata

Owner

Metadata

xla xla copied to clipboard

Metadata

← Metadata

Owner

Metadata

xla
xla copied to clipboard