xla issues

Add some additional args for select_and_scatter_test for certain backends. Also clean up includes for the select_and_scatter_test.

copybara-service[bot]

Ensure BFloat16Propagation respects if an instruction does not support mixed precision.

copybara-service[bot]

This CL adds a feature in the host instrumentation where tuple outputs are handled correctly. The instrumentation handler was not storing the tuple literals in a way that was compatible with the HloEvaluator. This CL adds this ability by storing the tuple literals in a way that is compatible with the HloEvaluator.

This CL adds a feature in the host instrumentation where tuple outputs are handled correctly. The instrumentation handler was not storing the tuple literals in a way that was compatible...

copybara-service[bot]

A minor update to the HloEvaluatorWithSubstitution where the type is updated from Literal to LiteralBase to allow BorrowingLiteral outputs be passed to the function.

copybara-service[bot]

[XLA:GPU] Plug xla_gpu.loop into EmitThreadLoop.

copybara-service[bot]

[IFRT] Introduce Client::AllocateDevices() and DeviceAllocation

[IFRT] Introduce Client::AllocateDevices() and DeviceAllocation `xla::ifrt::Client::AllocateDevices()` is a new API that processes a user request for getting an ordered set of devices that satisfies constraints specified in the request. It...

copybara-service[bot]

Add the host memory deallocation in GpuExecutor::Deallocate

3

This CL adds the missing host memory deallocation according to the pointer's host memory space allocated by GpuExecutor::Allocate() .

zhenying-liu

xla
xla copied to clipboard

Metadata

Add some additional args for select_and_scatter_test for certain backends. Also clean up includes for the select_and_scatter_test.

[XLA:MSA] Convert synchronous slices to async.

make presubmit happy for DO_NOT_SUBMIT cl

Remove build_cuda_plugin_from_source in xla CI.

Ensure BFloat16Propagation respects if an instruction does not support mixed precision.

A minor update to the HloEvaluatorWithSubstitution where the type is updated from Literal to LiteralBase to allow BorrowingLiteral outputs be passed to the function.

[XLA:GPU] Plug xla_gpu.loop into EmitThreadLoop.

[IFRT] Introduce Client::AllocateDevices() and DeviceAllocation

Add the host memory deallocation in GpuExecutor::Deallocate

← Metadata

Owner

Metadata

xla xla copied to clipboard

Metadata

← Metadata

Owner

Metadata

xla
xla copied to clipboard