xla issues

[XLA:CPU] Propagate HloXlaRuntimePipelineOptions for fusion outlining

[XLA:CPU] Propagate HloXlaRuntimePipelineOptions for fusion outlining In case of fusion outlining, enable expetimental deallocation and disable sparse bufferization.

copybara-service[bot]

Always scalarize thlo.reverse.

copybara-service[bot]

[XLA:CPU] Add `xla_cpu_enable_mlir_fusion_outlining` flag

[XLA:CPU] Add `xla_cpu_enable_mlir_fusion_outlining` flag Enables fusion outlining into functions. This is to improve compile time.

copybara-service[bot]

Also build all xla targets

copybara-service[bot]

#tf-data-service Improve error handling for SnapshotManager.

#tf-data-service Improve error handling for SnapshotManager. If the snapshot manager receives an error from a worker: 1. It writes a StatusProto to an ERROR file. The error status can be...

copybara-service[bot]

[XLA] Add int4 types to MHLO translate.

copybara-service[bot]

[PJRT C API] Bump up the xla_client version as the signature of make_c_api_client was changed in a previous change.

copybara-service[bot]

[PJRT:C] Implement C API version of xla::PjRtChunk.

copybara-service[bot]

Add set_to_apply_wo_fusioncheck function to skip fusion check when assigning a to_apply computation.

copybara-service[bot]

Explore performance of XLA:CPU on ARM.

1

@sherhut @d0k @jreiffers It would be interesting to benchmark XLA:CPU Next on ARM. I am starting this issue to track the progress and also to share information about the code...

pifon2a

xla
xla copied to clipboard

Metadata

[XLA:CPU] Propagate HloXlaRuntimePipelineOptions for fusion outlining

Always scalarize thlo.reverse.

[XLA:CPU] Add `xla_cpu_enable_mlir_fusion_outlining` flag

Also build all xla targets

#tf-data-service Improve error handling for SnapshotManager.

[XLA] Add int4 types to MHLO translate.

[PJRT C API] Bump up the xla_client version as the signature of make_c_api_client was changed in a previous change.

[PJRT:C] Implement C API version of xla::PjRtChunk.

Add set_to_apply_wo_fusioncheck function to skip fusion check when assigning a to_apply computation.

Explore performance of XLA:CPU on ARM.

← Metadata

Owner

Metadata

xla xla copied to clipboard

Metadata

← Metadata

Owner

Metadata

xla
xla copied to clipboard