xla
A machine learning compiler for GPUs, CPUs, and ML accelerators
Make TpuExecutor use StreamExecutorInterface to create Events.
[XLA] [NFC] Serialize all autotuning results. The previous logic for filtering by-module wasn't correct, as instructions could be modified after autotuning, resulting in not all relevant information being serialized. This could...
It's more granular than the existing --xla_gpu_deterministic_ops because it allows running an autotuning compilation with non-deterministic ops disabled. --xla_gpu_deterministic_ops is a superset of --xla_gpu_exclude_nondeterministic_ops, so setting --xla_gpu_deterministic_ops=true also sets --xla_gpu_exclude_nondeterministic_ops=true...
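A minimal sketch of the superset relationship described above; the GpuFlags struct and helper function are hypothetical and only illustrate how the two flags combine:

```cpp
// Hypothetical struct mirroring the two debug flags; not part of the XLA API.
struct GpuFlags {
  bool xla_gpu_deterministic_ops = false;
  bool xla_gpu_exclude_nondeterministic_ops = false;
};

// Nondeterministic ops are excluded if either flag is set: full determinism
// implies exclusion, but exclusion alone does not force full determinism.
bool ExcludeNondeterministicOps(const GpuFlags& flags) {
  return flags.xla_gpu_deterministic_ops ||
         flags.xla_gpu_exclude_nondeterministic_ops;
}
```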
Remove `TARGET_FILTER` from build script, tag tests that should be filtered
Use StreamExecutorInterface::CreateEvent in event_pool.cc.
Add option to XLA to enforce inlining before LLVM SplitModule, or set PreserveLocals=false to get more balanced splits in the parallel compilation case. Some data from a GPT-3 5B model with different...
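A rough sketch of the splitting step this refers to, assuming a recent LLVM where llvm::SplitModule takes a Module& and a PreserveLocals flag; the wrapper function name is made up for illustration:

```cpp
#include <memory>
#include <vector>

#include "llvm/IR/Module.h"
#include "llvm/Transforms/Utils/SplitModule.h"

// Split one LLVM module into `num_parts` pieces for parallel compilation.
// Passing PreserveLocals=false lets the splitter externalize local symbols,
// which tends to produce more balanced partitions.
std::vector<std::unique_ptr<llvm::Module>> SplitForParallelCompilation(
    llvm::Module& module, unsigned num_parts, bool preserve_locals) {
  std::vector<std::unique_ptr<llvm::Module>> parts;
  llvm::SplitModule(
      module, num_parts,
      [&parts](std::unique_ptr<llvm::Module> part) {
        parts.push_back(std::move(part));
      },
      /*PreserveLocals=*/preserve_locals);
  return parts;
}
```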
Use absl::Status instead of xla::Status now that they're identical.
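Since xla::Status was already an alias for absl::Status, the change is mechanical; a hypothetical call site simply spells the type as absl::Status:

```cpp
#include <cstdint>

#include "absl/status/status.h"

// Hypothetical call site: previously this return type would have been
// spelled xla::Status, which aliased absl::Status.
absl::Status CheckBufferSize(int64_t size_bytes) {
  if (size_bytes <= 0) {
    return absl::InvalidArgumentError("buffer size must be positive");
  }
  return absl::OkStatus();
}
```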
Propagate the error to the output if an input buffer has an error.
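A minimal sketch of the propagation idea, with a made-up buffer type and execute function; only the early return on an errored input reflects the change described:

```cpp
#include <vector>

#include "absl/status/status.h"
#include "absl/status/statusor.h"
#include "absl/types/span.h"

// Hypothetical buffer type, for illustration only.
struct DeviceBuffer {
  std::vector<float> data;
};

// If any input buffer already holds an error, forward that error to the
// output instead of executing on poisoned inputs.
absl::StatusOr<DeviceBuffer> Execute(
    absl::Span<const absl::StatusOr<DeviceBuffer>> inputs) {
  for (const absl::StatusOr<DeviceBuffer>& input : inputs) {
    if (!input.ok()) {
      return input.status();
    }
  }
  return DeviceBuffer{};  // Placeholder for the real computation.
}
```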