Ben Vanik
As a data point I've used sccache locally and it worked as expected for our cmake builds.
PTAL; fixed the leak and made it non-fatal (it'll just print a disassembly failed line).
` %1 = torch.operator "onnx.Concat"(%0#0, %0#1, %0#2, %0#3, %0#4, %0#5, %0#6, %0#7) {torch.onnx.axis = 3 : si64} : (!torch.vtensor, !torch.vtensor, !torch.vtensor, !torch.vtensor, !torch.vtensor, !torch.vtensor, !torch.vtensor, !torch.vtensor) -> !torch.vtensor` axis 3...
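For intuition only (not how the compiler handles it): `onnx.Concat` with `axis = 3 : si64` joins its eight operands along the fourth dimension. A NumPy sketch with made-up shapes:

```python
import numpy as np

# Eight rank-4 tensors with differing sizes along axis 3 (shapes are invented
# for illustration; the vtensors in the IR above are unshaped/dynamic).
parts = [np.zeros((1, 2, 3, i + 1), dtype=np.float32) for i in range(8)]

# Concatenation along axis 3, mirroring onnx.Concat's axis attribute.
result = np.concatenate(parts, axis=3)

# The last dimension becomes the sum of the inputs' last dimensions:
# 1 + 2 + ... + 8 = 36.
print(result.shape)  # (1, 2, 3, 36)
```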
yeah, we can't be polluting the source folder - that'll have to change. I'll take a look at the rest in the morning - at a quick glance there's some style...
yeah, that path is wrong - no clue why `/lib/` is in there, but the `lib` prefix is something we need to drop - here's what we do elsewhere: ```...
yeah, that loop is going to be a problem (it's going to run entirely on the host in the VM interpreter) - tensorizing may help a bit but it really...
I'd start by looking at converting the ops if there are equivalents - it's absolutely required that those loops end up inside a dispatch region, and doing that with linalg...
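As a rough illustration of the tensorizing idea above (a NumPy stand-in, not the actual IR transformation): the goal is to turn a per-element host loop into whole-tensor ops that can lower into a single dispatch region.

```python
import numpy as np

x = np.arange(8, dtype=np.float32)

# Scalar loop: each iteration would execute on the host (e.g. in an
# interpreter), one element at a time.
out_loop = np.empty_like(x)
for i in range(x.shape[0]):
    out_loop[i] = x[i] * 2.0 + 1.0

# Tensorized form: one whole-array expression, the kind of op that can be
# fused into a dispatch region instead of looping on the host.
out_tensor = x * 2.0 + 1.0

assert np.array_equal(out_loop, out_tensor)
```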
(this looks like a great win but has gone stale - please reopen/rebase if it's still needed!)
The buffer usage flags have been reworked and `IREE_HAL_BUFFER_USAGE_SHARING_IMMUTABLE` is likely the thing to key off. We may also want to require `IREE_HAL_BUFFER_USAGE_SHARING_CONCURRENT`. The use case for this is wanting...
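A minimal sketch of keying off those usage bits. The real flags are C enum bits in the IREE HAL; the numeric values and the `can_share` helper below are invented for illustration, and whether `SHARING_CONCURRENT` is also required is the open question from the comment above.

```python
# Hypothetical stand-in values for the IREE HAL buffer usage bits
# (the actual bit positions in iree_hal_buffer_usage_t differ).
IREE_HAL_BUFFER_USAGE_SHARING_IMMUTABLE = 1 << 10
IREE_HAL_BUFFER_USAGE_SHARING_CONCURRENT = 1 << 11

def can_share(usage: int) -> bool:
    """Illustrative check: require both immutable and concurrent sharing bits.

    This encodes the stricter interpretation (both flags required); the
    looser variant would test only SHARING_IMMUTABLE.
    """
    required = (IREE_HAL_BUFFER_USAGE_SHARING_IMMUTABLE
                | IREE_HAL_BUFFER_USAGE_SHARING_CONCURRENT)
    return (usage & required) == required

# A buffer with only the immutable bit set fails the stricter check.
print(can_share(IREE_HAL_BUFFER_USAGE_SHARING_IMMUTABLE))  # False
```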