Nirvedh Meshram comments

Results 51 comments of


                                            Nirvedh Meshram

error: 'func.func' op unhandled function with multiple blocks

I dont think the compiler can support such a dispatch region, I think it would be more interesting to see how we ended up forming this region due to https://github.com/iree-org/iree/pull/18618...

IREE Generic Vectorizer to support `i1` pattern in conv.

I am able to vectorize this with this patch https://github.com/llvm/llvm-project/compare/main...nirvedhmeshram:llvm-project:conv_i1 I will try to land it upstream.

IREE Generic Vectorizer to support `i1` pattern in conv.

I have a PR open upstream for this https://github.com/llvm/llvm-project/pull/109480

IREE Generic Vectorizer to support `i1` pattern in conv.

This was fixed by the upstream PR

[LLVMGPU][ROCm] SDXL int8 fails to compile on gfx90a

This failure is in the lowering of a (i8,i8)->i32 batch matmul op here is a smaller repro IR ``` module { func.func @punet_repro(%11 : tensor, %12 : tensor, %13 :...

[LLVMGPU][ROCm] SDXL int8 fails to compile on gfx90a

I have a WIP PR https://github.com/iree-org/iree/pull/18433 that solves seems to solve the issue (at least the batchmatmul compiles), I will clean it up and add some e2e tests to it...

[compilation][cpu]: failed to legalize operation onnx.Multinomial

@pdhirajkumarprasad For this and all the other bugs you filed recently, could you please add this flag to `--mlir-print-ir-after-all` to the iree-compile command and copy the dump to a file...

[Codegen][AMDGPU Backend] Correctness issue for conv_2d_ngchw_gfchw

> Here is the llvm IR for the above example: https://gist.github.com/qedawkins/acce2625d09bac6caa51f53a304df9fe > > Disabling the LoadStoreVectorizerPass appears to fix the issue: https://github.com/llvm/llvm-project/blob/6fcea431eed78f75e8ddb48e074c0078b93c109f/llvm/lib/Target/AMDGPU/AMDGPUTargetMachine.cpp#L1230 @qedawkins can you share the .rocmasm files generated...

Padding failures after LLVM bump

@Groverkss this is not urgent, so whenever you have cycles could you please take a look.

[GPU] Prefer TileAndFuse pipeline over SIMT pipeline

> Can we observe the difference using the gemm shapes in https://github.com/nod-ai/iree-kernel-benchmark ? As I am running it I am realizing that all of but three shapes in the benchmark...