Jerry Wu
Making it optional and off by default on PR runs SGTM.
I'm actually more than happy to try to fix this :) Also assigning it to myself.
It looks like there is a size regression from dropping unit dims on vector transfer + converting trivial shape_cast to a no-op (https://github.com/openxla/iree/pull/14220#issuecomment-1606270765). I'll investigate it further.
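For reference, a minimal hedged sketch of what that rewrite produces (types and value names are illustrative, not the IR from the actual dispatch): dropping the unit dim from a `vector.transfer_read` introduces a rank-reducing `memref.subview` plus a `vector.shape_cast` back to the original vector type, and that `vector.shape_cast` is the kind of op the later comments are about.

```mlir
// Hedged sketch, assuming the usual drop-unit-dims rewrite on a
// transfer_read; types and value names are illustrative.
func.func @drop_unit_dims_example(%src: memref<1x4xf32>) -> vector<1x4xf32> {
  %c0 = arith.constant 0 : index
  %pad = arith.constant 0.0 : f32
  // Before the rewrite:
  //   %v = vector.transfer_read %src[%c0, %c0], %pad {in_bounds = [true, true]}
  //       : memref<1x4xf32>, vector<1x4xf32>
  // After the rewrite, the unit dim is dropped via a rank-reducing subview
  // and re-expanded with a shape_cast:
  %sub = memref.subview %src[0, 0] [1, 4] [1, 1]
      : memref<1x4xf32> to memref<4xf32>
  %v0 = vector.transfer_read %sub[%c0], %pad {in_bounds = [true]}
      : memref<4xf32>, vector<4xf32>
  %v = vector.shape_cast %v0 : vector<4xf32> to vector<1x4xf32>
  return %v : vector<1x4xf32>
}
```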
The problem is in `VectorReductionToGPU`: the patterns in `mlir::vector::populatePropagateWarpVectorDistributionPatterns` can't handle the `vector.shape_cast`, which results in bad warp distribution:

```mlir
----- After VectorReduceToGPU -----
func.func @main_dispatch_84_generic_2x256_i8xi32() {
  %c0 = arith.constant...
```
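To make the failure mode concrete, a hedged sketch (illustrative shapes, not the real dispatch): the propagation patterns hoist ops like `vector.transfer_read` and elementwise ops out of `vector.warp_execute_on_lane_0`, but with no pattern for `vector.shape_cast` the value stays at its full sequential size inside the warp region.

```mlir
// Hedged sketch: "some_def" stands in for whatever produces the full
// warp-sized vector inside the region.
func.func @warp_shape_cast_example(%laneid: index) -> vector<1x1xf32> {
  %r = vector.warp_execute_on_lane_0(%laneid)[32] -> (vector<1x1xf32>) {
    %v = "some_def"() : () -> (vector<32xf32>)
    // No distribution pattern handled this cast, so it (and everything
    // feeding it) could not be distributed out of the warp region.
    %cast = vector.shape_cast %v : vector<32xf32> to vector<1x32xf32>
    vector.yield %cast : vector<1x32xf32>
  }
  return %r : vector<1x1xf32>
}
```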
The patch to fix the vector distribution has been sent out for review: https://reviews.llvm.org/D154870
Unassigned myself as I'm not working on this now
Sure, I'll take a look this week
So my understanding of the TODO tasks is:
1. Currently we don't support DT for `[f32, i8, f32]`. But we should experiment with DT + codegen to see if a...
I have a prototype to promote matmul when a ukernel is available: #15873. I think one issue is that we currently don't fuse `arith.sitofp` into the ukernel dispatch. AFAIK there are two potential ways:...
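Whichever way ends up being used, here's a hedged MLIR sketch of the `[f32, i8, f32]` case being discussed: the i8 operand is promoted to f32 with `arith.sitofp` (as a `linalg.generic`) right before the matmul, and the question is whether that promotion lands in the same dispatch as the ukernel. Shapes and value names are illustrative.

```mlir
// Hedged sketch of the producer/consumer pair in question; shapes and
// names are illustrative, not from a real model.
func.func @promote_then_matmul(%lhs: tensor<128x256xf32>,
                               %rhs_i8: tensor<256x64xi8>,
                               %acc: tensor<128x64xf32>) -> tensor<128x64xf32> {
  %init = tensor.empty() : tensor<256x64xf32>
  // Promotion of the i8 operand to f32; whether this generic gets fused
  // into the ukernel dispatch is the open question above.
  %rhs_f32 = linalg.generic {
      indexing_maps = [affine_map<(d0, d1) -> (d0, d1)>,
                       affine_map<(d0, d1) -> (d0, d1)>],
      iterator_types = ["parallel", "parallel"]}
      ins(%rhs_i8 : tensor<256x64xi8>)
      outs(%init : tensor<256x64xf32>) {
  ^bb0(%in: i8, %out: f32):
    %p = arith.sitofp %in : i8 to f32
    linalg.yield %p : f32
  } -> tensor<256x64xf32>
  // The matmul that would be lowered to an mmt4d/ukernel path under DT.
  %res = linalg.matmul
      ins(%lhs, %rhs_f32 : tensor<128x256xf32>, tensor<256x64xf32>)
      outs(%acc : tensor<128x64xf32>) -> tensor<128x64xf32>
  return %res : tensor<128x64xf32>
}
```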
Unassigned myself as I'm not actively working on this