Salvatore Mesoraca comments

Repositories
Issues
Comments

Results 4 comments of


                                            Salvatore Mesoraca

vulkan: mul_mat: fix UB with small warps

When I tested it, it passed all tests (e.g. test-backend-ops). So it seems to work, but I don't know if there is any corner case in which it would return...

Add conv_transpose_1d_gemm

> > Implementing conv_transpose_1d via mulmat+col2im uses more memory but it can be ~80% faster (based on a very simple benchmark I did) than the current approach when the input...

Add conv_transpose_1d_gemm

Oh wait. I thought that your > I'm not convinced that the code in this PR is necessarily an upgrade over the code on master was specifically referring to something...

Add conv_transpose_1d_gemm

BTW, if the increased memory usage is a concern, instead of re-implementing the existing `conv_transpose_1d`, I could add a new `conv_transpose_1d_gemm` and let users decide for themselves.