Jack Kosaian comments

Results 15 comments of


                                            Jack Kosaian

[QST] Gather/Scatter in cute/cutlass 3

@jeromeku, It sounds like you've already figured out where new `Arguments` should be placed: [here](https://github.com/NVIDIA/cutlass/blob/47a3ebbea9860e14c095b52c4e6e2db33340f572/include/cutlass/gemm/kernel/gemm_grouped.h#L130). You'll also need to add them to the kernel's `Params` struct [here](https://github.com/NVIDIA/cutlass/blob/47a3ebbea9860e14c095b52c4e6e2db33340f572/include/cutlass/gemm/kernel/gemm_grouped.h#L217), similar to how...

[QST] Gather/Scatter in cute/cutlass 3

Any sort of padding would need to be handled externally to `can_implement`. You would need to pad your tensors, problem shapes, etc. before setting them in the `Arguments` struct.

[QST] Gather/Scatter in cute/cutlass 3

> Are there any examples of gather / scatter fusion and grouped_gemm specifically for Ampere architectures using Cutlass 3.0+ and CuTe? We do not have examples of this. > How...

[QST] How to use cutlass in tensorrt_llm plugin?

@yuanjiechen , what values of `N H W C K R S P Q` are you using in your Python example?

[QST] How to use cutlass in tensorrt_llm plugin?

Thanks for the details. Can you also tell me the stride, dilation, and padding values you used? I'll look into the alignment issue that you mentioned. Regarding easier support for...