[BUG] FAILED to compile example 47_ampere_gemm_universal_streamk

Open hcyz33 opened this issue 1 year ago • 4 comments

Describe the bug

I tried to replace half_t with bfloat16_t in examples/47_ampere_gemm_universal_streamk/ampere_gemm_universal_streamk.cu, but encountered compilation errors.

Steps/Code to reproduce bug

Here is the diff: [Image]

Here is part of the error: [Image]

hcyz33 avatar Sep 14 '24 07:09 hcyz33

This issue has been labeled inactive-30d due to no recent activity in the past 30 days. Please close this issue if no further response or action is needed. Otherwise, please respond with a comment indicating any updates or changes to the original issue and/or confirm this issue still needs to be addressed. This issue will be labeled inactive-90d if there is no activity in the next 60 days.

github-actions[bot] avatar Oct 14 '24 08:10 github-actions[bot]

Have you tried changing the accumulation type to fp32? See https://docs.nvidia.com/cuda/ampere-tuning-guide/index.html for details on which datatype configurations are supported.
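To make the fp32 suggestion concrete, here is a minimal sketch of what the type aliases might look like. The alias names (ElementA, ElementB, ElementC, ElementAccumulator) are assumptions about how the example names its types, since the actual diff in this issue is only available as an image. As far as I know, the Ampere tensor core MMA instructions for bf16 operands only accumulate in fp32, so leaving the accumulator as bfloat16_t would have no matching tensor op.

```cpp
// Sketch only: needs the CUTLASS headers on the include path.
#include <type_traits>
#include "cutlass/numeric_types.h"  // cutlass::bfloat16_t, cutlass::half_t

// Operand and output types switched from cutlass::half_t to cutlass::bfloat16_t.
// (Alias names are assumed, not taken from the diff in this issue.)
using ElementA = cutlass::bfloat16_t;
using ElementB = cutlass::bfloat16_t;
using ElementC = cutlass::bfloat16_t;

// Assumption: the accumulator is set to fp32 instead of being switched to
// bfloat16_t together with the operands.
using ElementAccumulator = float;

// Compile-time guard so that a blanket half_t -> bfloat16_t search-and-replace
// does not silently change the accumulator as well.
static_assert(std::is_same<ElementAccumulator, float>::value,
              "bf16 operands are expected to accumulate in fp32 here");
```

Other aliases derived from the accumulator, for example an epilogue compute type, may need the same change; that depends on how the example is written.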

d-k-b avatar Oct 14 '24 18:10 d-k-b

This issue has been labeled inactive-30d due to no recent activity in the past 30 days. Please close this issue if no further response or action is needed. Otherwise, please respond with a comment indicating any updates or changes to the original issue and/or confirm this issue still needs to be addressed. This issue will be labeled inactive-90d if there is no activity in the next 60 days.

github-actions[bot] avatar Nov 13 '24 19:11 github-actions[bot]

This issue has been labeled inactive-90d due to no recent activity in the past 90 days. Please close this issue if no further response or action is needed. Otherwise, please respond with a comment indicating any updates or changes to the original issue and/or confirm this issue still needs to be addressed.

github-actions[bot] avatar Feb 11 '25 20:02 github-actions[bot]