Tensile
Tensile copied to clipboard
Add defineLocalSgpr
Added a new method to manage SGPR usage: defineLocalSgpr. This allows in-place defining/undefing of SGPRs.
Also added new parameter PrintRegisterDebug
that informs the invoker if SGPRs have not been explicitly undefined, as a optimization hint.
The old defineSgpr function still works as expected and is in place elsewhere in the code.
CI tests failed due to incorrect asm code. Please fix them.
I think it is better to run precheckin test locally before you commit code change.
@awhittle3 Some CI tests failed, do we still need this change?
@awhittle3 Some CI tests failed, do we still need this change?
This PR will conflict with PR #1843, which probably has higher priority. I'll address the issues here once that PR is merged into develop
.
@AlexBrownAMD believes this change will be helpful in managing SGPR usage.
@awhittle3 Some CI tests failed, do we still need this change?
This PR will conflict with PR #1843, which probably has higher priority. I'll address the issues here once that PR is merged into
develop
.@AlexBrownAMD believes this change will be helpful in managing SGPR usage.
#1843 is merged today. Would you please restart this PR?
@awhittle3 we are approaching FC 6.2 and we want to clear all the requests. Please let me know if you are planning to proceed with this PR. Thanks.
Out of date and would take a while to spin back up. Closing work on this for now.