Intermittent `bifrost.linalg` test failures
Occasionally we see test failures on the self-hosted bifrost.linalg suite. Now that I'm looking for one to point to I cannot find one.
Here's one: https://github.com/ledatelescope/bifrost/pull/167#issuecomment-1152494636
I wonder if this is somehow related to #210. The only places where BF_STATUS_UNSUPPORTED_SHAPE can be thrown from a LinAlg call are in linalg_kernels.cu:
bf_cherk_Nbf_cgemm_TN_smallM_staticN_v2bf_cgemm_TN_smallM
These are all kind of trivial though. It's mostly value checking for the matrix shape. There are a couple of comparisons of the batch size with the texture memory size that can also throw this. It would be nice to know exactly which BF_STATUS_UNSUPPORTED_SHAPE we are hitting.