No Name
Results
2
issues of
No Name
Hi, If I take the same encoder input and pad it to a different maximum length, then I get noticeably different encoder memory key/value tensors from decoder cross attention. And...
Are you planning to upgrade the version of CUTLASS you are using in the near future? If not, are you willing to accept a pull request from us with such...