No Name

Results: 2 issues from No Name

Hi, if I take the same encoder input and pad it to a different maximum length, I get noticeably different encoder memory key/value tensors from decoder cross-attention. And...

Are you planning to upgrade the version of CUTLASS you are using in the near future? If not, are you willing to accept a pull request from us with such...