cudnn-frontend
cudnn-frontend copied to clipboard
Is performance better when not using padding?
Right now -- my k/v vectors are padded since I have different sequence lengths. I was wondering, is performance better using ragged tensors / non-padded key/value vectors?