Aditya Ware

Results: 2 comments by Aditya Ware

Hi, I'd like to help fix this issue. I'm currently investigating the shape mismatch in the attention mask construction, specifically why attn_mask has shape (1, 1447, 1, 5234) while...

Following up: I'm happy to prepare a fix if this issue is unassigned.