junyou2001
Results
1
issues of
junyou2001
### Description / 描述 你好! 我注意到minicpm4-0.5B中描述无法支持sparse attention,但是我看modeling中有就修改为可用,但报错: ``` topk_idx[topk_idx >= q_idx[None, :, None]] = -1 RuntimeError: The size of tensor a (355) must match the size of tensor b (711)...
badcase