pytorch-llama icon indicating copy to clipboard operation
pytorch-llama copied to clipboard

causal attention mask

Open itera-del opened this issue 10 months ago • 0 comments

Why is causal attention mask not used?

itera-del avatar Dec 23 '24 14:12 itera-del