Ethan He
Results
13 issues of Ethan He
`core_attention_bias_type` needs to be passed in order to use ALiBi from Transformer Engine: https://docs.nvidia.com/deeplearning/transformer-engine/user-guide/api/pytorch.html?highlight=alibi#transformer_engine.pytorch.DotProductAttention.forward
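For context on the issue above, here is a minimal pure-Python sketch of the kind of bias ALiBi adds to attention scores. It is not Transformer Engine code. The helper names `alibi_slopes` and `alibi_bias` are hypothetical, the slope schedule is the commonly used geometric one, and a power-of-two head count is assumed. In Transformer Engine itself the bias would presumably be requested through the `core_attention_bias_type` argument described in the linked docs rather than built by hand.

```python
def alibi_slopes(num_heads):
    """Per-head ALiBi slopes (hypothetical helper; assumes a power-of-two head count)."""
    if num_heads <= 0 or num_heads & (num_heads - 1):
        raise ValueError("this sketch only handles power-of-two head counts")
    # Geometric sequence: head h gets slope 2^(-8 * (h + 1) / num_heads).
    start = 2.0 ** (-8.0 / num_heads)
    return [start ** (h + 1) for h in range(num_heads)]


def alibi_bias(num_heads, seq_len):
    """Bias of shape [heads][query][key]: a linear penalty on query-key distance."""
    return [
        [[-slope * abs(q - k) for k in range(seq_len)] for q in range(seq_len)]
        for slope in alibi_slopes(num_heads)
    ]


if __name__ == "__main__":
    bias = alibi_bias(num_heads=8, seq_len=4)
    # Penalties grow with distance from the diagonal for the first head.
    for row in bias[0]:
        print(row)
```

The bias is added to the attention scores before the softmax, so more distant keys are penalized more heavily, and each head uses a different slope.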
Hi, could you provide the model and code for the locator?
```
Successfully loaded and sharded model parameters!
  0%| | 0/155 [00:00
```