Kevin Heffernan
Results: 2 issues of Kevin Heffernan
**Describe the bug:** The config for LLaMA 3.2 3B does not use tied weights; only the 1B model is currently supported (https://github.com/facebookresearch/fairseq2/blob/main/src/fairseq2/models/llama/_config.py#L257). **Describe how to reproduce:** Loaded LLaMA 3.2...
bug
**Describe the bug:** When `max_gen_len` is specified, the `SamplingSequenceGenerator` can generate more than `max_gen_len` tokens for every batched sequence whose prompt is shorter than the longest prompt in the batch....
bug
generation
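
The `max_gen_len` report above can be illustrated with a small arithmetic sketch. It is not fairseq2 code: it assumes, without confirmation from the issue, that the overshoot comes from a single stop length shared by the whole batch and derived from the longest prompt. The function names `shared_limit_lengths` and `per_sequence_lengths` are hypothetical.

```python
from typing import List


def shared_limit_lengths(prompt_lens: List[int], max_gen_len: int) -> List[int]:
    """Tokens generated per sequence when the batch shares one stop length.

    Assumption (not confirmed by the issue): generation stops for every
    sequence once the total length reaches longest_prompt + max_gen_len.
    """
    shared_limit = max(prompt_lens) + max_gen_len
    # A shorter prompt has more room before it hits the shared limit.
    return [shared_limit - p for p in prompt_lens]


def per_sequence_lengths(prompt_lens: List[int], max_gen_len: int) -> List[int]:
    """Tokens generated per sequence when each sequence has its own limit."""
    # Each sequence stops at its own prompt length plus max_gen_len.
    return [(p + max_gen_len) - p for p in prompt_lens]


if __name__ == "__main__":
    prompts = [3, 7, 10]
    print(shared_limit_lengths(prompts, max_gen_len=5))  # [12, 8, 5]
    print(per_sequence_lengths(prompts, max_gen_len=5))  # [5, 5, 5]
```

Under this assumption, only the sequence with the longest prompt respects `max_gen_len`; every shorter prompt overshoots by the difference between its length and the longest prompt length, which matches the behaviour the issue describes.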