Ian Barber

Results: 2 issues by Ian Barber

Very slow tokens/second in FP32; it feels worse than it should be, but I'm not entirely sure of the best way to debug it. `$ python3 torchchat.py generate --prompt "hello model" -v llama2...`

performance
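One way to start debugging throughput, independent of the model runner, is to time the token loop yourself and compute tokens/second directly. A minimal sketch of that measurement — the `fake_generate` generator below is a hypothetical stand-in for the real decode step, not torchchat's API:

```python
import time

def fake_generate(n_tokens):
    # Hypothetical stand-in for a model's per-token decode loop.
    for _ in range(n_tokens):
        time.sleep(0.001)  # simulate per-token latency
        yield 0

def tokens_per_second(token_iter):
    # Time how long it takes to consume the iterator and count tokens.
    start = time.perf_counter()
    count = sum(1 for _ in token_iter)
    elapsed = time.perf_counter() - start
    return count / elapsed

rate = tokens_per_second(fake_generate(50))
print(f"{rate:.1f} tokens/sec")
```

Comparing this kind of wall-clock number across dtypes (FP32 vs. FP16/BF16) usually shows quickly whether the slowdown is in the decode loop itself or elsewhere.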

#### Context

What is the purpose of this PR? Is it to

- [x] add a new feature
- [ ] fix a bug
- [ ] update tests...

CLA Signed