see here
Hey @nsosio, just clarifying here: for PyTorch (#21), is it simply using the HF PyTorch .bin file for Llama-2 7B at fp16/32 precision?
Whereas for gpt-fast, it's the latest implementation from PyTorch Labs, right?