gpt-fast

Simple and efficient PyTorch-native transformer text generation in <1000 LOC of Python.

Results 132 gpt-fast issues

When I use stories15M and stories110M, I get an error.
```
File "D:\_LLM_project\Development\gpt-fast\generate.py", line 114, in speculative_decode
    torch.cat([cur_token.view(1), draft_tokens]).view(1, -1),
RuntimeError: Tensors must have same number of dimensions: got 1...
```

Currently, I can only use small models. Is there an option to find an appropriate model around the 1B parameter size?