gpt-fast
Simple and efficient PyTorch-native transformer text generation in <1000 LOC of Python.
Results: 132 gpt-fast issues
When I use stories15M and stories110M, I get an error:
```
File "D:\_LLM_project\Development\gpt-fast\generate.py", line 114, in speculative_decode
    torch.cat([cur_token.view(1), draft_tokens]).view(1, -1),
RuntimeError: Tensors must have same number of dimensions: got 1...
```
Currently, I can only use small models. Is there a recommended model at around 1B parameters?