GPTFast

Accelerate your Hugging Face Transformers 6-7x. Native to Hugging Face and PyTorch.

10 GPTFast issues

Huggingface -> Hugging Face

Very interesting work! I see you pinned `torch==2.1.2`. PyTorch 2.2 promises a 2x improvement to `scaled_dot_product_attention` and a few `torch.compile` improvements: https://pytorch.org/blog/pytorch2-2/ I don't think using PyTorch 2.2 will...
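For context, the kernel the issue refers to is `torch.nn.functional.scaled_dot_product_attention`, which dispatches to fused implementations (e.g. FlashAttention) under the hood. A minimal sketch of a call, with arbitrary toy shapes (this is not GPTFast code, just the PyTorch API the issue discusses):

```python
import torch
import torch.nn.functional as F

# Toy tensors shaped (batch, heads, seq_len, head_dim).
q = torch.randn(1, 8, 16, 64)
k = torch.randn(1, 8, 16, 64)
v = torch.randn(1, 8, 16, 64)

# Fused attention; is_causal=True applies a causal mask, as in decoding.
out = F.scaled_dot_product_attention(q, k, v, is_causal=True)
print(out.shape)  # torch.Size([1, 8, 16, 64])
```

Since the call signature is unchanged between 2.1 and 2.2, the reported speedups should apply without code changes.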

The current pinned requirements make the install incompatible with newer PyTorch or Transformers versions. They should be set as minimum requirements instead.

Hello, I am having difficulties running GPTFast on Mistral-7b-v0.1, encountering the same errors as reported here: https://github.com/MDK8888/GPTFast/issues/25. My assumption is that the model_config is not set properly (I am currently...

Hi there, thanks for creating this repo. I wanted to know what the config should be for Llama-2-7b-chat-hf, given that it's the below for the gpt and opt architectures: ``` "gpt": { "path_to_blocks":...

Could you help by giving example code to run GPTFast on Mixtral-8x7B-Instruct-v0.1? I load the model with GPTFast with an empty draft_model_name. An error appears when loading the model, as follows....

# To reproduce
- `pip install gpt-fast`
- run the code included in the README
- reinstalling numpy with `!pip install numpy --upgrade` fixes the numpy error, but then there...

Dear Sir, I checked the demo code of GPTFast 0.2.1 and found that the function argmax_variation(...) is not used at all. Could you please explain this? Many thanks.

I am trying to use this project with a vision-language model like https://huggingface.co/docs/transformers/en/model_doc/llava_next, but currently this repo does not support the vision part of the model. I have a separate script...

Hi! I don't quite understand how this project works. I guess my main question is: what is a draft model? For example, I would like to speed up...
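The "draft model" question refers to speculative decoding: a small, fast draft model proposes several tokens cheaply, and the large target model verifies them in a single pass, keeping the accepted prefix. A toy sketch of the control flow, where `draft_model` and `target_model` are hypothetical stand-ins using arithmetic rules in place of real networks (this is not GPTFast's API):

```python
def draft_model(prefix, k=4):
    # Cheap proposer: guesses the next k tokens.
    # Toy rule: just counts up from the last token.
    return [prefix[-1] + i + 1 for i in range(k)]

def target_model(prefix, proposed):
    # Expensive verifier: scores all proposed tokens in one pass.
    # Toy rule: it "agrees" with tokens not divisible by 3; at the
    # first disagreement it emits its own token and stops.
    verified = []
    for tok in proposed:
        if tok % 3 != 0:          # draft token accepted
            verified.append(tok)
        else:                     # rejected: substitute target's token
            verified.append(tok + 3)
            break
    return verified

def speculative_decode(prompt, steps=3, k=4):
    seq = list(prompt)
    for _ in range(steps):
        proposed = draft_model(seq, k)
        seq.extend(target_model(seq, proposed))
    return seq

print(speculative_decode([0], steps=2))  # → [0, 1, 2, 6, 7, 8, 12]
```

The speedup comes from the verifier accepting several draft tokens per forward pass instead of generating one token at a time; in the worst case (every proposal rejected) it degrades to roughly standard decoding.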