gpt-fast Fixing block size for Mistral-7B.

Fixing block size for Mistral-7B.

Open Artyom17 opened this issue 11 months ago • 1 comments

According to Mistral's paper the block size for Mistral-7B should be 8192 (ref: https://arxiv.org/pdf/2310.06825.pdf, https://huggingface.co/docs/transformers/en/model_doc/mistral). But currently it is set to the default value (2048).

Mar 19 '24 02:03 Artyom17

gpt-fast gpt-fast copied to clipboard

Fixing block size for Mistral-7B.

gpt-fast
gpt-fast copied to clipboard