ConMan05
If you're in a hurry and need a **quick, temporary solution** for now, `--max requests 1 --workers 10` helped me. You can get 10 simultaneous requests where each...
This occurs on Windows 10... I'm using Kubuntu 20.04 and it's running fine....
Understood, so max_new_tokens is equivalent to max_output_tokens. 1) Could you provide the specific value of max_new_tokens for [liuhaotian/llava-v1.6-34b](https://huggingface.co/liuhaotian/llava-v1.6-34b) and [liuhaotian/llava-v1.6-mistral-7b](https://huggingface.co/liuhaotian/llava-v1.6-mistral-7b)? 2) I have tested [mistral-7b-instruct-v0.2](https://replicate.com/mistralai/mistral-7b-instruct-v0.2) on the Replicate...