abd comments

Results: 2 comments by abd

[BUG] Transformer.from_folder() does not load the model on multiple GPU

You might want to try the vLLM library. I used it to deploy the Mistral-Nemo model in a multi-node, multi-GPU setting. Reference: https://docs.mistral.ai/deployment/self-deployment/vllm/ I could be wrong, but I think...

How can I merge the LoRA weights into the base model?

mistral-finetune requires torch==2.2, whereas mistral-inference requires torch==2.3.0 for all but the first release. Is there any way to have the two of them in the...