lorax
lorax copied to clipboard
Not able to host Llama3.2-11b on Azure A100 80GB server
System Info
lorax_version=0.12.0 Using Docker to host the 11b model it runs perfectly for Llama3.1-8b But with LLama3.2-11b I am getting the following error
ModuleNotFoundError: No module named 'lorax_server.utils.attention.utils'
Information
- [X] Docker
- [ ] The CLI directly
Tasks
- [X] An officially supported command
- [ ] My own modifications
Reproduction
Set the model path in environment variable and use docker run to host the model
Expected behavior
The model should have been hosted seemlessly like it does for the older model