Casimiro

Results 4 comments of Casimiro

Hi! Thanks for the PR. I was trying your code with a Bloomz7b1 model but it seems that the `model.eval()` does not merge the lora weights into the base model...

Hi again! I tried to follow your instructions but when calling the `model.eval()` method I got the following error: `RuntimeError: Expected 4-dimensional input for 4-dimensional weight [8192, 16, 1, 1],...

Hi @pacman100, I installed the repo from source (main branch) and now it works. Many thanks!

I am also interested in deploying the `mosaicml/mpt-30b-chat` model. Would be really useful for the community! :pray: