Conqueror
Conqueror
> Can you see if the error persists with a smaller model like `opt-350m`? opt-350m seems to return an, albeit lengthy, response: Notebook: ``` Common sense questions and answers Question:...
> It's possible that you are running out of memory with GPT-J-6B. The model works fine in chat mode. For some reason the responses aren't capped in the other modes...
The issue seems to lie in the eos_token_id figure. Comparing a chat request to a notebook request, the figure is significantly higher in the notebook request (50K+ vs 198 in...