Conqueror

Results 3 comments of Conqueror

> Can you see if the error persists with a smaller model like `opt-350m`? opt-350m seems to return an, albeit lengthy, response: Notebook: ``` Common sense questions and answers Question:...

> It's possible that you are running out of memory with GPT-J-6B. The model works fine in chat mode. For some reason the responses aren't capped in the other modes...

The issue seems to lie in the eos_token_id figure. Comparing a chat request to a notebook request, the figure is significantly higher in the notebook request (50K+ vs 198 in...