mesh-transformer-jax icon indicating copy to clipboard operation
mesh-transformer-jax copied to clipboard

How to stop model generating

Open jingrongchen opened this issue 3 years ago • 1 comments

I have finetuned my model using my own dataset, but when i run the model, it generates a correct answer but the model keep on generating the prompt answer pair, is there any way to tell model to stop the genration after certain character

jingrongchen avatar Jun 05 '22 18:06 jingrongchen

Hi, I am trying to finetune the model on my own dataset as well. But even with GPU memory of 24GB, I had issues loading the model weights into it. What would be the suggestion for the compute hardware for finetuning GPTJ 6B.

I am training with input data with an avg of 660 words.

greyweb avatar Aug 24 '23 12:08 greyweb