llm-foundry icon indicating copy to clipboard operation
llm-foundry copied to clipboard

Generated sample equals to input samples

Open germanjke opened this issue 2 years ago • 3 comments

I have trained 125m MPT on some small dataset, my generated inputs via inference/hf_generate.py (before this converted from composer to HF, and it's gives me some value from eval/eval.py with winograd) are same with input samples, do u know what kind of problem it could be?

germanjke avatar May 29 '23 17:05 germanjke

Hi @germanjke , could you clarify what data you have been finetuning on? Are you running the Quickstart instructions? If so, the outputs are not meant to be good, it's just meant to be a tour of the different scripts available in the LLM Foundry and take you from data -> model -> generations.

It would also help greatly if you could attach the commands and outputs you are seeing.

abhi-mosaic avatar May 31 '23 01:05 abhi-mosaic

Hi, I am using the 'inference/hf_generate.py' with 'mpt-7b-instruct' (inference only) but also getting generated output which is the same as input for certain prompts. Also the answers seem to be more of a text completion, and are not similar to the instruct demo responses. Could you clarify why this is happening, and what are the recommended changes for using this model to be able to be similar to the demo? I have set the temperature, top_p and top_k as in demo already.

kochkinaelena avatar Jun 08 '23 09:06 kochkinaelena

Hi @kochkinaelena , could you provide a bit more info such as the prompts that you are using, or ideally the exact command python hf_generate.py ... you are running?

For the mosaicml/mpt-7b-instruct model specifically, I believe it may be best to follow the instruction format that it was finetuned with, you can see an example here of how we prompt the model in our HF space: https://huggingface.co/spaces/mosaicml/mpt-7b-instruct/blob/main/quick_pipeline.py#L8-L23. This might lead to better generations!

abhi-mosaic avatar Jun 11 '23 19:06 abhi-mosaic

Closing as stale -- please re-open if you continue to have issues!

hanlint avatar Jul 24 '23 06:07 hanlint