Could not find the transformer layer class to wrap in the model.

Open Jerry-hyl opened this issue 1 year ago • 2 comments

When I use the training code to fine-tune mistral-7b, I get the exception "Could not find the transformer layer class to wrap in the model." How can I fix it?

Jerry-hyl, Jun 12 '24 02:06

Hi Jerry, did you find a solution for this? I get the same exception.

juliaparedesq, Aug 18 '24 07:08

@juliaparedesq I tried removing --fsdp_transformer_layer_cls_to_wrap 'LlamaDecoderLayer' and the exception went away. But after fine-tuning, the model's ability declined significantly. It seems that FastChat can only be used to deploy Mistral models, not to fine-tune them.

Jerry-hyl, Aug 19 '24 01:08
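
A possible explanation, not confirmed in this thread: the flag names 'LlamaDecoderLayer', but in Hugging Face transformers the Mistral decoder block is a class called MistralDecoderLayer, so the FSDP auto-wrap lookup may find nothing to match. The sketch below shows one way to list candidate class names from a loaded model instead of guessing. The helper names (module_class_counts, wrap_candidates) and the "DecoderLayer" suffix heuristic are assumptions made for illustration, and the toy classes are stand-ins so the snippet runs without torch or a checkpoint.

```python
from collections import Counter


def module_class_counts(model):
    """Count how often each submodule class name occurs in the model.

    Relies only on model.modules(), which torch.nn.Module provides.
    """
    return Counter(type(module).__name__ for module in model.modules())


def wrap_candidates(model, suffix="DecoderLayer"):
    """Return sorted class names ending in `suffix`.

    The suffix is a naming heuristic assumed here, not part of any library API.
    """
    return sorted(name for name in module_class_counts(model) if name.endswith(suffix))


# Stand-in classes (hypothetical) that mimic torch.nn.Module.modules().
class _ToyModule:
    def __init__(self, *children):
        self.children = list(children)

    def modules(self):
        # Yield this module first, then every descendant.
        yield self
        for child in self.children:
            yield from child.modules()


class MistralDecoderLayer(_ToyModule):
    pass


class ToyMistralModel(_ToyModule):
    pass


toy = ToyMistralModel(MistralDecoderLayer(), MistralDecoderLayer())
print(wrap_candidates(toy))  # ['MistralDecoderLayer']
```

With a real checkpoint, the same helper could be called on the model object returned by transformers before training, and the resulting name passed to --fsdp_transformer_layer_cls_to_wrap in place of 'LlamaDecoderLayer'. Whether that also avoids the quality drop reported above is untested here.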