
weight_diff.py state_dict_recovered[key].add_(state_dict_raw[key]) RuntimeError: The size of tensor a (32001) must match the size of tensor b (32000) at non-singleton dimension 0

Open gaodexiaozheng opened this issue 1 year ago • 3 comments

When running the command below: python weight_diff.py recover --path_raw /models/Llama-2-7b-hf --path_diff /models/alpaca-7b-wdiff --path_tuned ./llama-alpaca-7b-hf

it shows the error: RuntimeError: The size of tensor a (32001) must match the size of tensor b (32000) at non-singleton dimension 0

Although I understand what the error means, it should be resolved.

gaodexiaozheng avatar Nov 22 '23 08:11 gaodexiaozheng

I encountered the same problem. Checking the source code, I found that it relates to the format of model.state_dict, so you should compare the state_dict tensor shapes of the raw model and the model diff.
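A minimal sketch of that shape check, assuming the shapes have already been collected from each model with something like `{k: tuple(v.shape) for k, v in model.state_dict().items()}`. The helper name `find_shape_mismatches` and the example shapes are hypothetical and only illustrate the 32000 vs 32001 mismatch from the error message; they are not taken from weight_diff.py.

```python
from typing import List, Mapping, Tuple

Shape = Tuple[int, ...]


def find_shape_mismatches(
    raw_shapes: Mapping[str, Shape],
    diff_shapes: Mapping[str, Shape],
) -> List[Tuple[str, Shape, Shape]]:
    """Return (key, raw_shape, diff_shape) for every shared key whose shapes differ."""
    mismatches = []
    # Only keys present in both state dicts can be added together.
    for key in sorted(set(raw_shapes) & set(diff_shapes)):
        if raw_shapes[key] != diff_shapes[key]:
            mismatches.append((key, raw_shapes[key], diff_shapes[key]))
    return mismatches


# Illustrative shapes only (assumption): the diff has one extra vocabulary row.
raw_shapes = {
    "model.embed_tokens.weight": (32000, 4096),
    "lm_head.weight": (32000, 4096),
    "model.norm.weight": (4096,),
}
diff_shapes = {
    "model.embed_tokens.weight": (32001, 4096),
    "lm_head.weight": (32001, 4096),
    "model.norm.weight": (4096,),
}

for key, raw, diff in find_shape_mismatches(raw_shapes, diff_shapes):
    print(f"{key}: raw {raw} vs diff {diff}")
```

If only the embedding and output-head tensors show up, the two checkpoints most likely differ in vocabulary size by one token, which matches the sizes reported in the RuntimeError.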

boyue-jiang avatar Nov 24 '23 20:11 boyue-jiang

I have the same problem. Any idea to solve this error?

Irenehere avatar Jan 11 '24 08:01 Irenehere

> I have the same problem. Any idea to solve this error?

Hi, have you solved this error?

Yclock avatar Jun 24 '24 11:06 Yclock