
BEWARE OF MODEL CONVERSION SCRIPT

Open Cyl0nius opened this issue 10 months ago • 0 comments

The conversion script appears to alter the model's behavior: the converted weights give a different answer than the original weights.

Here are the different outputs for the same question: one from the model as downloaded from Hugging Face, the other from the model converted with the provided script.

python windows_server.py --model model_weights --trust-remote-code
System: You are a helpful assistant
User: tell me the exact date when mao is born
System: The exact date when mao is born is 12/26/1893

python windows_server.py --model model_weights_bf16 --trust-remote-code
System: You are a helpful assistant
User: tell me the exact date when mao is born
System: I am not sure about this.
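To make a report like this easier to reproduce, the two replies can be captured as text and diffed. The following is only a minimal sketch using Python's standard `difflib`; the helper name `diff_responses` is hypothetical and not part of this repository, and the reply strings are simply copied from the two runs above rather than produced by calling the server.

```python
import difflib


def diff_responses(original: str, converted: str) -> list[str]:
    """Return a unified diff of two model replies (empty list if identical)."""
    return list(
        difflib.unified_diff(
            original.splitlines(),
            converted.splitlines(),
            fromfile="model_weights",        # label only, not read from disk
            tofile="model_weights_bf16",     # label only, not read from disk
            lineterm="",
        )
    )


# Replies copied by hand from the two runs reported above.
original_reply = "The exact date when mao is born is 12/26/1893"
converted_reply = "I am not sure about this."

report = diff_responses(original_reply, converted_reply)
print("\n".join(report) if report else "responses are identical")
```

A single differing reply can also come from sampling randomness, so it may be worth repeating both runs with identical generation settings (if the server exposes them) before attributing the difference to the conversion.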

Cyl0nius, Feb 14 '25 02:02