DeepSeek-V3-Windows-Installation-Guide
DeepSeek-V3-Windows-Installation-Guide copied to clipboard
BEWARE OF MODEL CONVERSION SCRIPT
The conversion script does something to confuse the model ...
Here are the different output for the same question, one time with the model downloaded from huggingface, the other one converted with the provided script.
python windows_server.py --model model_weights --trust-remote-code System: You are a helpful assistant User: tell me the exact date when mao is born System: The exact date when mao is born is 12/26/1893
python windows_server.py --model model_weights_bf16 --trust-remote-code System: You are a helpful assistant User: tell me the exact date when mao is born System: I am not sure about this.