dereklll

6 comments by dereklll

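> @dereklll > Do you mean it worked at first and then stopped working later? With your setup, the loss for every positive sample is 0 because it is masked out, so the total loss contains only the negative-sample loss.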

> (alpaca_env) chunzhamini@chunzhamini llama.cpp % ./main -m ./zh-models/baichuan/Baichuan2-13B-Chat-ggml-model-q4_0.bin -p '从前有一只小狐狸,他' --temp 0 -ngl 1
> Log start
> main: warning: changing RoPE frequency base to 0 (default 10000.0)
> main: warning: scaling RoPE...