Chaeyeon Kwak

Results: 1 issue by Chaeyeon Kwak

Hi, and thanks for the great work on BitNet! I'm trying to fine-tune `microsoft/bitnet-b1.58-2B-4T-bf16` using a Korean dataset (`nlpai-lab/kullm-v2`) with SFTTrainer. However, during training, the loss remains around **3.3 to...