Moris Johnson

Results 1 comments of Moris Johnson

@kwak513 Not sure if it'll help, I was trying to replicate [onebitllms](https://github.com/tiiuae/onebitllms/blob/main/examples/sft.py) script for a custom text data, faced the same problem where loss doesn't decrease. I changed the training...