Wang Jiaqi

Results 1 comments of Wang Jiaqi

hello, I used the same code, conda env, hyper-parameters and dataset, just replaced the base model with LLAMA-2. It did run, but the loss couldn't converge, which was the problem...