Qian
Results: 33 comments of Qian
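@gaoxing153 Huh, which GPU are you using? It does not do early stopping, so in theory it will run for the full configured number of epochs. But if you see that the last saved model is from a long time ago, you can stop it manually.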
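For the manual check mentioned above, here is a minimal sketch of one way to see how long ago anything was last written to the output directory. It is not part of the training code: the function name `seconds_since_last_save`, the `./checkpoints` path, and the six-hour threshold are all assumptions for illustration, so adjust them to your own setup.

```python
import os
import time
from typing import Optional


def seconds_since_last_save(checkpoint_dir: str, now: Optional[float] = None) -> Optional[float]:
    """Return seconds since the newest file under checkpoint_dir was modified.

    Returns None if the directory contains no files at all.
    """
    newest = None
    # Walk the whole tree, since checkpoints are often saved in subfolders.
    for root, _dirs, files in os.walk(checkpoint_dir):
        for name in files:
            mtime = os.path.getmtime(os.path.join(root, name))
            if newest is None or mtime > newest:
                newest = mtime
    if newest is None:
        return None
    if now is None:
        now = time.time()
    return now - newest


if __name__ == "__main__":
    # Hypothetical output directory and threshold; change both as needed.
    age = seconds_since_last_save("./checkpoints")
    if age is None:
        print("No checkpoint files found yet.")
    elif age > 6 * 3600:
        print(f"Last save was {age / 3600:.1f} hours ago; consider stopping manually.")
    else:
        print(f"Last save was {age / 60:.0f} minutes ago.")
```

What counts as "a long time" depends on how long one epoch takes on your hardware, so treat the threshold as a rough guide only.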
@447428054 thanks for your interest! Since the seed lm should be an LLM after instruction tuning, may I know if you have already used the correct version of the model?
With `load_in_8_bit`, inference on the model usually requires at least 1 x 80G A100. @MahdiMohseni0033