Qian comments

Repositories
Issues
Comments

Results 33 comments of


                                            Qian

请问这个代码大概需要运行多久

@gaoxing153 咦，是用的什么GPU呢？他这个不会early-stopping的，理论上会跑满预期的epoch，但你看上次保存的模型距离现在很久就可以手动停止了。

llama2-7b种子模型结果有问题

@447428054 thanks for your interest! Since the seed lm should be an LLM after instruction tuning, may I know if you have already used the correct version of the model?

GPU

With `load_in_8_bit`, usually it requires (at least) 1 x 80G A100 to do inference on the model. @MahdiMohseni0033