LMDrive icon indicating copy to clipboard operation
LMDrive copied to clipboard

About Training Time

Open ReaFly opened this issue 1 year ago • 1 comments

Hi authors! Thanks for your excellent work. I encounter low training efficiency issues. The second Instruction finetuning stage takes about 6 days on my 8*A100 (40G) GPUs, utilizing only Town01 data downloaded from openxlab. I noticed you mentioned that Instruction finetuning takes around 3 days for the visual encoder on 8x A100 (80G). If you utilize all (Town01-Town07,Town10) data during the finetuning stage? and what could be the possible reasons on my machine?

ReaFly avatar Feb 20 '24 11:02 ReaFly

Hi! Sorry for the late reply. Maybe you can check the disk speed? It will significantly affect the training efficiency. I recommend moving the training data to the SSD.

deepcs233 avatar Feb 25 '24 13:02 deepcs233