
Training Issues

Open Allvey opened this issue 1 year ago • 2 comments

Thank you for your excellent work! I am having trouble with training: I tried to install Slurm for cluster job scheduling, but repeated attempts failed. Does it affect training if we skip the srun command and run the training script directly (for example, ./distributed_pretrain.sh 8 '/path/to/your/dataset' ... in the pre-training stage)?

Allvey, May 14 '24 08:05

Hi! Running the training script directly, without Slurm, is fine and doesn't affect performance.

deepcs233, May 15 '24 09:05
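
For readers wondering why the launcher can be swapped: distributed training scripts commonly read the process rank and world size from environment variables, and both srun and torchrun-style launchers set such variables. The sketch below is only an illustration of that general pattern, not LMDrive's actual code. The function name resolve_dist_info and the DistInfo type are hypothetical. The variable names RANK, WORLD_SIZE, LOCAL_RANK (torchrun) and SLURM_PROCID, SLURM_NTASKS, SLURM_LOCALID (Slurm) are the standard ones those tools export. Whether distributed_pretrain.sh relies on them is an assumption.

```python
import os
from typing import Mapping, NamedTuple


class DistInfo(NamedTuple):
    """Hypothetical container for the values a training script needs."""
    rank: int        # global index of this process
    world_size: int  # total number of processes
    local_rank: int  # index of this process on its node (usually the GPU id)


def resolve_dist_info(env: Mapping[str, str] = os.environ) -> DistInfo:
    """Read rank information from whichever launcher started the process.

    Illustrative only: this is a common pattern, not taken from LMDrive.
    """
    # torchrun-style launchers export RANK / WORLD_SIZE / LOCAL_RANK.
    if "RANK" in env and "WORLD_SIZE" in env:
        return DistInfo(
            rank=int(env["RANK"]),
            world_size=int(env["WORLD_SIZE"]),
            local_rank=int(env.get("LOCAL_RANK", 0)),
        )
    # srun exports SLURM_PROCID / SLURM_NTASKS / SLURM_LOCALID per task.
    if "SLURM_PROCID" in env and "SLURM_NTASKS" in env:
        return DistInfo(
            rank=int(env["SLURM_PROCID"]),
            world_size=int(env["SLURM_NTASKS"]),
            local_rank=int(env.get("SLURM_LOCALID", 0)),
        )
    # Neither launcher detected: fall back to a single process.
    return DistInfo(rank=0, world_size=1, local_rank=0)


if __name__ == "__main__":
    print(resolve_dist_info())
```

With a pattern like this, the training loop sees the same rank and world size regardless of how the processes were started, which is consistent with the answer above that skipping srun does not change the results.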

@Allvey Hello, may I ask what command you are using to train the visual encoder? I am also using a single GPU.

CoderXuans, Jan 09 '25 01:01