MagicDrive icon indicating copy to clipboard operation
MagicDrive copied to clipboard

Multi batchsize training.

Open Capchenxi opened this issue 1 year ago • 4 comments

To whom it may concern,

I have a problem when I change the train_batch_size into 2 in configs/runner/default_t.yaml file. The error occurs because of the line below https://github.com/cure-lab/MagicDrive/blob/cc9d9ae7931b2caf1ba2d304f460ba10e850fa31/magicdrive/networks/unet_addon_rawbox.py#L310C13-L310C43

I wonder if this line should be repeat_size = [repeat_size, 1] since the imported repeat_size as an integer is the batchsize, and 1 is the default uncond cam number.

Capchenxi avatar Nov 04 '24 09:11 Capchenxi

Or I wonder how to do multi-batchsize training for video generation since I still got errors when I fixed the line mentioned above. Thansk.

Capchenxi avatar Nov 05 '24 02:11 Capchenxi

This issue is stale because it has been open for 7 days with no activity. If you do not have any follow-ups, the issue will be closed soon.

github-actions[bot] avatar Nov 12 '24 16:11 github-actions[bot]

I do not think it is trivial to support multi-batch training, right now the video training only supports one video on one process. I sometimes use the "batch" dim as T, which is the frame number, to reuse the image generation code. I am sorry for the inconvenience. It may take some effort to support multi-batch training.

flymin avatar Nov 18 '24 08:11 flymin

This issue is stale because it has been open for 7 days with no activity. If you do not have any follow-ups, the issue will be closed soon.

github-actions[bot] avatar Nov 25 '24 16:11 github-actions[bot]