DeepSpeedExamples
DeepSpeedExamples copied to clipboard
Batchsize
You set batchsize differently in parser arg and ds_config.json. For example, in cifar, in ds_config.json, train_batch_size is 16 but in parser argument, batchsize is 32. But you set batchsize as 16 in the final training dataloader.... Which one is right?