
Trying to train the model and got this error

Open mrunal2401 opened this issue 1 year ago • 0 comments

I was trying to run "model_train.ipynb" in Google Colab. All the earlier steps in the notebook executed without any problem, but at the model training step I got the error below:

Training the UNet...
ERROR: ld.so: object 'libtcmalloc.so' from LD_PRELOAD cannot be preloaded (cannot open shared object file): ignored.
ERROR: ld.so: object 'libtcmalloc.so' from LD_PRELOAD cannot be preloaded (cannot open shared object file): ignored.
ERROR: ld.so: object 'libtcmalloc.so' from LD_PRELOAD cannot be preloaded (cannot open shared object file): ignored.
ERROR: ld.so: object 'libtcmalloc.so' from LD_PRELOAD cannot be preloaded (cannot open shared object file): ignored.

Traceback (most recent call last):
  File "/content/diffusers/examples/dreambooth/train_dreambooth.py", line 803, in <module>
    main()
  File "/content/diffusers/examples/dreambooth/train_dreambooth.py", line 590, in main
    train_dataloader = torch.utils.data.DataLoader(
  File "/usr/local/lib/python3.10/dist-packages/torch/utils/data/dataloader.py", line 349, in __init__
    sampler = RandomSampler(dataset, generator=generator)  # type: ignore[arg-type]
  File "/usr/local/lib/python3.10/dist-packages/torch/utils/data/sampler.py", line 140, in __init__
    raise ValueError(f"num_samples should be a positive integer value, but got num_samples={self.num_samples}")
ValueError: num_samples should be a positive integer value, but got num_samples=0
Traceback (most recent call last):
  File "/usr/local/bin/accelerate", line 8, in <module>
    sys.exit(main())
  File "/usr/local/lib/python3.10/dist-packages/accelerate/commands/accelerate_cli.py", line 43, in main
    args.func(args)
  File "/usr/local/lib/python3.10/dist-packages/accelerate/commands/launch.py", line 837, in launch_command
    simple_launcher(args)
  File "/usr/local/lib/python3.10/dist-packages/accelerate/commands/launch.py", line 354, in simple_launcher
    raise subprocess.CalledProcessError(returncode=process.returncode, cmd=cmd)
subprocess.CalledProcessError: Command '['/usr/bin/python3', '/content/diffusers/examples/dreambooth/train_dreambooth.py', '--offset_noise', '--image_captions_filename', '--train_only_unet', '--save_starting_step=10', '--save_n_steps=10', '--Session_dir=/content/gdrive/MyDrive/TIG-MI/Sessions/TIG-MI_SESSION', '--pretrained_model_name_or_path=/content/stable-diffusion-custom', '--instance_data_dir=/content/gdrive/MyDrive/TIG-MI/Sessions/TIG-MI_SESSION/instance_images', '--output_dir=/content/models/TIG-MI_SESSION', '--captions_dir=/content/gdrive/MyDrive/TIG-MI/Sessions/TIG-MI_SESSION/captions', '--instance_prompt=', '--seed=695057', '--resolution=512', '--mixed_precision=fp16', '--train_batch_size=1', '--gradient_accumulation_steps=1', '--use_8bit_adam', '--learning_rate=2e-06', '--lr_scheduler=linear', '--lr_warmup_steps=0', '--max_train_steps=150']' returned non-zero exit status 1.
Something went wrong
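
For reference, the final ValueError (num_samples=0) is what PyTorch's RandomSampler raises when the dataset handed to the DataLoader is empty, so one thing worth checking is whether the folder passed as --instance_data_dir actually contains images. The following is only a minimal sketch of such a check: the helper name `count_images` and the list of image extensions are hypothetical and not taken from the notebook or from train_dreambooth.py, and an empty folder is a likely cause here rather than a confirmed one.

```python
from pathlib import Path

# Assumed set of image extensions; the training script may accept others.
IMAGE_EXTENSIONS = {".jpg", ".jpeg", ".png", ".webp", ".bmp"}


def count_images(directory):
    """Hypothetical helper: count image files directly inside `directory`.

    Returns 0 when the directory does not exist, since that also yields
    an empty dataset.
    """
    folder = Path(directory)
    if not folder.is_dir():
        return 0
    return sum(
        1
        for path in folder.iterdir()
        if path.is_file() and path.suffix.lower() in IMAGE_EXTENSIONS
    )


if __name__ == "__main__":
    # Path copied from the --instance_data_dir argument in the traceback above.
    instance_dir = "/content/gdrive/MyDrive/TIG-MI/Sessions/TIG-MI_SESSION/instance_images"
    found = count_images(instance_dir)
    print(f"{found} image(s) found in {instance_dir}")
    if found == 0:
        print("The folder is missing or empty, which would explain num_samples=0.")
```

If the count is 0, uploading the instance images to that session folder (and checking that Google Drive is mounted) before rerunning the training cell would be the first thing to try.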

Alternatively, could you please provide a link to download the pre-trained model? The link in the README file is wrong. Thank you in advance for helping :)

mrunal2401 · Dec 01 '23 05:12