Tune-A-Video

Unofficial implementation of Tune-A-Video

[Image: step_500_resized]

Training

accelerate launch train.py --config config/sample.yml

Notes

  • The learning rate and the number of training steps in the sample config differ from those in the paper.
  • The model seems to memorize a video clip very quickly, so prior_preservation is added as regularization.
  • Training requires more than 20 GB of GPU memory with xformers enabled.
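
The prior_preservation note above can be illustrated with a small sketch. It assumes the option works like the DreamBooth-style prior-preservation loss used in the diffusers training examples: the usual denoising loss on the training clip plus a weighted loss on samples drawn from the original model. The actual implementation in train.py may differ. The function names and the prior_weight parameter are hypothetical, and plain Python lists stand in for tensors to keep the example dependency-free.

```python
def mse(pred, target):
    """Mean squared error between two equal-length, non-empty sequences of floats."""
    if len(pred) != len(target) or len(pred) == 0:
        raise ValueError("pred and target must be non-empty and of equal length")
    return sum((p - t) ** 2 for p, t in zip(pred, target)) / len(pred)


def loss_with_prior_preservation(pred, target, prior_pred, prior_target, prior_weight=1.0):
    """Loss on the training clip plus a weighted loss on prior samples.

    pred / target:             noise prediction and true noise for the training clip
    prior_pred / prior_target: the same pair for samples generated by the original model
    prior_weight:              hypothetical knob controlling the regularization strength
    """
    instance_loss = mse(pred, target)
    prior_loss = mse(prior_pred, prior_target)
    return instance_loss + prior_weight * prior_loss


if __name__ == "__main__":
    # Toy values only; real inputs would be flattened noise tensors.
    total = loss_with_prior_preservation(
        [1.0, 2.0], [1.0, 0.0], [0.0, 0.0], [1.0, 1.0], prior_weight=0.5
    )
    print(total)
```

A larger prior_weight pulls the fine-tuned model back toward the original model's outputs, which is what counteracts the fast memorization of a single clip.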

References

  • Pseudo 3D Conv & Temporal Attention: https://github.com/lucidrains/make-a-video-pytorch
  • Training Code: https://github.com/huggingface/diffusers
  • Video Source: https://youtu.be/RC-24Nfr7fc?t=100