About checkpoints and stable-diffusion-v1-5
Thanks for your wonderful work! And I have some questions:
-
The checkpoint link you provided includes the folders vae, scheduler, tokenizer, and unet. However, in the pretrained section, you mentioned downloading the pre-trained StableDiffusion-1.5 from SDM, which also includes vae, scheduler, tokenizer, and unet. Which files should I use during training?
-
In your ./data_process/README.md, it mentions needing to download the required checkpoints such as yolov7.pt. Where can I find these checkpoints?
And if I want to use the 'Testset' uploaded to 'HuggingFace', only extracting the keyframes and their corresponding subtitles via 'python ./data_process/extract.py' is enough? I mentioned you said in StorySalon Readme: use the provided data processing pipeline to obtain the frames. But in issues you said it had been processed.Could you show more details about how to use the dataset? Thx a lot!
- Since our model is designed based on SDM, all SDM pre-trained parameters need to be used;
- Please refer to the official implementation repository of yolov7: https://github.com/WongKinYiu/yolov7
- We have processed this part of the test set and can use it directly.