dalle-mini
dalle-mini copied to clipboard
customized dataset
Hi sorry for this naive question as I am new to computer vision and DALLE. Say if I have a folder of images and captions such as
dir/ cat.jpg cat.txt dog.jpg dog.txt
how to convert the folder into the data format the DALLE-mini can accept if I want to train a new model? Is there any script in this repo?
I was using "https://github.com/lucidrains/DALLE-pytorch" the data format for their model is the above format but I am not clear what is the data format used here.
This is the required format: https://github.com/borisdayma/dalle-mini/blob/main/tools/dataset/encode_dataset.ipynb