instruct-pix2pix
instruct-pix2pix copied to clipboard
finetuning pretrained model using the custom dataset
Hi, Thank you for your great work. I am working on finetuning the pre-trained model using a custom dataset. Could you please let me know how I should organize the paired images and editing instructions during finetuning? I tried to download your provided datasets to get an idea but those datasets are more than 700GB. Could you provide a tiny version of the dataset which will contain only a few samples for reference? Thank you in advance.
Hi, you can access to clip-filtered-dataset and just download part of data (14GB), then you can check and find the pattern of the dataset.
Remember to download seeds.json.
Thank you very much!
Hello. I have a question. What is seeds.json used for?
Hello @unmo. The seeds.json contains an array like [ [A, [B, C, D, E] , [A, [B, C, D, E], ...]
where "A"s indicates the sub directory name in the dataset and "BCDE" are the image pair names in sub directory "A". Notice that the two images for "B" are represented as "B_0.jpg" (before edit) and "B_1.jpg" (after edit), which should be edited according to the prompt.json file in "A".
I understood the structure. Thank you very much.