VILA
VILA copied to clipboard
Is there any DataCollatorForSupervisedDatasetSeqParallel for video dataset?
The current DataCollatorForSupervisedDatasetSeqParallel in llava/data/dataset.py is built for image dataset. There will be many errors when directly using it for video dataset. Will you release the similar solution for video training, so as to enbale re-implementation of stage 4 and 5 in LongVILA?
@yukang2017 could you help provide a pointer to the SeqParallel dataset?