Open-Sora-Plan A million-scale text-to-video prompt-gallery dataset

A million-scale text-to-video prompt-gallery dataset

Open WangWenhao0716 opened this issue 11 months ago • 5 comments

Hi, We contribute the first dataset featuring 1.67 million unique text-to-video prompts and 6.69 million videos generated from 4 different state-of-the-art diffusion models. We hope it can help your Open-Sora plan.

Title：VidProM: A Million-scale Real Prompt-Gallery Dataset for Text-to-Video Diffusion Models

Arxiv：https://arxiv.org/abs/2403.06098

Project：https://github.com/WangWenhao0716/VidProM

Download：https://huggingface.co/datasets/WenhaoWang/VidProM

Mar 12 '24 07:03 WangWenhao0716

Thanks for the heads up, we'll take it under advisement.

Mar 13 '24 04:03 LinB203

@WangWenhao0716 could you please show us some samples for checking on your project link for further researching?

By the way, welcome to update your work in the MiniSora Dataset Section (https://github.com/mini-sora/minisora/tree/main#dataset)

Mar 15 '24 03:03 chg0901

Yeah, thanks for your interest. I will upload an example folder with 10000 random prompts and corresponding videos. I am pleasure to update my work in the MiniSora Dataset Section :)

Mar 15 '24 08:03 WangWenhao0716

@WangWenhao0716 could you please show us some samples for checking on your project link for further researching?

By the way, welcome to update your work in the MiniSora Dataset Section (https://github.com/mini-sora/minisora/tree/main#dataset)

I see it has been updated, thanks!

Mar 15 '24 08:03 WangWenhao0716

@chg0901 Done: https://huggingface.co/datasets/WenhaoWang/VidProM/tree/main/example

Mar 15 '24 10:03 WangWenhao0716

Open-Sora-Plan Open-Sora-Plan copied to clipboard

A million-scale text-to-video prompt-gallery dataset

Open-Sora-Plan
Open-Sora-Plan copied to clipboard