InternVL
InternVL copied to clipboard
Do you plan on releasing the dataset used to train internVL 1.5 ?
Hello, As stated in the huggingface page of InternVL 1.5, a High-Quality Bilingual Dataset was used to train this model. Do you plan to release this dataset in the future ? Thanks !
Hi, we may release the annotation files in the JSONL format that we use. However, to make them usable for everyone, we will need to create a document detailing the placement of paths and the downloading of images. This will take some time.