wit
wit copied to clipboard
WIT (Wikipedia-based Image Text) Dataset is a large multimodal multilingual dataset comprising 37M+ image-text sets with 11M+ unique images across 100+ languages.
Do you plan to release the baseline models pre-trained on WIT dataset (as mentioned in WIT paper)? Thank you!
Line 52. Change "filering" to "filtering". Add a missing comma to the same sentence.
what is the plan of releasing the code or murals?
Hi, great work. Do we have any exhaustive list of the topics/categories pertaining to the data-set's contents ?
Thanks for your great works on WikiWeb2M I just download the WikiWeb2M dataset and find there is only the first section of each wiki page, i wonder where is the...