multimodal topic
img2dataset
Easily turn large sets of image URLs into an image dataset. Can download, resize, and package 100M URLs in 20h on one machine.
Pluralistic-Inpainting
[CVPR 2019]: Pluralistic Image Completion
psi
Platform for Situated Intelligence
blended-diffusion
Official implementation for "Blended Diffusion for Text-driven Editing of Natural Images" [CVPR 2022]
Kaleido-BERT
💐Kaleido-BERT: Vision-Language Pre-training on Fashion Domain
KDD_WinnieTheBest
KDD Cup 2020 Challenges for Modern E-Commerce Platform: Multimodalities Recall (first-place solution)
EPNet
EPNet: Enhancing Point Features with Image Semantics for 3D Object Detection (ECCV 2020)
MultiModalStory-demo
FairyTailor: Multimodal Generative Framework for Storytelling
tsflex
Flexible time series feature extraction & processing
Diverse-Structure-Inpainting
CVPR 2021: "Generating Diverse Structure for Image Inpainting With Hierarchical VQ-VAE"