UCSC ERIC Lab
UCSC ERIC Lab
awesome-vision-language-navigation
A curated list for vision-and-language navigation. ACL 2022 paper "Vision-and-Language Navigation: A Survey of Tasks, Methods, and Future Directions"
VLMbench
NeurIPS 2022 Paper "VLMbench: A Compositional Benchmark for Vision-and-Language Manipulation"
PEViT
Official implementation of AAAI 2023 paper "Parameter-efficient Model Adaptation for Vision Transformers"
CPL
Official implementation of our EMNLP 2022 paper "CPL: Counterfactual Prompt Learning for Vision and Language Models"
Aerial-Vision-and-Dialog-Navigation
Codebase of ACL 2023 Findings "Aerial Vision-and-Dialog Navigation"
photoswap
Official implementation of the NeurIPS 2023 paper "Photoswap: Personalized Subject Swapping in Images"
MiniGPT-5
Official implementation of paper "MiniGPT-5: Interleaved Vision-and-Language Generation via Generative Vokens"
ComCLIP
Official implementation and dataset for the NAACL 2024 paper "ComCLIP: Training-Free Compositional Image and Text Matching"
swap-anything
"SwapAnything: Enabling Arbitrary Object Swapping in Personalized Visual Editing"
Discffusion
Official repo for the paper "Discffusion: Discriminative Diffusion Models as Few-shot Vision and Language Learners"