Peng "Richard" Xia
Peng "Richard" Xia
awesome-multimodal-in-medical-imaging
A collection of resources on applications of multi-modal learning in medical imaging.
HGCLIP
[arXiv'23] HGCLIP: Exploring Vision-Language Models with Graph Representations for Hierarchical Understanding
Chinese-Noisy-Text
This repository stores the code of the data augmentation method from Chinese word and character levels, which adds noise to words and characters in redundant, missing, selection and ordering respectiv...
LMPT
[ACLW'24] LMPT: Prompt Tuning with Class-Specific Embedding Loss for Long-tailed Multi-Label Visual Recognition
CARES
[arXiv'24 & ICMLW'24] CARES: A Comprehensive Benchmark of Trustworthiness in Medical Vision Language Models
RULE
[EMNLP'24] RULE: Reliable Multimodal RAG for Factuality in Medical Vision Language Models