AICV Lab
AICV Lab
VLCAP
[ICIP 2022] VLCap: Vision-Language with Contrastive Learning for Coherent Video Paragraph Captioning
ECG_SSL_12Lead
[IEEE BHI 2022] Multimodality Multi-Lead ECG Arrhythmia Classification using Self-Supervised Learning
VLTinT
[AAAI 2023 Oral] VLTinT: Visual-Linguistic Transformer-in-Transformer for Coherent Video Paragraph Captioning
AOE-Net
[IJCV] AOE-Net: Entities Interactions Modeling with Adaptive Attention Mechanism for Temporal Action Proposals Generation
AISFormer
[BMVC 2022] AISFormer: Amodal Instance Segmentation with Transformer
3DConvCaps
[ICPR 2022] 3DConvCaps: 3DUnet with Convolutional Capsule Encoder for Medical Image Segmentation
OpenFusion
[ICRA 2024 Oral] Open-Fusion: Real-time Open-Vocabulary 3D Mapping and Queryable Scene Representation
MEGANet
[WACV 2024] An implementation of MEGANet for polyp segmentation with multi-scale edge-guided attention
AerialFormer
[preprint] AerialFormer: Multi-resolution Transformer for Aerial Image Segmentation