DV Lab
LISA
Project Page for "LISA: Reasoning Segmentation via Large Language Model"
3D-Box-Segment-Anything
We extend Segment Anything to 3D perception by combining it with VoxelNeXt.
MoTCoder
This is the official code repository of MoTCoder: Elevating Large Language Models with Modular of Thought for Challenging Programming Tasks.
SphereFormer
The official implementation for "Spherical Transformer for LiDAR-based 3D Recognition" (CVPR 2023).
SparseTransformer
A fast and memory-efficient library for sparse transformers with varying token numbers (e.g., 3D point clouds).
LongLoRA
Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)
VoxelNeXt
VoxelNeXt: Fully Sparse VoxelNet for 3D Object Detection and Tracking (CVPR 2023)
Ref-NPR
[CVPR 2023] Ref-NPR: Reference-Based Non-PhotoRealistic Radiance Fields
RIVAL
[NeurIPS 2023 Spotlight] Real-World Image Variation by Aligning Diffusion Inversion Chain