Mihir Prabhudesai
Results
3
repositories owned by
Mihir Prabhudesai
AlignProp
198
Stars
7
Forks
Watchers
AlignProp uses direct reward backpropogation for the alignment of large-scale text-to-image diffusion models. Our method is 25x more sample and compute efficient than reinforcement learning methods (P...
Slot-TTA
18
Stars
3
Forks
Watchers
Slot-TTA shows that test-time adaptation using slot-centric models can improve image segmentation on out-of-distribution examples.
VADER
195
Stars
15
Forks
Watchers
Video Diffusion Alignment via Reward Gradients. We improve a variety of video diffusion models such as VideoCrafter, OpenSora, ModelScope and StableVideoDiffusion by finetuning them using various rewa...