Mihir Prabhudesai

3 repositories owned by Mihir Prabhudesai

AlignProp

198 stars, 7 forks

AlignProp uses direct reward backpropagation to align large-scale text-to-image diffusion models. Our method is 25x more sample- and compute-efficient than reinforcement learning methods (P...

Slot-TTA

18 stars, 3 forks

Slot-TTA shows that test-time adaptation using slot-centric models can improve image segmentation on out-of-distribution examples.

VADER

195 stars, 15 forks

Video Diffusion Alignment via Reward Gradients. We improve a variety of video diffusion models such as VideoCrafter, OpenSora, ModelScope and StableVideoDiffusion by finetuning them using various rewa...