UniMMVSR icon indicating copy to clipboard operation
UniMMVSR copied to clipboard

Official Code of "UniMMVSR: A Unified Multi-Modal Framework for Cascaded Video Super-Resolution"

UniMMVSR: A Unified Multi-Modal Framework for Cascaded Video Super-Resolution

Shian Du1 †, Menghan Xia2 ✉, Chang Liu1, Quande Liu3, Xintao Wang3, Pengfei Wan3, Xiangyang Ji1 ✉
1Tsinghua University 2Huazhong University of Science and Technology 3Kling Team, Kuaishou Technology
†: Intern at Kuaishou Technology, ✉: Corresponding Authors

UniMMVSR Video Demo on YouTube

📋 News

  • [2025.10.10] Release Arxiv paper.

📖 Introduction

We propose UniMMVSR, the first unified generative video super-resolution framework to incorporate hybrid-modal conditions, including text, images, and videos, which supports 4K controllable video generation for the first time.

⚙️ Code (Coming soon)

Citation

 @article{du2025unimmvsr,
  title={UniMMVSR: A Unified Multi-Modal Framework for Cascaded Video Super-Resolution},
  author={Du, Shian and Xia, Menghan and Liu, Chang and Liu, Quande and Wang, Xintao and Wan, Pengfei and Ji, Xiangyang},
  journal={arXiv preprint arXiv:2510.08143},
  year={2025}
}