NVS-Solver: Video Diffusion Model as Zero-Shot Novel View Synthesizer

YOU, Meng; Zhu, Zhiyu; LIU, Hui; Hou, Junhui

Part of the International Conference on Learning Representations 2025 (ICLR 2025)

Abstract

By harnessing the generative capabilities of large pre-trained video diffusion models, we propose a training-free paradigm for novel view synthesis. The proposed method adaptively modulates the diffusion sampling process with the given views, producing visually pleasing results from single or multiple views of static scenes or from monocular videos of dynamic scenes. Specifically, building on our theoretical modeling, we iteratively modulate the score function with the given scene priors, represented by warped input views, to control the video diffusion process. Moreover, by theoretically exploring the bound of the estimation error, we make the modulation adaptive to the view pose and the number of diffusion steps. Extensive evaluations on both static and dynamic scenes show that our method significantly outperforms state-of-the-art methods, both quantitatively and qualitatively. The source code is available at https://github.com/ZHU-Zhiyu/NVS_Solver.
