SV3D


SV3D addresses the problem of Novel View Synthesis (NVS), which tries to generate the unseen portions of an object given one or more 2D images of that object: for example, generating a view of the back of an object given an image of its front. 

Stability AI built SV3D on their existing Stable Video Diffusion model, which includes camera control abilities. This lets SV3D generate orbital videos, in which the camera circles the object of interest.
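To make the idea of an orbital video concrete, here is a minimal sketch of the camera path only: it computes evenly spaced camera positions on a circle around an object at the origin. It is not SV3D's code. The function name `orbit_camera_positions`, the default frame count, the radius, and the fixed elevation are all illustrative assumptions.

```python
import math


def orbit_camera_positions(n_frames=21, radius=2.0, elevation_deg=10.0):
    """Return (x, y, z) camera positions evenly spaced on a circular orbit.

    The object is assumed to sit at the origin with z pointing up.
    All defaults are illustrative and not taken from SV3D.
    """
    elevation = math.radians(elevation_deg)
    positions = []
    for i in range(n_frames):
        # Azimuth sweeps one full turn over the course of the video.
        azimuth = 2.0 * math.pi * i / n_frames
        x = radius * math.cos(elevation) * math.cos(azimuth)
        y = radius * math.cos(elevation) * math.sin(azimuth)
        z = radius * math.sin(elevation)
        positions.append((x, y, z))
    return positions


if __name__ == "__main__":
    # Print a short four-frame orbit at the default radius and elevation.
    for x, y, z in orbit_camera_positions(n_frames=4):
        print(f"{x:+.3f} {y:+.3f} {z:+.3f}")
```

Each position is one frame's viewpoint. A view synthesis model conditioned on such poses would be asked to render the object from every point on the circle, including the sides the input image never showed.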

The Stable Video Diffusion model was fine-tuned on a dataset rendered from 3D objects in Objaverse. When evaluated on the GSO and OmniObject3D benchmarks, SV3D outperformed baseline models and achieved new state-of-the-art performance.
