Vidu


The announcement was made at the 2024 Zhongguancun Forum in Beijing, where they tried to position Vidu as a strong competitor to OpenAI's Sora.

Vidu is capable of producing 16-second clips at 1080p resolution—Sora by comparison can generate 60-second videos. Vidu is based on a Universal Vision Transformer (U-ViT) architecture, which the company says allows it to simulate the real physical world with multi-camera view generation. 

This architecture was reportedly developed by the Shengshu Technology team in September 2022 and as such would predate the diffusion transformer (DiT) architecture used by Sora. 

Comments

Popular posts from this blog

Perplexity

Aphorisms: AI

DeepAI's Austen on China