Vidu

April 28, 2024

China's Shengshu Technology and Tsinghua University have unveiled Vidu, a text-to-video model capable of generating 16-second clips at 1080p resolution with a single click.

The announcement was made at the 2024 Zhongguancun Forum in Beijing, where they tried to position Vidu as a strong competitor to OpenAI's Sora.

Vidu is capable of producing 16-second clips at 1080p resolution—Sora by comparison can generate 60-second videos. Vidu is based on a Universal Vision Transformer (U-ViT) architecture, which the company says allows it to simulate the real physical world with multi-camera view generation.

This architecture was reportedly developed by the Shengshu Technology team in September 2022 and as such would predate the diffusion transformer (DiT) architecture used by Sora.

Search This Blog

chat ai news

Vidu

Comments

Post a Comment

Popular posts from this blog

Hamza Chaudhry

Perplexity

BYU study