

UniVG
Overview:
UniVG is a unified multi-modal video generation system that handles video generation tasks conditioned on text, images, or both. By introducing multi-condition cross-attention and biased Gaussian noise, it covers both high-freedom and low-freedom video generation. On the public academic benchmark MSR-VTT, it achieves the lowest Fréchet Video Distance (FVD), surpasses current open-source methods in human evaluation, and is comparable to the closed-source method Gen-2.
Target Users:
Suitable for multi-modal video generation scenarios, such as film special effects production and video content creation.
Features
Multi-Condition Cross Attention
Biased Gaussian Noise
Video Generation Task Processing
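The two mechanisms listed above are not documented in detail on this page, but they can be pictured with a short PyTorch sketch: condition tokens from the text and image encoders are concatenated and used as keys/values in one cross-attention call, and the sampling noise is biased toward an image latent instead of being drawn from a pure standard normal. This is a minimal sketch under assumed tensor shapes and an assumed bias weight, not UniVG's published implementation.

```python
# Minimal sketch of the two listed mechanisms (shapes and the bias weight are
# assumptions for illustration; this is not UniVG's actual implementation).
import torch
import torch.nn as nn

class MultiConditionCrossAttention(nn.Module):
    """Cross-attention whose keys/values come from several condition streams."""
    def __init__(self, dim: int = 320, heads: int = 8):
        super().__init__()
        self.attn = nn.MultiheadAttention(dim, heads, batch_first=True)

    def forward(self, video_tokens, text_tokens, image_tokens):
        # Concatenate all condition tokens so one attention call can mix them.
        cond = torch.cat([text_tokens, image_tokens], dim=1)
        out, _ = self.attn(query=video_tokens, key=cond, value=cond)
        return out

def biased_gaussian_noise(image_latent, bias: float = 0.3):
    """Start sampling from noise pulled toward the conditioning image latent."""
    eps = torch.randn_like(image_latent)
    return bias * image_latent + (1.0 - bias) * eps

# Toy shapes: batch 2, 16 video tokens, 8 text tokens, 4 image tokens, dim 320.
attn = MultiConditionCrossAttention(dim=320, heads=8)
video = torch.randn(2, 16, 320)
text = torch.randn(2, 8, 320)
image = torch.randn(2, 4, 320)
fused = attn(video, text, image)                     # (2, 16, 320)
noise = biased_gaussian_noise(torch.randn(2, 4, 32, 32))
```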
Traffic Sources
Direct Visits | 48.39% |
External Links | 35.85% |
Organic Search | 12.76% |
Social Media | 2.96% |
Display Ads | 0.02% |
| 0.03% |
Latest Traffic Situation
Monthly Visits | 25296.55k |
Average Visit Duration | 285.77 s |
Pages Per Visit | 5.83 |
Bounce Rate | 43.31% |
Total Traffic Trend Chart
Geographic Traffic Distribution
United States | 17.94% |
China | 17.08% |
India | 8.40% |
Russia | 4.58% |
Japan | 3.42% |
Global Geographic Traffic Distribution Map
Similar Open Source Products

SigLIP2
SigLIP2 is a multilingual vision-language encoder developed by Google, featuring improved semantic understanding, localization, and dense features. It supports zero-shot image classification, enabling direct image classification via text descriptions without requiring additional training. The model excels in multilingual scenarios and is suitable for various vision-language tasks. Key advantages include efficient image-text alignment, support for multiple resolutions and dynamic resolution adjustment, and robust cross-lingual generalization capabilities. SigLIP2 offers a novel solution for multilingual visual tasks, particularly beneficial for scenarios requiring rapid deployment and multilingual support.
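The zero-shot image classification described above can be tried with the Hugging Face `transformers` zero-shot-image-classification pipeline; a minimal sketch follows. The checkpoint name, the image path, and the candidate labels are assumptions for illustration, not part of this listing.

```python
# Minimal zero-shot image classification sketch (the checkpoint id, image path,
# and candidate labels below are assumptions for illustration).
from transformers import pipeline

classifier = pipeline(
    task="zero-shot-image-classification",
    model="google/siglip2-base-patch16-224",  # assumed SigLIP2 checkpoint id
)

result = classifier(
    "photo.jpg",  # path or URL to any local image
    candidate_labels=["a cat", "a dog", "a car"],
)
print(result)  # e.g. [{'score': 0.93, 'label': 'a cat'}, ...]
```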
AI model

TANGO Model
TANGO is a co-speech gesture video reenactment technology based on hierarchical audio-motion embedding and diffusion interpolation. It uses AI algorithms to convert speech signals into corresponding gesture animations, enabling the natural reenactment of gestures in video. The technology has broad application prospects in video production, virtual reality, and augmented reality, significantly enhancing the interactivity and realism of video content. TANGO was jointly developed by the University of Tokyo and CyberAgent AI Lab, representing the cutting edge of AI-driven gesture and motion generation.
AI video generation

DreamMesh4D
DreamMesh4D is a novel framework that combines mesh representation with sparse-controlled deformation to generate high-quality 4D objects from monocular video. It addresses the spatial-temporal consistency and surface texture quality issues of traditional methods that adopt implicit neural radiance fields (NeRF) or explicit Gaussian splatting as the underlying representation. Drawing inspiration from modern 3D animation workflows, DreamMesh4D binds Gaussian splats to triangle mesh surfaces, enabling differentiable optimization of both textures and mesh vertices. The framework starts from a coarse mesh produced by a single-image 3D generation method and builds a deformation graph by uniformly sampling sparse control points, improving computational efficiency while providing additional constraints. Through two-stage learning, it combines reference-view photometric loss, score distillation loss, and other regularization losses to learn the static surface Gaussians, the mesh vertices, and the dynamic deformation network. DreamMesh4D outperforms previous video-to-4D generation methods in rendering quality and spatial-temporal consistency, and its mesh-based representation is compatible with modern geometry processing pipelines, showcasing its potential in 3D gaming and the film industry.
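The description says Gaussian splats are bound to triangle mesh surfaces so that deforming the mesh moves the splats with it. A minimal sketch of such a binding via barycentric coordinates is shown below; the mesh, the sampling scheme, and the per-face splat count are assumptions for illustration, not DreamMesh4D's published code.

```python
# Sketch of binding splat centers to mesh triangles with barycentric weights,
# so moving the mesh vertices moves the splats (assumed scheme, illustrative).
import torch

def bind_splats_to_mesh(faces, splats_per_face=4):
    """Sample fixed barycentric weights; return (face_ids, bary) as the binding."""
    n_faces = faces.shape[0]
    face_ids = torch.arange(n_faces).repeat_interleave(splats_per_face)
    bary = torch.rand(n_faces * splats_per_face, 3)
    bary = bary / bary.sum(dim=1, keepdim=True)      # normalize to sum to 1
    return face_ids, bary

def splat_centers(vertices, faces, face_ids, bary):
    """Recompute splat centers from the (possibly deformed) mesh vertices."""
    tri = vertices[faces[face_ids]]                  # (n_splats, 3 verts, 3 xyz)
    return (bary.unsqueeze(-1) * tri).sum(dim=1)     # barycentric interpolation

# Toy mesh: 4 vertices, 2 triangles.
verts = torch.tensor([[0., 0., 0.], [1., 0., 0.], [0., 1., 0.], [1., 1., 0.]])
faces = torch.tensor([[0, 1, 2], [1, 3, 2]])
face_ids, bary = bind_splats_to_mesh(faces)
centers = splat_centers(verts, faces, face_ids, bary)          # rest pose
deformed = splat_centers(verts + 0.1, faces, face_ids, bary)   # follows the mesh
```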
AI video generation

Pyramid Flow
Pyramid Flow is an advanced video generation technique based on flow matching. It generates video autoregressively over a multi-resolution pyramid of latent representations. Its main advantage is training efficiency: high-quality video content can be generated with relatively few GPU hours on open-source datasets. Pyramid Flow was developed jointly by Peking University, Kuaishou Technology, and Beijing University of Posts and Telecommunications, with the paper, code, and models publicly released.
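Flow matching itself can be illustrated with a generic training step: interpolate between noise and data along a straight path, then regress the constant velocity that moves one to the other. The sketch below is a textbook flow-matching step under toy shapes and a toy model; it is not Pyramid Flow's pyramidal or autoregressive implementation.

```python
# Generic flow-matching training step (illustrative; not Pyramid Flow's
# actual pyramidal / autoregressive scheme).
import torch
import torch.nn as nn

model = nn.Sequential(nn.Linear(16 + 1, 64), nn.SiLU(), nn.Linear(64, 16))

x1 = torch.randn(8, 16)               # "data" latents (toy stand-in)
x0 = torch.randn(8, 16)               # pure Gaussian noise
t = torch.rand(8, 1)                  # random time in [0, 1]

xt = (1 - t) * x0 + t * x1            # straight-line interpolation
target_v = x1 - x0                    # velocity of that straight path

pred_v = model(torch.cat([xt, t], dim=1))
loss = ((pred_v - target_v) ** 2).mean()
loss.backward()                       # one optimizer step would follow
```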
AI video generation
Fresh Picks

PhysGen
PhysGen is an innovative method for image-to-video generation that transforms a single image and input conditions (such as force and torque applied to objects in the image) into realistic, physically plausible, and temporally coherent videos. This technology achieves dynamic simulation in image space by combining model-based physical simulation with data-driven video generation processes. The main advantages of PhysGen include producing videos that are both physically and visually realistic, and offering precise control, demonstrating its superiority over existing data-driven image-to-video generation methods through quantitative comparisons and comprehensive user studies.
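As a toy illustration of the "model-based physical simulation" side (the inputs are a force and a torque applied to an object in the image), the sketch below integrates a rigid body's 2D position and orientation with explicit Euler steps to produce a per-frame pose trajectory. The mass, inertia, time step, and state layout are assumptions; PhysGen's actual simulator is not shown here.

```python
# Toy 2D rigid-body integration from an applied force and torque
# (all constants are assumptions; this only illustrates the simulation idea).
import math

def simulate(force=(1.0, 0.0), torque=0.5, mass=1.0, inertia=0.1,
             dt=1.0 / 24.0, steps=48):
    x, y, theta = 0.0, 0.0, 0.0           # position and rotation (radians)
    vx, vy, omega = 0.0, 0.0, 0.0         # linear and angular velocity
    trajectory = []
    for _ in range(steps):
        ax, ay = force[0] / mass, force[1] / mass
        alpha = torque / inertia
        vx, vy, omega = vx + ax * dt, vy + ay * dt, omega + alpha * dt
        x, y, theta = x + vx * dt, y + vy * dt, theta + omega * dt
        trajectory.append((x, y, theta % (2 * math.pi)))
    return trajectory                     # per-frame pose to drive rendering

frames = simulate()
print(frames[-1])                         # pose after 2 seconds at 24 fps
```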
AI video generation

MIMO
MIMO is a versatile video synthesis model that can imitate any person performing complex motions and interacting with objects. It synthesizes character videos with controllable attributes such as character, action, and scene from simple user inputs (e.g., reference images, pose sequences, scene videos, or images). MIMO achieves this by encoding 2D video into compact spatial codes and decomposing them into three spatial components (the main subject, the underlying scene, and floating occlusions). This design lets users flexibly control the spatial motion representation and perform 3D-aware synthesis, making it suitable for interactive real-world scenarios.
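The three-way decomposition can be pictured as a simple container of per-frame codes. The field names, shapes, and the placeholder split below are assumptions for illustration only; in the actual model the decomposition is learned, not a fixed slice.

```python
# Illustrative container for MIMO's three spatial components
# (field names, shapes, and the split are assumptions, not the model's codes).
from dataclasses import dataclass
import torch

@dataclass
class SpatialCodes:
    subject: torch.Tensor     # main human subject, e.g. (frames, tokens, dim)
    scene: torch.Tensor       # underlying background scene
    occlusion: torch.Tensor   # floating occlusions in front of the subject

def decompose(video_latent: torch.Tensor) -> SpatialCodes:
    """Placeholder split along the token axis; the real model learns this."""
    subject, scene, occlusion = video_latent.chunk(3, dim=1)
    return SpatialCodes(subject, scene, occlusion)

codes = decompose(torch.randn(16, 96, 320))   # 16 frames, 96 tokens, dim 320
print(codes.subject.shape)                    # torch.Size([16, 32, 320])
```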
AI video generation

DualGS
Robust Dual Gaussian Splatting (DualGS) is a novel Gaussian-based volumetric video representation that captures complex human performances by optimizing joint and skin Gaussians, enabling robust tracking and high-fidelity rendering. Presented at SIGGRAPH Asia 2024, it supports real-time rendering on low-end mobile devices and VR headsets, providing a user-friendly, interactive experience. DualGS employs a hybrid compression strategy to achieve up to 120x compression, enabling more efficient storage and transmission of volumetric video.
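To put the quoted "up to 120x compression" in perspective, the back-of-the-envelope arithmetic below uses an assumed per-Gaussian attribute layout and an assumed splat count; both numbers are hypothetical, and only the 120x ratio comes from the description.

```python
# Back-of-the-envelope storage estimate (the Gaussian count and attribute
# layout are assumptions; only the 120x ratio comes from the description).
floats_per_gaussian = 3 + 4 + 3 + 1 + 48   # position, rotation, scale, opacity, SH color
bytes_per_gaussian = floats_per_gaussian * 4
num_gaussians = 200_000                    # hypothetical per-frame splat count

raw_mb_per_frame = num_gaussians * bytes_per_gaussian / 1e6
compressed_mb_per_frame = raw_mb_per_frame / 120

print(f"raw: {raw_mb_per_frame:.1f} MB/frame, "
      f"compressed: {compressed_mb_per_frame:.2f} MB/frame")
# raw: 47.2 MB/frame, compressed: 0.39 MB/frame
```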
AI video generation

LVCD
LVCD is a reference-based line art video colorization technique that employs a large-scale pretrained video diffusion model to produce colorized animation videos. It uses Sketch-guided ControlNet and Reference Attention to colorize animations with fast and large motions while maintaining temporal coherence. The main advantages of LVCD are temporally coherent colorized animation, effective handling of large motions, and high-quality output.
AI video generation

AI Faceless Video Generator
AI-Faceless-Video-Generator is a project that harnesses artificial intelligence technology to generate video scripts, voiceovers, and talking avatars based on a topic. It combines facial animation using SadTalker, voice generation with gTTS, and script creation with OpenAI's language model, providing an end-to-end solution for personalized video generation. Key benefits of this project include script generation, AI voice generation, facial animation creation, and a user-friendly interface.
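The project's three stages (script, voiceover, talking avatar) can be sketched as a small pipeline. The gTTS call uses that library's real API; the OpenAI model name and the SadTalker command line are assumptions and would need to match the repository's actual configuration and inference script.

```python
# Sketch of the script -> voiceover -> talking-avatar pipeline
# (the OpenAI model name and the SadTalker command are assumptions).
import subprocess
from gtts import gTTS
from openai import OpenAI

def write_script(topic: str) -> str:
    client = OpenAI()  # reads OPENAI_API_KEY from the environment
    resp = client.chat.completions.create(
        model="gpt-4o-mini",  # assumed model choice
        messages=[{"role": "user",
                   "content": f"Write a 60-second video script about {topic}."}],
    )
    return resp.choices[0].message.content

def synthesize_voice(script: str, out_path: str = "voiceover.mp3") -> str:
    gTTS(text=script, lang="en").save(out_path)   # gTTS text-to-speech
    return out_path

def animate_face(audio_path: str, face_image: str = "avatar.png") -> None:
    # Hypothetical invocation of SadTalker's inference script; flags may differ.
    subprocess.run(["python", "inference.py", "--driven_audio", audio_path,
                    "--source_image", face_image], check=True)

script = write_script("the history of coffee")
audio = synthesize_voice(script)
animate_face(audio)
```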
AI video generation
Alternatives

BAGEL
BAGEL is a scalable unified multi-modal model that aims to change how AI interacts with complex systems. The model supports dialogue reasoning, image generation, editing, style transfer, navigation, composition, thinking, and other functions; pretraining on large-scale interleaved video and web data gives it a foundation for generating high-fidelity, realistic images.
AI model
English Picks

Aya Vision
Aya Vision is an advanced visual model developed by the Cohere For AI team, focusing on multilingual and multimodal tasks and supporting 23 languages. The model significantly improves the performance of visual and text tasks through innovative algorithmic breakthroughs such as synthetic annotation, multilingual data augmentation, and multimodal model fusion. Its main advantages include efficiency (performing well even with limited computing resources) and extensive multilingual support. The release of Aya Vision aims to advance the forefront of multilingual and multimodal research and provide technical support to the global research community.
AI model

SigLIP2
SigLIP2 is a multilingual vision-language encoder developed by Google, featuring improved semantic understanding, localization, and dense features. It supports zero-shot image classification, enabling direct image classification via text descriptions without requiring additional training. The model excels in multilingual scenarios and is suitable for various vision-language tasks. Key advantages include efficient image-text alignment, support for multiple resolutions and dynamic resolution adjustment, and robust cross-lingual generalization capabilities. SigLIP2 offers a novel solution for multilingual visual tasks, particularly beneficial for scenarios requiring rapid deployment and multilingual support.
AI model

Jingyi Smart AI Video Generator
The Jingyi Smart AI Video Generator is a product that uses artificial intelligence to turn static old photos into dynamic videos. Combining deep learning and image processing techniques, it lets users effortlessly bring precious memories to life and create videos with sentimental value. Its main advantages are ease of use, realistic results, and personalized customization. It meets individual users' needs for organizing and re-imagining family visual materials while giving business users a novel marketing and promotion approach. The product currently offers a free trial, with detailed pricing and positioning yet to be announced.
AI video generation

TANGO Model
TANGO is a co-speech gesture video reenactment technology based on hierarchical audio-motion embedding and diffusion interpolation. It uses AI algorithms to convert speech signals into corresponding gesture animations, enabling the natural reenactment of gestures in video. The technology has broad application prospects in video production, virtual reality, and augmented reality, significantly enhancing the interactivity and realism of video content. TANGO was jointly developed by the University of Tokyo and CyberAgent AI Lab, representing the cutting edge of AI-driven gesture and motion generation.
AI video generation

Vmotionize
Vmotionize is a leading AI animation and 3D animation software capable of transforming videos, music, text, and images into stunning 3D animations. The platform offers advanced AI animation and motion capture tools, making high-quality 3D content and dynamic graphics more accessible. Vmotionize revolutionizes the way independent creators and global brands collaborate, enabling them to bring their ideas to life, share stories, and build virtual worlds through AI and human imagination.
AI video generation

Coverr AI Workflows
Coverr AI Workflows is a platform dedicated to AI video generation, offering a range of AI tools and workflows to help users produce high-quality video content through simple steps. The platform harnesses the expertise of AI video specialists, allowing users to learn how to utilize different AI tools for video creation through community-shared workflows. With the growing application of artificial intelligence in video production, Coverr AI Workflows lowers the technical barriers to video creation, enabling non-professionals to create professional-grade videos. Currently, Coverr AI Workflows provides free video and music resources, catering to the video production needs of creative individuals and small businesses.
AI video generation

AI Video Generation Tool
AI Video Generation Tool is an online tool that leverages artificial intelligence technology to convert images or text into video content. Through deep learning algorithms, it can comprehend the essence of images and text, automatically generating captivating video content. This technology significantly lowers the cost and barriers of video production, making it easy for ordinary users to create professional-level videos. Product background information indicates that with the rise of social media and video platforms, the demand for video content is rapidly increasing, while traditional video production methods are costly and time-consuming, struggling to meet the fast-changing market needs. The introduction of the AI Video Generation Tool fills this market gap, providing users with a fast and low-cost video production solution. Currently, the product offers a free trial; specific pricing can be checked on the website.
AI video generation

DreamMesh4D
DreamMesh4D is a novel framework that combines mesh representation with sparse-controlled deformation to generate high-quality 4D objects from monocular video. It addresses the spatial-temporal consistency and surface texture quality issues of traditional methods that adopt implicit neural radiance fields (NeRF) or explicit Gaussian splatting as the underlying representation. Drawing inspiration from modern 3D animation workflows, DreamMesh4D binds Gaussian splats to triangle mesh surfaces, enabling differentiable optimization of both textures and mesh vertices. The framework starts from a coarse mesh produced by a single-image 3D generation method and builds a deformation graph by uniformly sampling sparse control points, improving computational efficiency while providing additional constraints. Through two-stage learning, it combines reference-view photometric loss, score distillation loss, and other regularization losses to learn the static surface Gaussians, the mesh vertices, and the dynamic deformation network. DreamMesh4D outperforms previous video-to-4D generation methods in rendering quality and spatial-temporal consistency, and its mesh-based representation is compatible with modern geometry processing pipelines, showcasing its potential in 3D gaming and the film industry.
AI video generation
Featured AI Tools

Sora
AI video generation
17.0M

Animate Anyone
Animate Anyone aims to generate character videos from static images via driving signals. Leveraging the power of diffusion models, we propose a novel framework tailored for character animation. To maintain consistency of the intricate appearance features in the reference image, we design ReferenceNet to merge detailed features via spatial attention. To ensure controllability and continuity, we introduce an efficient pose guidance module to direct character movements and adopt an effective temporal modeling approach to ensure smooth cross-frame transitions. By extending the training data, our method can animate any character, achieving superior results in character animation compared to other image-to-video approaches. Moreover, we evaluate our method on benchmarks for fashion video and human dance synthesis, achieving state-of-the-art results.
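The "merge detailed features via spatial attention" step can be pictured as concatenating reference-image tokens with the denoising tokens along the spatial axis before self-attention, then keeping only the denoising half. The sketch below uses assumed shapes and a single attention layer; it is not the released ReferenceNet architecture.

```python
# Sketch of merging reference features via spatial self-attention
# (shapes are assumptions; not the actual ReferenceNet implementation).
import torch
import torch.nn as nn

class SpatialReferenceAttention(nn.Module):
    def __init__(self, dim: int = 320, heads: int = 8):
        super().__init__()
        self.attn = nn.MultiheadAttention(dim, heads, batch_first=True)

    def forward(self, denoise_tokens, reference_tokens):
        # Concatenate along the spatial/token axis, self-attend, and keep only
        # the denoising half so reference details flow into the generated frame.
        merged = torch.cat([denoise_tokens, reference_tokens], dim=1)
        out, _ = self.attn(merged, merged, merged)
        return out[:, : denoise_tokens.shape[1]]

layer = SpatialReferenceAttention()
frame = torch.randn(2, 256, 320)      # 16x16 latent tokens of one video frame
reference = torch.randn(2, 256, 320)  # tokens from the reference image
fused = layer(frame, reference)       # (2, 256, 320)
```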
AI video generation
11.4M