

NUWA
Overview :
NUWA is a suite of research projects developed by Microsoft, including NUWA, NUWA-Infinity, NUWA-LIP, Learning 3D Photography Videos, and NUWA-XL. These projects focus on pre-trained models for visual synthesis, capable of generating or manipulating visual data such as images and videos to perform various visual synthesis tasks.
Target Users :
Applicable to researchers and developers for tasks in visual synthesis, image, and video processing.
Use Cases
Generate new images or video content using the NUWA model
Perform infinite visual synthesis with NUWA-Infinity
Carry out language-guided image repair with NUWA-LIP
Features
Visual data generation and manipulation
Multi-modal pre-training
Infinite visual synthesis
Language-guided image repair
Self-supervised learning 3D photography videos
Long video generation
Featured AI Tools

Sora
AI video generation
17.0M

Animate Anyone
Animate Anyone aims to generate character videos from static images driven by signals. Leveraging the power of diffusion models, we propose a novel framework tailored for character animation. To maintain consistency of complex appearance features present in the reference image, we design ReferenceNet to merge detailed features via spatial attention. To ensure controllability and continuity, we introduce an efficient pose guidance module to direct character movements and adopt an effective temporal modeling approach to ensure smooth cross-frame transitions between video frames. By extending the training data, our method can animate any character, achieving superior results in character animation compared to other image-to-video approaches. Moreover, we evaluate our method on benchmarks for fashion video and human dance synthesis, achieving state-of-the-art results.
AI video generation
11.4M