

Dreamwaltz G
Overview :
DreamWaltz-G is an innovative framework designed for generating 3D avatars and expressive full-body animations driven by text. Its core components include skeletal-guided score distillation and blended 3D Gaussian avatar representation. By integrating skeletal control of 3D human templates into a 2D diffusion model, it enhances the consistency of viewpoints and human poses, yielding high-quality avatars while addressing issues such as multiple faces, extra limbs, and blurriness. Moreover, the blended 3D Gaussian avatar representation, which combines neural implicit fields with parameterized 3D meshes, facilitates real-time rendering, stable SDS optimization, and expressive animations. DreamWaltz-G excels in generating and animating 3D avatars, surpassing existing methods in both visual quality and animation expressiveness. This framework also supports various applications, including human video reenactment and multi-subject scene compositions.
Target Users :
The target audience for DreamWaltz-G includes 3D animators, game developers, virtual reality content creators, and any professionals needing to generate or edit 3D avatars and animations. This technology simplifies the creation process of 3D content, enhances the quality and expressiveness of animations, and lowers the technical barrier, enabling non-professionals to effortlessly create personalized 3D avatars and animations.
Use Cases
Animators use DreamWaltz-G to create high-quality 3D animated characters for films.
Game developers leverage this framework to generate NPC characters and animations in games.
Virtual reality content creators utilize DreamWaltz-G to develop interactive virtual characters.
Features
Skeletal-guided score distillation: Integrates skeletal control of 3D human templates into a 2D diffusion model, enhancing the consistency of generated avatars.
Blended 3D Gaussian avatar representation: Combines neural implicit fields with parameterized 3D meshes to support real-time rendering and stable optimization.
High-quality avatar generation: Addresses issues related to multiple faces, extra limbs, and blurriness.
Support for multiple applications: Including video reenactment and multi-subject scene compositions.
Text-driven 3D avatar creation: Users can generate 3D avatars using text descriptions.
Expressive full-body animation: The generated 3D avatars can be animated expressively.
Shape control during training: Control shapes by modifying the SMPL-X template during training.
Shape editing during inference: Edit shapes during inference by explicitly adjusting 3D Gaussians.
How to Use
Visit the DreamWaltz-G project page.
Read the project introduction and related papers to understand the technical background and principles.
Review the code repository to download the necessary software and code.
Follow the documentation to set up the development environment and dependencies.
Experiment with generating 3D avatars using text descriptions.
Explore how to optimize generated avatars through skeletal guidance and blended representations.
Learn how to apply generated 3D avatars in video reenactments and scene compositions.
Engage in community discussions to obtain technical support and best practices.
Featured AI Tools
Chinese Picks

Capcut Dreamina
CapCut Dreamina is an AIGC tool under Douyin. Users can generate creative images based on text content, supporting image resizing, aspect ratio adjustment, and template type selection. It will be used for content creation in Douyin's text or short videos in the future to enrich Douyin's AI creation content library.
AI image generation
9.0M

Outfit Anyone
Outfit Anyone is an ultra-high quality virtual try-on product that allows users to try different fashion styles without physically trying on clothes. Using a two-stream conditional diffusion model, Outfit Anyone can flexibly handle clothing deformation, generating more realistic results. It boasts extensibility, allowing adjustments for poses and body shapes, making it suitable for images ranging from anime characters to real people. Outfit Anyone's performance across various scenarios highlights its practicality and readiness for real-world applications.
AI image generation
5.3M