

Blockfusion
Overview :
BlockFusion is a diffusion-based model that can generate 3D scenes and seamlessly integrate new blocks into existing scenes. It is trained on a dataset of 3D blocks sampled randomly from complete 3D scene meshes. Through piece-wise fitting, all training blocks are converted into hybrid neural fields: consisting of triangular meshes containing geometric features, followed by multi-layer perceptrons (MLPs) used to decode signed distance values. A variational autoencoder is used to compress the triangular meshes into a latent triangular space for denoising diffusion. Diffusion applied to the latent representations enables the generation of high-quality and diverse 3D scenes. During scene expansion, only empty blocks are appended to overlap with the current scene, and existing latent triangles are extrapolated to fill the new blocks. Extrapolation is achieved by modulating the generation process using feature samples from overlapping triangles during the denoising iterations. Latent triangle extrapolation produces semantically and geometrically meaningful transitions, harmoniously blending with the existing scene. A 2D layout control mechanism is used to regulate the placement and arrangement of scene elements. Experimental results demonstrate that BlockFusion can generate diverse, geometrically consistent, and high-quality large-scale indoor and outdoor 3D scenes.
Target Users :
BlockFusion can be used in game development, virtual reality applications, architectural design, and other fields.
Use Cases
Game development: Use BlockFusion to generate diverse game scenes.
Virtual reality applications: Utilize BlockFusion to create realistic virtual environments.
Architectural design: Employ BlockFusion to generate indoor and outdoor scenes during architectural design.
Features
Generate 3D scenes
Seamlessly integrate new 3D blocks
Extrapolate existing latent triangles to fill new blocks
Control the placement and arrangement of scene elements
Featured AI Tools
Chinese Picks

Capcut Dreamina
CapCut Dreamina is an AIGC tool under Douyin. Users can generate creative images based on text content, supporting image resizing, aspect ratio adjustment, and template type selection. It will be used for content creation in Douyin's text or short videos in the future to enrich Douyin's AI creation content library.
AI image generation
9.0M

Outfit Anyone
Outfit Anyone is an ultra-high quality virtual try-on product that allows users to try different fashion styles without physically trying on clothes. Using a two-stream conditional diffusion model, Outfit Anyone can flexibly handle clothing deformation, generating more realistic results. It boasts extensibility, allowing adjustments for poses and body shapes, making it suitable for images ranging from anime characters to real people. Outfit Anyone's performance across various scenarios highlights its practicality and readiness for real-world applications.
AI image generation
5.3M