

Layerdiffusion
Overview :
LayerDiffusion is a method that enables large-scale pre-trained latent diffusion models to generate transparent images. This method allows for the generation of single transparent images or multiple transparent layers. It learns a 'latent transparency' by encoding the Alpha channel transparency into the latent space of a pre-trained latent diffusion model. By adjusting the added transparency as a latent offset, the method minimally alters the original latent distribution of the pre-trained model, preserving the production-ready quality of large diffusion models. By fine-tuning the latent space, any latent diffusion model can be converted into a transparent image generator. We trained the model on a dataset of 1 million transparent image layers collected through human-in-the-loop data gathering. We demonstrate that latent transparency can be applied to different open-source image generators or adapted to various conditioning systems, enabling applications such as foreground/background conditioned layer generation, joint layer generation, and content structure control of layers. User studies revealed that in most cases (97%), users preferred our locally generated transparent content over previous makeshift solutions like generating and then removing the background. Users also reported that the quality of our generated transparent images is comparable to real commercial transparent assets from sources like Adobe Stock.
Target Users :
Used for generating transparent images or layers, suitable for design, image processing and other fields.
Use Cases
Generate transparent images for product design
Generate transparent layers for image composition
Control content structure of layers to generate custom images
Features
Generate transparent images
Generate multiple transparent layers
Learn latent transparency
Applicable to different image generators
Adaptable to various conditioning systems
Foreground/background conditioned layer generation
Joint layer generation
Content structure control of layers
Featured AI Tools
Chinese Picks

Capcut Dreamina
CapCut Dreamina is an AIGC tool under Douyin. Users can generate creative images based on text content, supporting image resizing, aspect ratio adjustment, and template type selection. It will be used for content creation in Douyin's text or short videos in the future to enrich Douyin's AI creation content library.
AI image generation
9.0M

Outfit Anyone
Outfit Anyone is an ultra-high quality virtual try-on product that allows users to try different fashion styles without physically trying on clothes. Using a two-stream conditional diffusion model, Outfit Anyone can flexibly handle clothing deformation, generating more realistic results. It boasts extensibility, allowing adjustments for poses and body shapes, making it suitable for images ranging from anime characters to real people. Outfit Anyone's performance across various scenarios highlights its practicality and readiness for real-world applications.
AI image generation
5.3M