Masked Diffusion Transformer (MDT) : Masked Diffusion Transformer is the latest technology in image synthesis, achieving SOTA (State of the Art) at ICCV 2023.

Masked Diffusion Transformer (MDT)

AI image generation AI model #Image #Image Synthesis #Deep Learning #Masked Transformer #SOTA Standard Picks Open Source

Overview :

MDT explicitly enhances the ability of diffusion probability models (DPMs) to learn relationships between object parts in images by introducing a masked latent model scheme. MDT operates in the latent space during training, masking certain tokens, and then designs an asymmetrical diffusion transformer to predict masked tokens from unmasked tokens while maintaining the diffusion generation process. MDTv2 further improves the performance of MDT through more efficient macro network structures and training strategies.

Target Users :

Suitable for researchers and developers who require high-quality image synthesis, particularly in the fields of image generation and deep learning.

Total Visits： 474.6M

Top Region： US(19.34%)

Website Views ： 57.7K

Use Cases

Generate high-resolution images using MDT

Achieve fast learning in image synthesis tasks

Utilize MDTv2 to improve the FID score of image synthesis

Features

Image Synthesis

Masked Latent Model Scheme