Fouriscale : Frequency-based method for training free high-resolution image synthesis.

Fouriscale

AI image generation AI image enhancement #High-resolution image #Frequency analysis #No-training Standard Picks Open Source

Overview :

FouriScale explores high-resolution image generation from pre-trained diffusion models from a frequency analysis perspective. Through an innovative, no-training method, it replaces the original convolutional layers in pre-trained diffusion models with a combination of dilation techniques and low-pass operations, further enhanced by a fill-and-crop strategy. This allows for flexible handling of various aspect ratios in text-to-image generation. Guided by FouriScale, this method successfully balances the structural integrity and fidelity of generated images, achieving remarkable capabilities for arbitrary-sized, high-resolution, and high-quality generation. With its simplicity and compatibility, this method provides valuable insights for future explorations in ultra-high-resolution image synthesis.

Target Users :

Used for generating high-resolution images and text-to-image synthesis.

Total Visits： 474.6M

Top Region： US(19.34%)

Website Views ： 66.8K

Use Cases

Generate high-quality anime-style avatars

Text-to-high-resolution image generation

Handle large-size image generation requirements

Features

Generate high-resolution images from pre-trained diffusion models

Handle repetitive patterns and structural distortions