CogView
Overview:
CogView is a pretrained Transformer model for general-domain text-to-image generation. It has 4.1 billion parameters and can generate diverse, high-quality images. Training proceeds from the general to the specific: the model is first pretrained to acquire general knowledge and then fine-tuned on specific domains, which improves the quality of the generated images. The research paper also introduces two techniques for stabilizing the training of large models: PB-relax and Sandwich-LN.
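The overview only names the two stabilization techniques, so the following is a minimal sketch of how they are commonly described, not CogView's actual code. It assumes PB-relax means scaling the query down by a constant before the attention dot product (so intermediate scores stay small in half precision) and restoring the scale after subtracting the maximum, and that Sandwich-LN means adding a second LayerNorm at the end of each residual branch. All function names and the value alpha=32 are assumptions made for illustration.

```python
import math

def softmax(xs):
    # Numerically stable softmax over a list of floats.
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]

def attention_weights_standard(q, keys):
    # softmax(q . k_i / sqrt(d)) for a single query vector.
    d = len(q)
    scores = [sum(a * b for a, b in zip(q, k)) / math.sqrt(d) for k in keys]
    return softmax(scores)

def attention_weights_pb_relax(q, keys, alpha=32.0):
    # Assumed PB-relax form: divide q by alpha BEFORE the dot product so the
    # raw scores stay small, then subtract the max and multiply by alpha
    # to recover the original scale. Mathematically equal to the standard form.
    d = len(q)
    q_small = [a / (alpha * math.sqrt(d)) for a in q]
    scores = [sum(a * b for a, b in zip(q_small, k)) for k in keys]
    m = max(scores)
    return softmax([(s - m) * alpha for s in scores])

def layer_norm(x, eps=1e-5):
    # LayerNorm without learned gain/bias, to keep the sketch short.
    mean = sum(x) / len(x)
    var = sum((v - mean) ** 2 for v in x) / len(x)
    return [(v - mean) / math.sqrt(var + eps) for v in x]

def sandwich_ln_block(x, sublayer):
    # Pre-LN computes x + f(LN(x)). The assumed Sandwich-LN form also
    # normalizes the branch output: x + LN(f(LN(x))).
    branch = layer_norm(sublayer(layer_norm(x)))
    return [a + b for a, b in zip(x, branch)]

if __name__ == "__main__":
    q = [1.0, 2.0, 3.0, 4.0]
    keys = [[0.5, -1.0, 2.0, 0.0], [1.0, 1.0, 1.0, 1.0], [-2.0, 0.3, 0.1, 4.0]]
    print(attention_weights_standard(q, keys))
    print(attention_weights_pb_relax(q, keys))
    # A sublayer with a huge output scale; the second LayerNorm bounds it.
    print(sandwich_ln_block(q, lambda h: [1000.0 * v for v in h]))
```

In this sketch the two attention functions return the same weights up to floating-point error; the difference only matters in low-precision arithmetic, where the unscaled scores could overflow. The second LayerNorm in `sandwich_ln_block` keeps the residual branch bounded regardless of how large the sublayer output grows.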
Target Users:
Text-to-Image Generation, Image Super-Resolution, Semantic Understanding
Total Visits: 474.6M
Top Region: US (19.34%)
Website Views: 67.9K
Use Cases
A fluffy cat sitting on a table
A pink rose blooming in the sunlight
A flock of white clouds floating in the blue sky
Features
Generate images that match natural-language descriptions
Support both Chinese and English inputs
Upgrade image quality via super-resolution
Enable post-filtering of generated samples