

SPRIGHT
Overview :
SPRIGHT is a large-scale visual language dataset and model focusing on spatial relationships. It constructs the SPRIGHT dataset by re-describing 6 million images, significantly increasing the spatial phrases in the descriptions. The model is fine-tuned on 444 images containing numerous objects to optimize the generation of images with spatial relationships. SPRIGHT achieves state-of-the-art spatial consistency in multiple benchmark tests while improving image quality scores.
Target Users :
SPRIGHT can be applied to any scenario requiring the generation of images with reasonable spatial layouts, such as interior design, floor plan creation, and robot environment simulation.
Use Cases
A living room with a fireplace, sofa on the right side of the fireplace, coffee table in front of the sofa.
A basket full of fruit, apples on the left, bananas on the right, and oranges in the middle.
A cityscape with skyscrapers on both sides of the road, a fountain in the middle.
Features
Large-scale spatial relationship dataset SPRIGHT
Fine-tuned on images with numerous objects to optimize spatial consistency
Achieves state-of-the-art spatial consistency in multiple benchmark tests
Improves image quality scores FID and CMMD
Featured AI Tools
Chinese Picks

Capcut Dreamina
CapCut Dreamina is an AIGC tool under Douyin. Users can generate creative images based on text content, supporting image resizing, aspect ratio adjustment, and template type selection. It will be used for content creation in Douyin's text or short videos in the future to enrich Douyin's AI creation content library.
AI image generation
9.0M

Outfit Anyone
Outfit Anyone is an ultra-high quality virtual try-on product that allows users to try different fashion styles without physically trying on clothes. Using a two-stream conditional diffusion model, Outfit Anyone can flexibly handle clothing deformation, generating more realistic results. It boasts extensibility, allowing adjustments for poses and body shapes, making it suitable for images ranging from anime characters to real people. Outfit Anyone's performance across various scenarios highlights its practicality and readiness for real-world applications.
AI image generation
5.3M