Generative Powers of Ten
G
Generative Powers Of Ten
Overview :
Generative Powers of Ten is a method for generating multi-scale consistent content using text-to-image models. It enables extreme semantic zoom of a scene, ranging from a wide-angle landscape view of a forest to a macro shot of an insect on a branch. This representation allows us to render continuous zoom videos or interactively explore different scales of a scene. We achieve this through a joint multi-scale diffusion sampling method that encourages consistency across different scales while preserving the integrity of each individual sampling process. Since each generated scale is guided by different text prompts, our method can achieve a deeper level of zoom than traditional super-resolution methods, which may struggle to create new contextual structures at completely different scales. We conducted qualitative comparisons of our method against image super-resolution and external sketching techniques and demonstrated that our method is most effective at generating consistent multi-scale content.
Target Users :
Users can use Generative Powers of Ten to generate videos with multi-scale continuous zoom or guide the zoom based on an input image.
Total Visits: 408
Top Region: US(57.62%)
Website Views : 51.6K
Use Cases
Generate a continuous zoom video from a forest landscape to an insect macro shot using Generative Powers of Ten
Implement seamless zoom of a real image using Generative Powers of Ten
Explore a multi-scale scene interactively using Generative Powers of Ten
Features
Generates videos with multi-scale continuous zoom based on text descriptions
Guides zoom level to match an input image
Different results can be obtained from the same input prompt by changing the seed
Benchmarked against Stable Diffusion super-resolution and external sketching models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase