

WHAM
Overview :
WHAM (World and Human Action Model) is a generative model developed by Microsoft Research, specifically designed for generating game scenes and player behaviors. Trained on data from Ninja Theory's 'Bleeding Edge' game, it can generate coherent and diverse game visuals and controller actions. WHAM's primary advantage lies in its ability to capture the 3D structure of game environments and the temporal sequences of player actions, providing a powerful tool for game design and creative exploration. The model primarily targets academic research and the game development field, helping developers rapidly iterate on game designs.
Target Users :
WHAM is primarily designed for game developers and researchers, assisting them in exploring the application of generative AI in game design and rapidly iterating on game scenes and player behavior ideas.
Use Cases
Use WHAM to generate character actions and scenes in the game 'Bleeding Edge'.
Provide creative iteration support for game design based on WHAM's model inference.
Display the generated game visuals and controller actions in real-time using the WHAM demo tool.
Features
Generates game visuals and controller actions
Supports three modes: world modeling, behavioral strategies, and full generation
Captures the 3D structure of game environments and the temporal sequences of player actions
Offers two model sizes (200M parameters and 1.6B parameters) to suit different needs
Supports generating game sequences using initial visuals or controller actions as prompts
Provides local model inference and demonstration tools
Evaluates the model's consistency, diversity, and persistence
Supports various application scenarios in academic research and game development
How to Use
1. Clone the WHAM GitHub repository and set up a virtual environment.
2. Download the model weights file (either the 200M or 1.6B parameter model).
3. Prepare sample data or use the provided sample data.
4. Run the local model inference script to generate game sequences.
5. Use the WHAM demo tool to connect to the model server and display the generated results in real-time.
6. Adjust model parameters or prompt inputs as needed to explore different generative effects.
Featured AI Tools

Gemini
Gemini is the latest generation of AI system developed by Google DeepMind. It excels in multimodal reasoning, enabling seamless interaction between text, images, videos, audio, and code. Gemini surpasses previous models in language understanding, reasoning, mathematics, programming, and other fields, becoming one of the most powerful AI systems to date. It comes in three different scales to meet various needs from edge computing to cloud computing. Gemini can be widely applied in creative design, writing assistance, question answering, code generation, and more.
AI Model
11.4M
Chinese Picks

Liblibai
LiblibAI is a leading Chinese AI creative platform offering powerful AI creative tools to help creators bring their imagination to life. The platform provides a vast library of free AI creative models, allowing users to search and utilize these models for image, text, and audio creations. Users can also train their own AI models on the platform. Focused on the diverse needs of creators, LiblibAI is committed to creating inclusive conditions and serving the creative industry, ensuring that everyone can enjoy the joy of creation.
AI Model
6.9M