

LEO
Overview :
LEO is a multimodal, multi-task all-in-one agent based on a large language model, capable of perceiving, localizing, reasoning, planning, and executing tasks in the 3D world. LEO achieves this through two stages of training: (i) 3D visual-language alignment and (ii) 3D visual-language action instruction tuning. We carefully curated and generated a large-scale dataset with object-level and scene-level multimodal tasks, requiring deep understanding and interaction with the 3D world. Through rigorous experiments, we demonstrate LEO's outstanding performance across a wide range of tasks, including 3D captioning, question answering, reasoning, navigation, and robot manipulation."
Target Users :
LEO can be used to complete a variety of tasks in the 3D world, including 3D captioning, question answering, reasoning, navigation, and robot manipulation.
Features
3D Visual-Language Alignment
3D Visual-Language Action Instruction Tuning
3D Captioning
Question Answering
Reasoning
Navigation
Robot Manipulation
Featured AI Tools

Alice
Alice is a lightweight AI agent designed to create a self-contained AI assistant similar to JARVIS. It achieves this by building a "text computer" centered around a large language model (LLM). Alice excels in tasks like topic research, coding, system administration, literature reviews, and complex mixed tasks that go beyond these basic capabilities. Alice has achieved near-perfect performance in everyday tasks using GPT-4 and is leveraging the latest open-source models for practical application.
AI Agents
459.8K

Feshua Smart Assistant
Feshua Smart Assistant is an intelligent assistant product that allows users to choose their favorite avatar, set a name, and remember user behavior on Feshua. It supports the deployment of business applications on Feshua, enabling cross-system task completion and a unified user experience. The product aims to enhance work efficiency and creativity, serving as a new type of digital employee for enterprises.
AI Agents
206.4K