WebVoyager
W
Webvoyager
Overview :
WebVoyager is an innovative large multimodal model (LMM)-powered web agent that can complete user instructions end-to-end by interacting with real-world websites. We propose a novel web agent evaluation protocol to address the challenge of automatic evaluation for open-world agent tasks, leveraging the powerful multimodal understanding capabilities of GPT-4V. We collected real-world tasks from 15 widely used websites to evaluate our agent. We demonstrate that WebVoyager achieves a 55.7% task success rate, significantly outperforming the performance of GPT-4 (with all tools) and WebVoyager (text only) settings, highlighting WebVoyager's superior capabilities in practical applications. We find that our proposed automatic evaluation achieves 85.3% consistency with human judgment, paving the way for further development of web agents in real-world environments.
Target Users :
WebVoyager can be used to automatically execute real-world website tasks, suitable for scenarios requiring large-scale information processing and interaction.
Total Visits: 29.7M
Top Region: US(17.94%)
Website Views : 51.3K
Use Cases
Automation of webpage content updates
Real-time website interaction
Automatic execution of website tasks
Features
Completes user instructions end-to-end
Interacts with real-world websites
Possesses powerful multimodal understanding capabilities
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase