

WebVoyager
Overview:
WebVoyager is a web agent powered by a large multimodal model (LMM) that completes user instructions end-to-end by interacting with real-world websites. We propose a new evaluation protocol for web agents that uses the multimodal understanding of GPT-4V to address the difficulty of automatically evaluating open-ended agent tasks. To evaluate the agent, we collected real-world tasks from 15 widely used websites. WebVoyager achieves a 55.7% task success rate, significantly outperforming both GPT-4 (All Tools) and a text-only WebVoyager setting. Our automatic evaluation agrees with human judgment in 85.3% of cases, which supports its use in further development of web agents in real-world environments.
Target Users:
WebVoyager suits users who need to automate tasks on real-world websites, particularly in scenarios that involve large-scale information processing and interaction.
Use Cases
Automation of webpage content updates
Real-time website interaction
Automatic execution of website tasks
Features
Completes user instructions end-to-end
Interacts with real-world websites
Possesses powerful multimodal understanding capabilities
Featured AI Tools

OpenUI
Building UI components is often tedious work. OpenUI aims to make the process fun, fast, and flexible. It is also the tool we use at W&B to test and prototype our next generation of tooling for building powerful applications on top of LLMs. You can describe a UI in your own words and see it rendered in real time. You can then ask for changes and convert the HTML to React, Svelte, Web Components, and more. Think of it as an open-source, less polished version of v0.
AI Development Assistant
757.9K

OpenDevin
OpenDevin is an open-source project aiming to replicate, enhance, and innovate Devin—an autonomous AI software engineer capable of executing complex engineering tasks and actively collaborating with users on software development projects. Through the power of the open-source community, the project explores and expands Devin's capabilities, identifies its strengths and areas for improvement, thus guiding the advancement of open-source code models.
AI Development Assistant
594.8K