

Eureka
Overview :
Eureka is a human-level reward design algorithm powered by coding large language models. It uses the zero-shot generation, code-writing, and in-context improvement capabilities of state-of-the-art language models (such as GPT-4) to perform evolutionary optimization over reward code. The resulting rewards can then be used to acquire complex skills through reinforcement learning. In a suite of 29 open-source reinforcement learning environments covering 10 distinct robot morphologies, Eureka-generated reward functions outperform those designed by human experts on most tasks. Eureka can also incorporate human feedback to improve the quality and safety of the generated rewards. Combined with curriculum learning, Eureka rewards enabled the first demonstration of a simulated Shadow Hand performing pen-spinning tricks, rapidly rotating a pen in circles.
Target Users :
Researchers and engineers working on tasks that require reward design and reinforcement learning.
Features
Utilizes large language models for reward design
Generates complex reward functions through evolutionary optimization
Uses generated reward functions for reinforcement learning
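The loop described above (sample candidate rewards, train with each, keep the best, and refine from it) can be sketched roughly as follows. This is a toy illustration under loud assumptions, not Eureka's actual implementation: `propose` is a hypothetical stand-in for the language model, `train_and_score` is a hypothetical stand-in for reinforcement learning training plus task evaluation, and a candidate is a plain weight vector rather than generated reward code.

```python
import random
from typing import List, Optional, Tuple

# Assumption: a "candidate" is a weight vector standing in for reward code.
Candidate = List[float]


def propose(best: Optional[Candidate], k: int, rng: random.Random) -> List[Candidate]:
    """Hypothetical stand-in for the language model: return k candidates.

    With no previous best, sample from scratch (the zero-shot step);
    otherwise perturb the best candidate (the in-context improvement step).
    """
    if best is None:
        return [[rng.uniform(-1, 1) for _ in range(3)] for _ in range(k)]
    return [[w + rng.gauss(0, 0.2) for w in best] for _ in range(k)]


def train_and_score(candidate: Candidate) -> float:
    """Hypothetical stand-in for RL training plus task-fitness evaluation.

    Higher is better; the optimum is an arbitrary target chosen for the demo.
    """
    target = [0.5, -0.3, 0.8]
    return -sum((w - t) ** 2 for w, t in zip(candidate, target))


def evolve(iterations: int = 5, k: int = 8, seed: int = 0) -> Tuple[Candidate, float, List[float]]:
    """Greedy evolutionary search: keep the best candidate seen so far."""
    rng = random.Random(seed)
    best: Optional[Candidate] = None
    best_score = float("-inf")
    history: List[float] = []  # best-so-far fitness after each iteration
    for _ in range(iterations):
        for cand in propose(best, k, rng):
            score = train_and_score(cand)
            if score > best_score:
                best, best_score = cand, score
        history.append(best_score)
    assert best is not None
    return best, best_score, history


if __name__ == "__main__":
    best, score, history = evolve()
    print("best-so-far fitness per iteration:", [round(s, 4) for s in history])
```

Because only the best candidate is carried forward, the best-so-far fitness never decreases from one iteration to the next. In a real setup, the scoring step would be by far the most expensive part, since each candidate requires a full training run.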