

Databonsai
Overview :
databonsai is a Python library that leverages Large Language Models (LLMs) to execute data cleaning tasks. It offers a range of tools including data categorization, transformation, and extraction, as well as validation of LLM outputs. It supports batch processing to save tokens and features retry logic to handle rate limits and transient errors.
Target Users :
["Data Scientist: Rapidly classify and clean large datasets to facilitate further analysis.","Developer: Integrable into applications to automate the data preprocessing workflow.","Corporate User: Improve processing efficiency and reduce costs through automated data cleaning."]
Use Cases
Classifying and sentiment analysis of social media comments.
Automated archiving and thematic classification of news articles.
Organizing and extracting customer feedback data for product improvements.
Features
Data Classification: Uses LLMs to categorize data into predefined categories.
Data Transformation: Converts data using prompts.
Data Extraction: Extracts data into structured format according to patterns.
Batch Processing: Saves tokens by sending one model and example to classify a batch of data.
Retry Logic: Built-in retry logic to handle API-related errors.
Progress Bar: Provides progress feedback when processing large datasets.
Automatic Batch Processing: Automatically adjusts batch size to optimize token usage and error handling.
How to Use
1. Install the databonsai library.
2. Create a .env file with the API key in the root directory of your project.
3. Set up the LLM provider and categories.
4. Use the categorize function to classify individual data records.
5. Use the categorize_batch function to classify data in batches.
6. Use the apply_to_column_autobatch function to automate batch processing for DataFrames or lists.
7. Monitor the progress bar to understand current processing progress.
8. Adjust the batch size or use a better LLM model if errors occur.
Featured AI Tools

Openui
Building UI components is often tedious work. OpenUI aims to make this process fun, quick, and flexible. This is the tool we use at W&B to test and prototype the next generation of tools, built on top of LLMs to create powerful applications. You can describe your UI with imagination, and then see the rendering effect in real time. You can request changes, and convert HTML to React, Svelte, Web Components, and more. Think of it as an open-source and less polished version of a V0.
AI Development Assistant
757.9K

Opendevin
OpenDevin is an open-source project aiming to replicate, enhance, and innovate Devin—an autonomous AI software engineer capable of executing complex engineering tasks and actively collaborating with users on software development projects. Through the power of the open-source community, the project explores and expands Devin's capabilities, identifies its strengths and areas for improvement, thus guiding the advancement of open-source code models.
AI Development Assistant
594.8K