

TF ID
Overview :
TF-ID is an object detection model series created by Yifei Hu for extracting tables and figures from academic papers. These models are fine-tuned based on the microsoft/Florence-2 checkpoint, offering versions with or without title text. Their aim is to enhance the accessibility and processing efficiency of information in academic literature.
Target Users :
TF-ID is primarily designed for researchers and scholars who need to process a large volume of academic papers, especially those who require automated extraction of tables and figures from literature. It saves time in manually searching and organizing data, thereby improving research efficiency.
Use Cases
Researchers use TF-ID to automatically extract experimental result tables from academic papers.
Scholars utilize the TF-ID model to analyze chart data from historical literature.
Educational institutions adopt TF-ID to assist students in quickly obtaining statistical information from literature.
Features
Extract tables and figures from academic papers
Provide versions with and without title text
Fine-tuned from the microsoft/Florence-2 model checkpoint
Supports custom model training
Open-source model weights and annotated datasets
Detailed training and usage guides provided
How to Use
Clone the TF-ID GitHub repository locally.
Download and prepare the required datasets and annotation files.
Place the annotated files and image files in the specified directory as required.
Use the provided scripts to convert the dataset into the required format.
Launch model training using the Accelerate tool.
After training is complete, use the trained checkpoint for model inference.
Featured AI Tools

Yolov8
YOLOv8 is the latest version of the YOLO (You Only Look Once) family of object detection models. It can accurately and rapidly identify and locate multiple objects in images or videos, and track their movements in real time. Compared to previous versions, YOLOv8 has significantly improved detection speed and accuracy, while also supporting a variety of additional computer vision tasks, such as instance segmentation and pose estimation. YOLOv8 can be deployed on various hardware platforms in different formats, providing a one-stop end-to-end object detection solution.
AI image detection and recognition
228.3K

Lexy
Lexy is an AI-powered image text extraction tool. It can automatically recognize text in images and extract it for user convenience in subsequent processing and analysis. Lexy boasts high accuracy and fast recognition speed, suitable for various image text extraction scenarios. Whether you are an individual user needing to extract text from images or an enterprise user requiring large-scale image text processing, Lexy can meet your needs.
AI image detection and recognition
221.6K