TF-ID: A tool for recognizing tables and figures in academic literature

Category: AI image detection and recognition (AI model)

Tags: #Academic literature #Object detection #Information extraction #Automation #Open Source

Overview:

TF-ID (Table/Figure IDentifier) is a family of object detection models created by Yifei Hu for extracting tables and figures from academic papers. The models are fine-tuned from the microsoft/Florence-2 checkpoints and come in two variants: one extracts each table or figure together with its caption text, the other extracts it without the caption. The goal is to make the information in academic literature easier to access and faster to process.

Target Users:

TF-ID is aimed at researchers and scholars who process large volumes of academic papers and need to extract tables and figures automatically. It replaces manual searching, cropping, and organizing, which shortens the time spent preparing material for research.


Use Cases

Researchers use TF-ID to automatically extract experimental result tables from academic papers.

Scholars use TF-ID to extract figures from historical literature so that the chart data can be analyzed.

Educational institutions adopt TF-ID to assist students in quickly obtaining statistical information from literature.

Features

Extract tables and figures from academic papers

Provide versions with and without caption text

Fine-tuned from the microsoft/Florence-2 model checkpoint

Supports custom model training

Open-source model weights and annotated datasets

Detailed training and usage guides provided
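As a rough illustration of what "extracting tables and figures" yields in practice, the sketch below groups detection results by label and clamps each box to the page so the regions can be cropped afterwards. It assumes the result follows the object detection output shape of the Florence-2 processor (a dict with parallel `bboxes` and `labels` lists, boxes as `[x1, y1, x2, y2]` in pixels). The function name `group_detections` and the label strings `table` and `figure` are assumptions for illustration, not taken from the TF-ID repository.

```python
from typing import Dict, List, Tuple

Box = Tuple[int, int, int, int]


def group_detections(od_result: dict, page_width: int, page_height: int) -> Dict[str, List[Box]]:
    """Group detected boxes by label and clamp them to the page bounds.

    `od_result` is assumed to look like
    {"bboxes": [[x1, y1, x2, y2], ...], "labels": ["table", ...]}.
    """
    grouped: Dict[str, List[Box]] = {}
    for bbox, label in zip(od_result["bboxes"], od_result["labels"]):
        x1, y1, x2, y2 = bbox
        # Round to whole pixels and keep every coordinate inside the page.
        left = max(0, min(page_width, int(round(x1))))
        top = max(0, min(page_height, int(round(y1))))
        right = max(0, min(page_width, int(round(x2))))
        bottom = max(0, min(page_height, int(round(y2))))
        # Skip degenerate boxes that have no area after clamping.
        if right <= left or bottom <= top:
            continue
        grouped.setdefault(label, []).append((left, top, right, bottom))
    return grouped


# Hypothetical detections for a 612 x 792 pixel page image.
example = {
    "bboxes": [[34.2, 50.7, 580.0, 300.1], [40.0, 320.0, 700.0, 900.0]],
    "labels": ["table", "figure"],
}
print(group_detections(example, page_width=612, page_height=792))
```

Each resulting tuple can be passed directly to an image library's crop call. With a caption-including variant the boxes would be expected to cover the caption as well, so no separate caption handling is sketched here.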

How to Use

Clone the TF-ID GitHub repository locally.

Download and prepare the required datasets and annotation files.

Place the annotation files and image files in the directories the repository specifies.

Use the provided scripts to convert the dataset into the required format.

Launch model training with Hugging Face Accelerate.

After training is complete, use the trained checkpoint for model inference.
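The dataset conversion step above can be sketched as follows. Florence-2 expresses boxes as location tokens (`<loc_0>` through `<loc_999>`) appended to the label, so a COCO-style box `[x, y, width, height]` has to be turned into corner coordinates and quantized. This is a minimal sketch that assumes floor quantization into 1000 bins. The helper name `coco_box_to_florence` is hypothetical, and the conversion script shipped in the TF-ID repository may differ in its details.

```python
from typing import Sequence


def coco_box_to_florence(
    label: str,
    bbox: Sequence[float],
    image_width: int,
    image_height: int,
    bins: int = 1000,
) -> str:
    """Convert one COCO box [x, y, w, h] to 'label<loc_x1><loc_y1><loc_x2><loc_y2>'."""
    x, y, w, h = bbox
    # Pair each corner coordinate with the image dimension it is measured against.
    corners = (
        (x, image_width),
        (y, image_height),
        (x + w, image_width),
        (y + h, image_height),
    )
    tokens = []
    for value, size in corners:
        # Assumed scheme: scale to [0, bins) and floor, then clamp to a valid bin.
        index = int(value * bins / size)
        index = max(0, min(bins - 1, index))
        tokens.append(f"<loc_{index}>")
    return label + "".join(tokens)


# Hypothetical annotation on a 1000 x 2000 pixel page image.
print(coco_box_to_florence("table", [100, 200, 300, 400], 1000, 2000))
# prints "table<loc_100><loc_100><loc_400><loc_300>"
```

Quantizing relative to the image size keeps the targets independent of page resolution, which is why the same annotation works for images rendered at different DPI.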
