Pandrator : An open-source GUI audiobook and voiceover generator.

Pandrator

AI speech synthesis AI text-to-speech #Text to Speech #Voice Cloning #Audio Editing #GUI #Open Source Standard Picks Open Source

Overview :

Pandrator is a tool based on open-source software that converts text, PDF, EPUB, and SRT files into voice audio in multiple languages. It includes features for voice cloning, LLM-based text preprocessing, and directly saving generated audio subtitles into video files, blending them with original audio tracks. It is designed for ease of use and installation, featuring a one-click installer and a graphical user interface.

Target Users :

Pandrator is ideal for users who need to convert text to speech, particularly those looking to create audiobooks or add voiceovers to videos. It is especially suited for tech enthusiasts and developers who can leverage its open-source features for customization and expansion.

Total Visits： 474.6M

Top Region： US(19.34%)

Website Views ： 81.4K

Use Cases

Convert a novel into an audiobook using Pandrator.

Add multilingual voiceovers to video projects.

Utilize voice cloning technology to generate audio in specific voices.

Features

Text preprocessing: Segments text into sentences while preserving paragraphs.

LLM text preprocessing: Uses a local LLM for text correction and enhancement.

Audio generation: Converts the processed text into speech, supporting voice cloning and quality enhancement.

Audio evaluation: Predicts the mean opinion score (MOS) for generated sentences.

Generate and add voiceovers to video files: Synchronizes audio from subtitle files with SRT timestamps.

Session management: Supports creating, deleting, and loading sessions to organize workflows.

Graphical user interface: Built using customtkinter, providing a user-friendly experience.

How to Use

Download and install Pandrator.

Run Pandrator and select text or files as input.

Choose the desired voice and language settings.

Perform text preprocessing and LLM preprocessing if needed.

Start generating audio and adjust settings as necessary.

Use the GUI to play, edit, or delete generated sentences.

Save the output audio file or incorporate it into a video file.

Featured AI Tools

Chattts

ChatTTS is an open-source text-to-speech (TTS) model that allows users to convert text into speech. This model is primarily aimed at academic research and educational purposes and is not suitable for commercial or legal applications. It utilizes deep learning techniques to generate natural and fluent speech output, making it suitable for individuals involved in speech synthesis research and development.

AI speech synthesis

1.4M

Openai TTS

OpenAI TTS offers a text-to-speech API based on their TTS models. It features 6 built-in voices, which can be used to read blog posts, generate speech audio in multiple languages, and stream real-time audio output. Users can generate audio files by controlling the model name, text, and voice selection, and it supports various audio output formats.

AI text-to-speech

882.9K

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

Direct Visits	51.61%	External Links	33.46%	Email	0.04%
Organic Search	12.58%	Social Media	2.19%	Display Ads	0.11%

Monthly Visits	4.92m
Average Visit Duration	393.01
Pages Per Visit	6.11
Bounce Rate	36.20%

Monthly Visits	4.92m
United States	19.34%
China	13.25%
India	9.32%
Russia	4.28%
Germany	3.63%