VoiceCraft
Overview:
VoiceCraft is a token-infilling neural codec language model that achieves state-of-the-art performance in speech editing and zero-shot text-to-speech (TTS). For an unseen voice, VoiceCraft needs only a few seconds of reference audio to clone the voice or edit a recording. The model works on in-the-wild data such as audiobooks, online videos, and podcasts.
Target Users:
Creators who generate and edit voice content for audiobooks, online videos, podcasts, and more.
Total Visits: 1.8K
Top Region: US (75.65%)
Website Views: 141.6K
Use Cases
Use VoiceCraft to generate natural-sounding voices for audiobooks or podcast episodes.
Edit existing recordings to modify content or change the speaker's voice.
Clone a speaker's voice from a few seconds of sample audio to generate customized voice content.
Features
Voice Editing
Zero-Shot Text-to-Speech
Clone Unseen Voices
Edit Recordings