kokoro-onnx
K
Kokoro Onnx
Overview :
kokoro-onnx is a text-to-speech (TTS) project based on the Kokoro model and ONNX runtime. It supports English and plans to support French, Japanese, Korean, and Chinese. The model offers near real-time performance on macOS M1 and provides a variety of voice options, including whispering. The model is lightweight, approximately 300MB (around 80MB when quantized). This project is open-source on GitHub under the MIT license, facilitating easy integration and use for developers.
Target Users :
The primary target audience is developers and researchers who can utilize this model to add text-to-speech capabilities to applications or engage in research and development related to speech synthesis. Its open-source nature and lightweight characteristics make it suitable for developers who wish to integrate high-quality TTS functionality into their projects without building a model from scratch.
Total Visits: 474.6M
Top Region: US(19.34%)
Website Views : 144.1K
Use Cases
Add voice prompt functionality to mobile applications
Integrate into smart assistant devices for natural language interactions
Conduct research on speech synthesis to explore new voice generation technologies
Features
Supports English (soon to support French, Japanese, Korean, and Chinese)
Offers near real-time performance on macOS M1
Provides a variety of voice options, including whispering
Lightweight model, approximately 300MB (around 80MB when quantized)
Based on ONNX runtime, easy to deploy and integrate
Includes example scripts for quick user onboarding
How to Use
1. Install uv (recommended) or use a regular Python environment
2. Create a new project folder and run 'uv init -p 3.12' to initialize the project within it
3. Use 'uv add' to add the kokoro-onnx and soundfile dependencies
4. Copy the contents of examples/save.py into hello.py
5. Download kokoro-v0_19.onnx and voices.json files and place them in the project directory
6. Run 'uv run hello.py' to generate the audio file
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase