

Whisper Turbo
Overview:
Whisper Turbo aims to be an alternative to the OpenAI Whisper API. It consists of three parts: a compatibility layer that converts audio files in various formats into a Whisper-compatible format; a developer-friendly API that supports both batch and streaming inference; and Rumble, a Rust + WebGPU inference framework designed for fast cross-platform inference.
Target Users:
Developers who need to convert speech to text
Use Cases
Convert speech to text
Stream speech recognition
Utilize GPU acceleration for Whisper
Features
Audio Format Conversion
Compatibility with OpenAI Whisper API
GPU Acceleration
Streaming and Batch Inference
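To make the audio format conversion feature concrete, here is a minimal sketch of what such a compatibility step might look like. It is not Whisper Turbo's code: the function name `wav_to_whisper_samples` is hypothetical, the sketch handles only 16-bit PCM WAV input, and it assumes the usual Whisper input format of 16 kHz mono floating-point samples. The resampling is naive linear interpolation with no anti-aliasing filter, so it is for illustration only.

```python
import array
import io
import sys
import wave

TARGET_RATE = 16_000  # assumption: Whisper models take 16 kHz mono audio


def wav_to_whisper_samples(wav_bytes: bytes) -> list[float]:
    """Decode 16-bit PCM WAV bytes into 16 kHz mono floats in [-1.0, 1.0)."""
    with wave.open(io.BytesIO(wav_bytes), "rb") as wav:
        if wav.getsampwidth() != 2:
            raise ValueError("this sketch only handles 16-bit PCM")
        channels = wav.getnchannels()
        rate = wav.getframerate()
        raw = wav.readframes(wav.getnframes())

    # WAV stores samples little-endian; swap on big-endian hosts.
    pcm = array.array("h")
    pcm.frombytes(raw)
    if sys.byteorder == "big":
        pcm.byteswap()

    # Downmix to mono by averaging the channels of each frame,
    # then scale from int16 to the [-1.0, 1.0) float range.
    frames = len(pcm) // channels
    mono = [
        sum(pcm[i * channels:(i + 1) * channels]) / (channels * 32768.0)
        for i in range(frames)
    ]
    if rate == TARGET_RATE or not mono:
        return mono

    # Naive linear-interpolation resampling to the target rate.
    out_len = int(len(mono) * TARGET_RATE / rate)
    out = []
    for n in range(out_len):
        pos = n * rate / TARGET_RATE
        i = int(pos)
        frac = pos - i
        nxt = mono[min(i + 1, len(mono) - 1)]
        out.append(mono[i] * (1.0 - frac) + nxt * frac)
    return out
```

A real compatibility layer would also need to decode compressed formats such as MP3 or Opus and use a properly filtered resampler; those steps are left out here to keep the sketch dependency-free.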
Featured AI Tools

OpenVoice
OpenVoice is an open-source voice cloning technology capable of accurately replicating reference voices and generating speech in various languages and accents. It offers flexible control over voice characteristics such as emotion and accent, and can adjust rhythm, pauses, and intonation. It achieves zero-shot cross-lingual voice cloning, meaning that neither the language of the generated speech nor that of the reference voice needs to be present in the training data.
AI speech recognition
2.4M

Azure AI Studio Speech Services
Azure AI Studio is a suite of artificial intelligence services from Microsoft Azure that includes speech services. These services may include speech recognition, text-to-speech, and speech translation, enabling developers to add voice capabilities to their applications.
AI speech recognition
271.3K