Whisper-Input
Overview:
Whisper Input is a desktop tool written in Python for fast voice-to-text input. Recording is controlled by holding down a key, and transcription uses either the Groq Whisper Large V3 Turbo model or the FunAudioLLM/SenseVoiceSmall model. Its main advantages are transcription speed, accuracy, and multilingual support. It suits users who record speech often and need the text quickly. The tool is currently free to use.
Target Users:
Users who need efficient voice input, such as office workers, students, and content creators. It is especially useful for quickly capturing ideas, taking meeting notes, and drafting text.
Use Cases
Capture key points during meetings by voice instead of typing.
Students dictate lecture notes and convert them into text for later review.
Content creators draft articles or scripts using voice input.
Features
Records while the Option or Alt key is held down and stops when the key is released.
Enables multilingual voice transcription, converting various languages into text.
Supports Chinese-to-English translation, catering to bilingual input needs.
Utilizes high-performance speech transcription models provided by Groq or SiliconFlow, ensuring rapid conversion.
Adds punctuation automatically, so the output reads as complete sentences without manual formatting.
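The hold-to-record behavior in the first feature can be pictured as a small state machine. The sketch below is illustrative only and is not taken from the project's source: the class `PushToTalkRecorder` and its method names are hypothetical, and real key and microphone events would come from whatever input libraries the project uses.

```python
class PushToTalkRecorder:
    """Hypothetical hold-to-record state machine (not the project's code)."""

    def __init__(self):
        self.recording = False
        self.chunks = []

    def on_key_press(self):
        # Keyboard auto-repeat can fire press events while the key is
        # already held; those must not restart the current take.
        if not self.recording:
            self.recording = True
            self.chunks = []

    def on_audio(self, chunk):
        # Audio that arrives while the key is up is discarded.
        if self.recording:
            self.chunks.append(chunk)

    def on_key_release(self):
        # Releasing the key ends the take and returns the captured audio,
        # which a real tool would then send to the transcription service.
        if not self.recording:
            return None
        self.recording = False
        return b"".join(self.chunks)


recorder = PushToTalkRecorder()
recorder.on_key_press()
recorder.on_audio(b"hello ")
recorder.on_audio(b"world")
audio = recorder.on_key_release()
```

Keeping the state in one place makes the two edge cases explicit: repeated press events during a hold, and a release with no matching press.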
How to Use
1. Ensure you have a local Python environment, version 3.10 or higher.
2. Register for a Groq or SiliconFlow account to obtain a free API key.
3. Clone the project locally: `git clone git@github.com:ErlichLiu/Whisper-Input.git`.
4. Create and activate a virtual environment: `python -m venv venv`, then run `source venv/bin/activate` (macOS/Linux) or `.\venv\Scripts\activate` (Windows).
5. Install dependencies: `pip install pip-tools`, then run `pip-compile requirements.in` and `pip install -r requirements.txt`.
6. Configure the `.env` file with your API key and related settings.
7. Run the program with `python main.py`, then hold the record key to convert speech to text.
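Two of the steps above are easy to get wrong: the Python version (step 1) and the `.env` contents (step 6). A small preflight check along the following lines could catch both before launch. This is a sketch under assumptions: the variable names `GROQ_API_KEY` and `SILICONFLOW_API_KEY` are guesses, not confirmed by the source, so check the project's own `.env` template for the real names. The parser handles only plain `KEY=VALUE` lines.

```python
import sys

MIN_PYTHON = (3, 10)  # minimum version stated in step 1


def parse_env(text):
    """Parse simple KEY=VALUE lines, skipping blank lines and # comments."""
    values = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        # Drop surrounding whitespace and optional quotes around the value.
        values[key.strip()] = value.strip().strip("\"'")
    return values


def preflight(env_text, key_names, version=None):
    """Return a list of human-readable problems; an empty list means OK."""
    if version is None:
        version = sys.version_info[:2]
    problems = []
    if tuple(version) < MIN_PYTHON:
        problems.append("Python 3.10 or higher is required")
    env = parse_env(env_text)
    # One non-empty key from the candidate names is enough.
    if not any(env.get(name) for name in key_names):
        problems.append("no API key found in .env")
    return problems


# Hypothetical variable names; the project's .env may use different ones.
KEY_NAMES = ["GROQ_API_KEY", "SILICONFLOW_API_KEY"]
sample = "# sample .env\nGROQ_API_KEY=your-key-here\n"
issues = preflight(sample, KEY_NAMES)
```

In practice the text would come from reading the `.env` file in the project directory, and any returned problems would be printed before `python main.py` is run.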