Desktop voice-to-text application with AI transcription. Press a hotkey, speak, and the text appears in any app.
- Push-to-talk dictation — hold hotkey and speak
- 3 transcription providers — OpenAI Whisper, Groq (free), ElevenLabs (realtime streaming)
- Auto-translation — automatic translation to target language after transcription
- Custom instructions — post-process text via LLM (summarize, format, etc.)
- Custom vocabulary — auto-correct technical terms (Groq, Tauri, WebSocket, etc.)
- Transcription history — browse, copy, and manage past transcriptions
- Typing simulation — text is typed into active window as if from keyboard
- Background operation — system tray, auto-start, global hotkeys
- Auto-updates — automatic updates from GitHub releases
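The custom vocabulary feature listed above can be pictured as a post-processing pass over the transcript. The following is only a minimal sketch, not the application's actual code: `apply_vocabulary` and `flush_word` are hypothetical names, and it assumes single-word terms that are matched case-insensitively and rewritten to their canonical spelling.

```rust
/// Rewrite every whole word that matches a vocabulary term
/// (ignoring case) to the term's canonical spelling.
/// Hypothetical helper; the real application may work differently.
fn apply_vocabulary(text: &str, vocabulary: &[&str]) -> String {
    let mut out = String::with_capacity(text.len());
    let mut word = String::new();
    for ch in text.chars() {
        if ch.is_alphanumeric() {
            // Still inside a word: keep collecting characters.
            word.push(ch);
        } else {
            // Word boundary: emit the (possibly corrected) word, then the separator.
            flush_word(&mut word, vocabulary, &mut out);
            out.push(ch);
        }
    }
    // Emit the trailing word, if the text does not end with a separator.
    flush_word(&mut word, vocabulary, &mut out);
    out
}

/// Append `word` to `out`, replacing it with the canonical vocabulary
/// spelling when one matches, and reset `word` for the next token.
fn flush_word(word: &mut String, vocabulary: &[&str], out: &mut String) {
    if word.is_empty() {
        return;
    }
    let lower = word.to_lowercase();
    match vocabulary.iter().find(|term| term.to_lowercase() == lower) {
        Some(term) => out.push_str(term),
        None => out.push_str(word.as_str()),
    }
    word.clear();
}
```

Calling `apply_vocabulary("i use groq and tauri", &["Groq", "Tauri"])` would restore the capitalization of both names. Multi-word terms and phonetic near-misses are outside the scope of this sketch.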
Download the latest version for your OS:
- Windows: `.msi` or `.exe` installer
- macOS: `.dmg` (Intel and Apple Silicon)
- Linux: `.deb` or `.AppImage`
git clone https://github.com/RuKapSan/easy-dictate.git
cd easy-dictate/src-tauri
cargo tauri build

Requirements: Rust 1.77+, Node.js 18+
- Install the application
- Select a provider (Groq is free)
- Enter your API key
- Press the hotkey (default: `Ctrl+Shift+Space`)
- Speak, and the text will appear in the active window
| Hotkey | Action |
|---|---|
| Main | Record and transcribe |
| With translation | Record + force translation |
| Toggle translation | Toggle auto-translate on/off |
All hotkeys are configurable in Settings.
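To illustrate what a configurable hotkey string such as `Ctrl+Shift+Space` carries, here is a minimal parsing sketch. The `Hotkey` struct and `parse_hotkey` function are hypothetical and exist only for illustration. The application itself most likely delegates shortcut handling to Tauri, so this shows the idea rather than the actual implementation.

```rust
/// A parsed key combination: modifier flags plus one main key.
/// Hypothetical type, not taken from the application's source.
#[derive(Debug, Default, PartialEq, Eq)]
struct Hotkey {
    ctrl: bool,
    shift: bool,
    alt: bool,
    key: String,
}

/// Parse a `+`-separated combination such as "Ctrl+Shift+Space".
/// Modifier names are matched case-insensitively; exactly one
/// non-modifier key is required.
fn parse_hotkey(spec: &str) -> Result<Hotkey, String> {
    let mut hotkey = Hotkey::default();
    for part in spec.split('+').map(str::trim) {
        match part.to_ascii_lowercase().as_str() {
            "" => return Err(format!("empty segment in hotkey '{spec}'")),
            "ctrl" | "control" => hotkey.ctrl = true,
            "shift" => hotkey.shift = true,
            "alt" => hotkey.alt = true,
            // First non-modifier segment becomes the main key.
            _ if hotkey.key.is_empty() => hotkey.key = part.to_string(),
            // A second non-modifier segment is rejected.
            _ => return Err(format!("more than one non-modifier key in '{spec}'")),
        }
    }
    if hotkey.key.is_empty() {
        return Err(format!("hotkey '{spec}' has no non-modifier key"));
    }
    Ok(hotkey)
}
```

Rejecting combinations without a main key (for example `Ctrl+Shift`) is a design choice of this sketch. It keeps a bare modifier press from triggering a recording.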
| Provider | Speed | Price | Features |
|---|---|---|---|
| Groq | Fast | Free | Whisper Large v3 |
| OpenAI | Medium | Paid | GPT-4o Transcribe |
| ElevenLabs | Realtime | Paid | Text streaming during speech |
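The provider table above could be modelled in code roughly as follows. This is a hedged sketch: the `Provider` enum and its methods are hypothetical names, and the only facts encoded are the ones in the table (Groq is free, ElevenLabs streams text during speech).

```rust
/// The three transcription providers from the table.
/// Hypothetical type; the application's own representation may differ.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
enum Provider {
    Groq,
    OpenAi,
    ElevenLabs,
}

impl Provider {
    /// Look up a provider by name, ignoring case.
    fn from_name(name: &str) -> Option<Provider> {
        match name.trim().to_ascii_lowercase().as_str() {
            "groq" => Some(Provider::Groq),
            "openai" => Some(Provider::OpenAi),
            "elevenlabs" => Some(Provider::ElevenLabs),
            _ => None,
        }
    }

    /// According to the table, only Groq is free.
    fn is_free(self) -> bool {
        matches!(self, Provider::Groq)
    }

    /// According to the table, only ElevenLabs streams text while you speak.
    fn streams_realtime(self) -> bool {
        matches!(self, Provider::ElevenLabs)
    }
}
```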
- Tauri v2 + Rust (backend)
- HTML/CSS/JS (frontend, no frameworks)
- cpal (audio capture)
MIT

