Dictate.
Format.
Paste.
Hold a hotkey, speak, release. Your words are transcribed, cleaned up by AI, and pasted into whatever app you're using. Free forever.
Dictate in any context
Switch modes to format your voice for the task at hand.
I think we should refactor the auth middleware. It's messy right now, and we should swap to using JWT tokens instead of sessions.
No Subscription
Bring your own API key. Groq's free tier, OpenAI, or any OpenAI-compatible endpoint. You control the cost.
Private by Design
No audio stored on disk. API keys kept in your OS credential manager. Audio goes directly to your chosen API.
Lightweight
Built with Tauri and Rust. The entire app is around 10MB. No Electron, no bloat, launches instantly.
Fully Customizable
Create custom modes with your own system prompts. Add vocabulary entries for words the model gets wrong.
Get started in seconds
How Voxly compares
| Feature | Voxly | Wispr Flow | Superwhisper |
|---|---|---|---|
| Pricing | Free (BYOK) | $15/mo | $8.49/mo |
| Open Source | check_circle | cancel | cancel |
| Custom Modes | check_circle | check_circle | check_circle |
| Platforms | Windows, macOS, Linux | Mac, Windows | Mac only |
| API Provider | Any (BYOK) | Built-in only | Built-in only |
Report a Bug
Found something off? Open an issue on GitHub and help us improve.
Test on macOS / Linux
Currently tested on Windows. We need testers on other platforms.
Contribute
MIT licensed. Fork it, improve it, submit a PR. Contributions welcome.