Dictate.
Format.
Paste.
Hold a hotkey, speak, release. Your words are transcribed, cleaned up by AI, and pasted into whatever app you're using. Free forever.
Dictate in any context
Switch modes to format your voice for the task at hand.
I think we should refactor the auth middleware. It's messy right now, and we should swap to using JWT tokens instead of sessions.
No Subscription
Bring your own API key. Groq's free tier, OpenAI, or any OpenAI-compatible endpoint. You control the cost.
Private by Design
Dictation audio is never written to disk. Meeting recordings stay local until you click Transcribe. API keys live in your OS credential manager.
Lightweight
Built with Tauri and Rust. The entire app is around 10MB. No Electron, no bloat, launches instantly.
Fully Customizable
Create custom modes with your own system prompts. Add vocabulary entries for words the model gets wrong.
One hotkey. Full meeting capture.
Record screen, mic, and system audio as a local MP4. When done, click Transcribe — Voxly sends the audio to AssemblyAI and returns a speaker-labeled transcript with your voice and system audio on separate channels.
Record
Press Ctrl+Alt+M. Screen, mic, and system audio recorded locally. No cloud at this step.
Transcribe
Click Transcribe. Voxly extracts a dual-channel audio file — mic on channel 1, system audio on channel 2 — and uploads it to AssemblyAI for multichannel diarization.
Speaker-labeled transcript
You and System — not generic Speaker A/B. Mic bleed from system audio is filtered automatically. Click any line to seek the video.
Format
MP4
Channels
2-ch AAC
Provider
AssemblyAI
Design Review — Jun 3, 2026
Get started in seconds
How Voxly compares
| Feature | Voxly | Wispr Flow | Superwhisper |
|---|---|---|---|
| Pricing | Free (BYOK) | $15/mo | $8.49/mo |
| Open Source | check_circle | cancel | cancel |
| Custom Modes | check_circle | check_circle | check_circle |
| Platforms | Windows, macOS, Linux | Mac, Windows | Mac only |
| API Provider | Any (BYOK) | Built-in only | Built-in only |
| Meeting Transcripts | check_circle | cancel | cancel |
| Local Recording | check_circle | cancel | cancel |
Report a Bug
Found something off? Open an issue on GitHub and help us improve.
Test on macOS / Linux
Currently tested on Windows. We need testers on other platforms.
Contribute
MIT licensed. Fork it, improve it, submit a PR. Contributions welcome.