Voxly

Open Source

Dictate.
Format.
Paste.

Hold a hotkey, speak, release. Your words are transcribed, cleaned up by AI, and pasted into whatever app you're using. Free forever.

download Download Free star Star on GitHub

check Windows

check macOS

check Linux

Claude Code

1 $ Hold Ctrl+Space and speak:

2 "refactor the auth module to use jwt tokens instead of sessions and um make sure the middleware handles token refresh"

3 // Developer Mode output:

4 "Refactor the auth module to use JWT tokens

instead of sessions. Make sure the middleware

handles token refresh."

5 // Pasted into active app automatically

DEVELOPER MODE

Tauri v2

SolidJS

Rust

star Star

scale MIT LICENSE

Dictate in any context

Switch modes to format your voice for the task at hand.

Raw Input (Voice)

mic yeah so basically i think we should probably refactor the auth middleware cause its kinda messy right now and maybe swap to using jwt tokens instead of sessions

Processed Output

I think we should refactor the auth middleware. It's messy right now, and we should swap to using JWT tokens instead of sessions.

System Prompt (Editable) model: llama-3.3-70b-versatile

You are a precise text editor that cleans up voice-dictated text. Remove filler words and verbal hesitations. Fix grammar, punctuation, and capitalization. Preserve the speaker's original wording, vocabulary, and tone. Do not rephrase or "improve" their language.

savings

No Subscription

Bring your own API key. Groq's free tier, OpenAI, or any OpenAI-compatible endpoint. You control the cost.

security

Private by Design

Dictation audio is never written to disk. Meeting recordings stay local until you click Transcribe. API keys live in your OS credential manager.

bolt

Lightweight

Built with Tauri and Rust. The entire app is around 10MB. No Electron, no bloat, launches instantly.

tune

Fully Customizable

Create custom modes with your own system prompts. Add vocabulary entries for words the model gets wrong.

New in v1.20

One hotkey. Full meeting capture.

Record screen, mic, and system audio as a local MP4. When done, click Transcribe — Voxly sends the audio to Deepgram and returns a speaker-labeled transcript with your voice and remote speakers separated.

fiber_manual_record

Record

Press Ctrl+Alt+M. Screen, mic, and system audio recorded locally. No cloud at this step.

upload_file

Transcribe

Click Transcribe. Voxly extracts a dual-channel audio file — mic on channel 1, system audio on channel 2 — and sends it to Deepgram for multichannel diarization.

record_voice_over

Speaker-labeled transcript

You plus distinct remote speakers — not one flattened System label. Mic bleed from system audio is filtered automatically. Click any line to seek the video.

Format

MP4

Channels

2-ch AAC

Provider

Deepgram

Design Review — Jun 3, 2026

schedule48:22 hard_drive312 MB

Get started in seconds

Getting Started

1. Download the latest release from GitHub Releases

2. Run the app and open Settings

3. Add your API key — Groq (free) or OpenAI

4. Hold Ctrl+Space and speak

5. Release — clean text appears in your active app ✓

How Voxly compares

Feature	Voxly	Wispr Flow	Superwhisper
Pricing	Free (BYOK)	$15/mo	$8.49/mo
Open Source	check_circle	cancel	cancel
Custom Modes	check_circle	check_circle	check_circle
Platforms	Windows, macOS, Linux	Mac, Windows	Mac only
API Provider	Any (BYOK)	Built-in only	Built-in only
Meeting Transcripts	check_circle	cancel	cancel
Local Recording	check_circle	cancel	cancel

bug_report arrow_forward

Dictate.
Format.
Paste.

Dictate in any context

No Subscription

Private by Design

Lightweight

Fully Customizable

One hotkey. Full meeting capture.

Get started in seconds

How Voxly compares

Report a Bug

Test on macOS / Linux

Contribute

Dictate. Format. Paste.

Dictate in any context

No Subscription

Private by Design

Lightweight

Fully Customizable

One hotkey. Full meeting capture.

Get started in seconds

How Voxly compares

Report a Bug

Test on macOS / Linux

Contribute

Dictate.
Format.
Paste.