🎙️ Clone Any Voice for Free — This Local App Replaces ElevenLabs

Shayla · February 21, 2026, 10:23am

3 Seconds of Audio → Perfect Voice Clone — Free Desktop App, No Subscription

Think ElevenLabs but it runs on your laptop, costs nothing, and your voice data never touches the internet.

Someone built a local ElevenLabs. Record 3 seconds of anyone’s voice, and Voicebox creates a clone that speaks any text you feed it — natural emotion, real cadence, not robotic TTS garbage.

ElevenLabs charges $22-99/month and keeps your voice data on their servers. A professional voice actor charges $250-500 per finished minute. Voicebox is one download and $0 forever.

🧠 How It Works — The 60-Second Version

Think of it like a voice photocopier. You feed it a short audio clip of someone talking — 3 to 10 seconds is enough. The AI model (Qwen3-TTS by Alibaba, same class as the paid services) learns the voice’s fingerprint: tone, rhythm, accent, emotion. Then you type any text and it speaks it back in that voice.

The model downloads once (~2-4GB). After that — no internet needed. Everything runs on your hardware.

What Happens	Where It Happens
Voice sample analyzed	Your machine
Voice profile created	Your machine
Speech generated	Your machine
Data sent to cloud	Nowhere. Ever.

⚡ What You Get — Not Just a TTS Toy

Feature	Details
Voice cloning	3-10 seconds of audio → near-perfect clone
DAW-style timeline	Multi-track editor — drag clips, layer voices, mix conversations
Multi-voice projects	Build entire podcasts with different cloned voices
Transcription	Built-in Whisper — auto-transcribes your audio
In-app recording	Record voice samples directly, no external tools needed
Model sizes	1.7B (better quality) or 0.6B (faster, lighter)
Languages	English, Chinese, and more coming

This isn’t a command-line script for nerds. It’s a full production app with a proper UI.

💻 Download & Setup — Pick Your OS

Download: github.com/jamiepine/voicebox

Platform	GPU Requirement	Speed
macOS (M1/M2/M3/M4)	None — native Metal acceleration via MLX	Near real-time, 4-5x faster
Windows	NVIDIA GPU (CUDA)	Fast with decent GPU
Linux	Coming soon	Blocked by build infra

Step 1 — Download the installer from the GitHub releases page.

Step 2 — Launch → it auto-downloads the Qwen3-TTS model on first run.

Step 3 — Record or upload a voice sample (3+ seconds).

Step 4 — Type your text → hit generate → done.

Mac users win here — Apple Silicon gets native Neural Engine acceleration. Generation is near real-time.

💰 What This Replaces

Service	Cost	Your Data
ElevenLabs	$22-99/month	Stored on their servers
Professional voice actor	$250-500/finished minute	N/A
Play.ht / Murf	$29-99/month	Cloud-processed
Voicebox	$0 forever	Never leaves your machine

Quick Hits

Want	Do
Clone a voice	→ Upload 3-10 sec audio clip → instant profile
Build a podcast	→ Create multiple voice profiles → arrange on timeline
Keep voice data private	→ Already done — nothing ever leaves your laptop
Use offline	→ Model downloads once, works without internet forever

Your laptop is now a voice studio. Nobody asked permission and nobody’s charging rent.

DukeForever · February 21, 2026, 3:37pm

very slow generation

only works with GPU possibly with 10+ VRAM !!

KOUSTHUBH_lanka · February 21, 2026, 3:43pm

yea very slow , its taking too long to download the qwen model while generating

Thesharingmaster · February 21, 2026, 3:45pm

Only works reasonably on a 4090 or 5090. Luckily, I have both!!

Asher · February 21, 2026, 4:49pm

I’ve been trying the recently released MioTTS. It’s better, faster, runs on much lower VRAM, and only needs 5 seconds of audio recording to clone someone’s voice.

Topic		Replies	Views
🎙️ Build a Free Voice-Cloning Monster — One Offline Pipeline, 27 Repos Tutorials & Methods programming , tips-tricks , ai	0	449	May 30, 2026
⚡ ElevenLabs Killer Just Dropped — 100% Free & Open Source Tools & Scripts programming , freebies	3	961	January 27, 2026
Make Voices with Meta Audiobox (Without Losing Your Mind) Tutorials & Methods tools , audio , ai	0	333	July 11, 2025
🎧 Turn Any PDF Into a Podcast (Free Local Tools) Tools & Scripts tools , programming , freebies	2	414	January 24, 2026
Top Free AI Voice Tools That Sound Incredibly Human :star: Tools & Scripts tools , audio , ai	2	722	July 5, 2025

🎙️ Clone Any Voice for Free — This Local App Replaces ElevenLabs

3 Seconds of Audio → Perfect Voice Clone — Free Desktop App, No Subscription

Related topics