Vocal Remover

Remove vocals from any song using AI. Runs on your device — files never uploaded.

Supported formats: MP3, WAV, FLAC, OGG, M4A. Maximum length: 15 minutes.

Fast mode: faster processing and a smaller download. First use downloads a 14 MB AI model.

How to Remove Vocals

  1. Upload a song (MP3, WAV, FLAC, OGG)
  2. AI separates vocals, drums, bass, and instruments
  3. Download individual stems or play them back

Frequently Asked Questions

What's the difference between Fast and Quality mode?
Fast mode uses a 14 MB model optimized for speed, typically taking 15 to 30 seconds for a 4-minute song. Quality mode uses a larger 32 MB model trained on more diverse material; it is better at preserving bass clarity and handling reverb tails. Choose Fast for quick previews and Quality for final production use.
How does vocal removal actually work?
A neural network trained on thousands of professional multitracks learns to separate the frequency patterns of vocals from instruments. It processes the audio spectrogram in overlapping chunks and reconstructs separate vocal and instrumental waveforms.
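
The "overlapping chunks" step can be illustrated with a small overlap-add sketch. This is a simplified illustration and not the tool's actual pipeline: it works on raw samples rather than a spectrogram, the `separate` callback is a hypothetical stand-in for the neural network, and the chunk size, 50% overlap, and Hann window are assumptions.

```typescript
// Hypothetical stand-in for the model: one chunk of samples in, one chunk out.
type ChunkFn = (chunk: Float32Array) => Float32Array;

// Periodic Hann window. With a hop of size / 2, overlapping windows sum to 1,
// so chunk boundaries are cross-faded instead of producing clicks.
function hann(size: number): Float32Array {
  const w = new Float32Array(size);
  for (let i = 0; i < size; i++) {
    w[i] = 0.5 - 0.5 * Math.cos((2 * Math.PI * i) / size);
  }
  return w;
}

// Process `input` in 50%-overlapping chunks and stitch the results together.
function processInChunks(
  input: Float32Array,
  size: number, // assumed chunk length in samples; must be even
  separate: ChunkFn,
): Float32Array {
  if (size <= 0 || size % 2 !== 0) {
    throw new Error("size must be a positive even number");
  }
  const hop = size / 2;
  const win = hann(size);
  // Zero-pad so that every real sample is covered by exactly two windows.
  const padded = new Float32Array(input.length + 2 * size);
  padded.set(input, hop);
  const out = new Float32Array(padded.length);
  for (let start = 0; start + size <= padded.length; start += hop) {
    const result = separate(padded.slice(start, start + size));
    for (let i = 0; i < size; i++) {
      out[start + i] += result[i] * win[i];
    }
  }
  // Drop the padding again so the output has the same length as the input.
  return out.slice(hop, hop + input.length);
}
```

Passing an identity function as `separate` returns the input unchanged (up to floating-point rounding), which is a quick way to check that the chunking itself adds no artifacts.
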
Does it use my GPU?
When WebGPU is available (most modern desktop devices), inference runs on your GPU — significantly faster than CPU-only processing. On older devices, it falls back to multi-threaded WASM automatically. No configuration needed.
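
A fallback of this kind might be sketched as follows. The `Backend` names, the `Capabilities` shape, and both functions are hypothetical; only `navigator.gpu`, `SharedArrayBuffer`, and `crossOriginIsolated` are standard browser features. The single-threaded `"wasm"` branch is an added assumption for browsers without shared memory and is not described above.

```typescript
type Backend = "webgpu" | "wasm-threads" | "wasm";

interface Capabilities {
  webgpu: boolean; // navigator.gpu is exposed
  sharedMemory: boolean; // SharedArrayBuffer is usable (needed for WASM threads)
}

// Probe the current environment. Safe to call outside a browser.
function detectCapabilities(): Capabilities {
  const g = globalThis as any;
  const nav = g.navigator;
  return {
    // Presence of navigator.gpu does not guarantee a usable adapter;
    // a real app would also await navigator.gpu.requestAdapter() and check for null.
    webgpu: typeof nav === "object" && nav !== null && "gpu" in nav,
    // Browsers only expose shared memory on cross-origin isolated pages.
    sharedMemory:
      typeof g.SharedArrayBuffer !== "undefined" && g.crossOriginIsolated === true,
  };
}

// Prefer the GPU, then multi-threaded WASM, then single-threaded WASM.
function chooseBackend(caps: Capabilities): Backend {
  if (caps.webgpu) return "webgpu";
  if (caps.sharedMemory) return "wasm-threads";
  return "wasm";
}
```

Keeping detection separate from the decision makes the decision logic easy to test without a browser, for example `chooseBackend(detectCapabilities())` in the page and `chooseBackend({ webgpu: false, sharedMemory: true })` in a unit test.
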
Will it remove backing vocals too?
The model treats lead and backing vocals alike and separates both from the instrumental. It cannot reliably keep harmonies while removing only the lead vocal, so results on such tracks may vary.
Does it work on podcasts or speech?
Yes, though it's optimized for music. For spoken word with background music, it effectively separates the voice from the accompaniment. For speech with only ambient noise, a dedicated noise removal tool may produce cleaner results.
What about processing on mobile?
On mobile devices without sufficient GPU memory, processing automatically routes to Cloud Assist — secure server-side processing where files are deleted immediately after completion. Desktop users process entirely on-device.
What output formats are available?
Both the isolated vocal track and instrumental track are exported as lossless WAV at the original sample rate. The files maintain the same duration and can be loaded directly into a DAW.
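
For readers curious what such an export involves, here is a minimal sketch of writing separated samples to an uncompressed PCM WAV file at the source sample rate. The function name is hypothetical, and the 16-bit depth is an assumption: the bit depth the tool actually writes is not stated above.

```typescript
// Encode per-channel float samples (range -1..1) as a 16-bit PCM WAV file.
function encodeWav(channels: Float32Array[], sampleRate: number): ArrayBuffer {
  if (channels.length === 0) throw new Error("at least one channel is required");
  const numChannels = channels.length;
  const numFrames = channels[0].length;
  const blockAlign = numChannels * 2; // bytes per frame at 16 bits per sample
  const dataSize = numFrames * blockAlign;
  const buffer = new ArrayBuffer(44 + dataSize);
  const view = new DataView(buffer);
  const writeTag = (offset: number, tag: string): void => {
    for (let i = 0; i < tag.length; i++) view.setUint8(offset + i, tag.charCodeAt(i));
  };

  writeTag(0, "RIFF");
  view.setUint32(4, 36 + dataSize, true); // file size minus the first 8 bytes
  writeTag(8, "WAVE");
  writeTag(12, "fmt ");
  view.setUint32(16, 16, true); // size of the fmt chunk
  view.setUint16(20, 1, true); // format 1 = uncompressed PCM
  view.setUint16(22, numChannels, true);
  view.setUint32(24, sampleRate, true); // keep the source sample rate
  view.setUint32(28, sampleRate * blockAlign, true); // bytes per second
  view.setUint16(32, blockAlign, true);
  view.setUint16(34, 16, true); // bits per sample
  writeTag(36, "data");
  view.setUint32(40, dataSize, true);

  // Interleave channels frame by frame, clamping and scaling to int16.
  let offset = 44;
  for (let f = 0; f < numFrames; f++) {
    for (let c = 0; c < numChannels; c++) {
      const s = Math.max(-1, Math.min(1, channels[c][f]));
      view.setInt16(offset, Math.round(s < 0 ? s * 32768 : s * 32767), true);
      offset += 2;
    }
  }
  return buffer;
}
```

In a browser the result could be offered for download with `new Blob([encodeWav(channels, sampleRate)], { type: "audio/wav" })`. Because the frame count and sample rate are carried over unchanged, the exported file has the same duration as the input.
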

Unlock Cloud Assist

Offload heavy processing to secure private GPUs. Free account, 30 min/day.