Voxta docs

XTTS (Coqui)

Multilingual local TTS with voice-cloning support.

XTTS (Coqui) is an open-source local TTS engine. Strong on multilingual output and supports voice cloning from short reference clips.

Setup

Add the service

Manage Services → + Add Services → XTTS (Coqui) → Add. Voxta installs the Python runtime and model weights automatically on first use.

Pick or clone a voice

In the XTTS config, pick from built-in voices or upload a reference clip to clone a custom voice. Per-character voice overrides are set in Studio.

On this page