XTTS (Coqui)
Multilingual local TTS with voice-cloning support.
XTTS (Coqui) is an open-source TTS engine that runs locally. It is strong on multilingual output and supports voice cloning from short reference clips.
Setup
Add the service
Manage Services → + Add Services → XTTS (Coqui) → Add. Voxta installs the Python runtime and model weights automatically on first use.
Pick or clone a voice
In the XTTS config, pick from built-in voices or upload a reference clip to clone a custom voice. Per-character voice overrides are set in Studio.
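Before uploading a reference clip, it can help to confirm that the file really is short. The sketch below is not part of Voxta or XTTS; it is a minimal standalone check using Python's standard `wave` module. It assumes the clip is an uncompressed WAV file, and the function names and the 3 to 30 second bounds are hypothetical placeholders rather than documented limits, so adjust them to whatever works for your voices.

```python
import wave

# Hypothetical bounds for a "short reference clip".
# These are illustrative assumptions, not limits documented by Voxta or XTTS.
MIN_SECONDS = 3.0
MAX_SECONDS = 30.0


def clip_info(path):
    """Return (duration_seconds, sample_rate, channels) for a WAV file."""
    with wave.open(path, "rb") as wav:
        frames = wav.getnframes()
        rate = wav.getframerate()
        channels = wav.getnchannels()
    return frames / rate, rate, channels


def check_reference_clip(path, min_seconds=MIN_SECONDS, max_seconds=MAX_SECONDS):
    """Return a list of warnings; an empty list means the duration looks fine."""
    duration, _rate, _channels = clip_info(path)
    warnings = []
    if duration < min_seconds:
        warnings.append(f"clip is {duration:.1f}s, shorter than {min_seconds:.1f}s")
    if duration > max_seconds:
        warnings.append(f"clip is {duration:.1f}s, longer than {max_seconds:.1f}s")
    return warnings
```

Calling `check_reference_clip("my_voice.wav")` (a hypothetical file name) returns an empty list when the duration falls inside the chosen bounds, and one warning string per problem otherwise.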