Voxta docs

Setup wizard

First-run installer that gets your essential services configured in a few clicks.

The wizard is Voxta's friendly on-ramp. On first launch it walks you through picking the four essential service types so you don't have to learn the Manage Services UI before having a first conversation.

The four service categories

The wizard prompts you to pick at least one service for each of these:

CategoryWhat it doesExamples
Text GenerationGenerates the character's text replies (the "thinking" part).OpenAI, Anthropic, Voxta Cloud, llama.cpp, ExLlamaV2
Text-to-SpeechSynthesizes the character's voice.ElevenLabs, Coqui XTTS, Kokoro, UnrealSpeech, Cartesia
Speech-to-TextTranscribes your microphone input.Deepgram, Vosk, WhisperLive, Azure Speech
Action Inference & SummarizationSpecialized LLM calls for action selection and chat summarization.Often the same LLM as Text Generation, can be a separate cheaper model.

Local vs cloud picks

The wizard mixes local-only options (free, but require a competent GPU) with cloud options (paid, no GPU needed). Hover any service in the picker to see its requirements.

Easiest path — pick Voxta Cloud for all four. One API key, everything works, monthly Patreon credits cover it. See Voxta Cloud → Getting Started.

Skipping the wizard

The wizard is optional. Click skip / close on first launch and configure services manually via Manage Services at your own pace.

You can also re-run the wizard later if you reinstall Voxta or want to swap stacks.

What's next

On this page