- List voices — browse TTS voices across all configured providers
- List models — discover TTS, STT, and STS models
- Create voice session — establish a real-time voice session over LiveKit WebRTC
- List providers — see which voice providers are enabled for your organization
List voices
Browse all TTS voices available across providers.
Create voice session
Start a real-time voice call powered by LiveKit.
List voice models
Discover TTS, STT, and STS models for agent configuration.
List providers
View enabled voice providers for your organization.
Key concepts
- TTS (Text-to-Speech) — converts the agent’s text reply into audio streamed to the caller.
- STT (Speech-to-Text) — converts caller speech into text that the LLM processes.
- STS (Speech-to-Speech) — end-to-end speech model that bypasses the text layer entirely (lower latency, less control).
- Voice pipeline — the combination of STT → LLM → TTS that processes a voice turn.
voicePipelineModeon agents:pipeline— classic STT + LLM + TTSsts— direct speech-to-speechhybrid— adaptive mode that switches based on conditions
Connecting to a voice session
After callingPOST /v1/voice/sessions, you receive a token and serverUrl for LiveKit. Use the LiveKit SDK to connect:
Node.js
Inbound calls
Assign a phone number to an agent in the dashboard or viaPATCH /v1/chat/phone-numbers/{id}. Inbound calls to that number automatically start a voice session with the assigned agent.