Cartesia Sonic Text-to-Speech API
The Sonic text-to-speech API converts text into ultra-low-latency, emotive speech with sub-100ms time-to-first-byte. It supports REST, server-sent events, and WebSocket streaming for real-time voice agents and applications.
Documentation
Documentation
https://docs.cartesia.ai
GettingStarted
https://docs.cartesia.ai/get-started
APIReference
https://docs.cartesia.ai/api-reference
Authentication
https://docs.cartesia.ai
Specifications
SDKs
SDK
https://github.com/cartesia-ai/cartesia-python
SDK
https://github.com/cartesia-ai/cartesia-js
SDK
https://github.com/cartesia-ai/cartesia-go
GitHubRepository
https://github.com/cartesia-ai