Cartesia

Cartesia Sonic Text-to-Speech API

The Sonic text-to-speech API converts text into ultra-low-latency, emotive speech with sub-100ms time-to-first-byte. It supports REST, server-sent events, and WebSocket streaming for real-time voice agents and applications.

Documentation GitHub

Documentation

https://docs.cartesia.ai

https://docs.cartesia.ai/get-started

https://docs.cartesia.ai/api-reference

https://docs.cartesia.ai

Specifications

https://raw.githubusercontent.com/api-evangelist/cartesia/refs/heads/main/asyncapi/cartesia-asyncapi.yml

SDKs

GitHubRepository

https://github.com/cartesia-ai

Other Resources

https://play.cartesia.ai

https://github.com/cartesia-ai/cartesia-python

https://github.com/cartesia-ai/cartesia-js

https://github.com/cartesia-ai/cartesia-go

https://cartesia.ai/pricing

https://raw.githubusercontent.com/api-evangelist/cartesia/refs/heads/main/graphql/cartesia-graphql.md

https://raw.githubusercontent.com/api-evangelist/cartesia/refs/heads/main/apis.yml

AsyncAPI Specification