The ElevenLabs Conversational AI WebSocket API enables real-time, interactive voice conversations with AI agents. It supports bidirectional audio streaming, text events, and con...
The ElevenLabs Text to Speech WebSocket API enables bidirectional streaming for text-to-speech conversion. Clients send text chunks incrementally and receive audio chunks as the...
The ElevenLabs Webhook system delivers event notifications to configured endpoints when specific actions occur within the platform. This includes post-call webhooks from Convers...
aid: elevenlabs
url: https://raw.githubusercontent.com/api-evangelist/elevenlabs/refs/heads/main/apis.yml
modified: '2026-05-19'
common:
- type: GitHubOrganization
url: https://github.com/elevenlabs
- type: LinkedIn
url: https://www.linkedin.com/company/elevenlabsio
- type: JSONLD
url: json-ld/elevenlabs-context.jsonld
- type: JSONSchema
url: json-schema/elevenlabs-voice-schema.json
- type: JSONSchema
url: json-schema/elevenlabs-agent-schema.json
- type: JSONSchema
url: json-schema/elevenlabs-webhook-event-schema.json
- type: Features
data:
- 'Free: 10k credits/mo with TTS/STT/Sound effects'
- 'Starter at $6/mo: 30k credits, commercial license, IVC'
- 'Creator at $11/mo: 121k credits, Professional Voice Cloning'
- 'Pro at $99/mo: 600k credits, 44.1kHz PCM'
- 'Scale at $299/mo: 1.8M credits, team collaboration'
- 'Business at $990/mo: 6M credits, low-latency TTS at 5c/min'
- 'Enterprise custom: HIPAA BAA, SSO, custom concurrency'
- 'Concurrent: 2 Free, 3 Starter, 5 Creator, 10 Pro, 15 Scale/Business'
- TTS API with multilingual voices
- STT (Scribe) API
- Dubbing API for video translation
- Voice cloning (Instant + Professional)
- Sound Effects generation
- Music generation
- Conversational AI (voice agents)
- WebSocket streaming for low-latency synthesis
sources:
- https://elevenlabs.io/pricing
updated: '2026-05-04'
- type: LLMsTxt
url: https://elevenlabs.io/llms.txt
apis:
- aid: elevenlabs:text-to-speech
name: ElevenLabs Text to Speech API
tags:
- AI
- Audio
- Speech Synthesis
- Text to Speech
- Voice
image: https://kinlane-images.s3.amazonaws.com/shared/apis-json/apis-json-logo.jpg
baseURL: https://api.elevenlabs.io
humanURL: https://elevenlabs.io/docs/api-reference/text-to-speech/convert
properties:
- url: https://elevenlabs.io/docs/api-reference/text-to-speech/convert
type: Documentation
- url: openapi/elevenlabs-text-to-speech-openapi.yml
type: OpenAPI
- url: asyncapi/elevenlabs-text-to-speech-streaming-asyncapi.yml
type: AsyncAPI
description: >-
The ElevenLabs Text to Speech API converts text into lifelike spoken audio with nuanced intonation, pacing, and
emotional awareness. It supports multiple output formats including MP3, PCM, and mu-law, and offers a range of
models such as Flash v2.5 for ultra-low latency real-time applications and Multilingual v2 for multilingual
speech, with language coverage that varies by model and reaches 70+ languages across the lineup. Developers can
select from thousands of pre-built voices or use custom cloned voices to generate natural, expressive speech.
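# Illustrative note, not part of the upstream catalog data: a minimal sketch of a request to the
# convert operation documented at the humanURL above. The path shape, the `xi-api-key` header,
# the `output_format` query value, and the `model_id` value are assumptions based on the public
# ElevenLabs API reference and should be verified there before use. <voice_id> and
# <your-api-key> are placeholders, not real values.
#
#   POST https://api.elevenlabs.io/v1/text-to-speech/<voice_id>?output_format=mp3_44100_128
#   xi-api-key: <your-api-key>
#   Content-Type: application/json
#
#   {"text": "Hello from the API catalog.", "model_id": "eleven_flash_v2_5"}
#
# The response body is expected to be the encoded audio in the requested output format.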
- aid: elevenlabs:speech-to-text
name: ElevenLabs Speech to Text API
tags:
- AI
- Audio
- Speech to Text
- Transcription
image: https://kinlane-images.s3.amazonaws.com/shared/apis-json/apis-json-logo.jpg
baseURL: https://api.elevenlabs.io
humanURL: https://elevenlabs.io/docs/api-reference/speech-to-text/convert
properties:
- url: https://elevenlabs.io/docs/api-reference/speech-to-text/convert
type: Documentation
- url: openapi/elevenlabs-speech-to-text-openapi.yml
type: OpenAPI
description: >-
The ElevenLabs Speech to Text API transcribes spoken audio into text. It supports multiple audio formats and
languages, enabling developers to build applications that require reliable audio transcription. The API is
designed for both real-time and batch processing use cases.
- aid: elevenlabs:voice-cloning
name: ElevenLabs Voice Cloning API
tags:
- AI
- Audio
- Voice
- Voice Cloning
image: https://kinlane-images.s3.amazonaws.com/shared/apis-json/apis-json-logo.jpg
baseURL: https://api.elevenlabs.io
humanURL: https://elevenlabs.io/docs/api-reference/voices/ivc/create
properties:
- url: https://elevenlabs.io/docs/api-reference/voices/ivc/create
type: Documentation
- url: openapi/elevenlabs-voice-cloning-openapi.yml
type: OpenAPI
description: >-
The ElevenLabs Voice Cloning API allows developers to create custom AI voices from audio recordings. Instant Voice
Cloning requires as little as 60 seconds of clean audio to generate a usable voice clone, while Professional Voice
Cloning produces higher fidelity results from a minimum of 30 minutes of recordings. Cloned voices can then be
used with the Text to Speech API for generating speech that closely matches the original speaker.
- aid: elevenlabs:voices
name: ElevenLabs Voices API
tags:
- AI
- Voice Library
- Voice Management
- Voices
image: https://kinlane-images.s3.amazonaws.com/shared/apis-json/apis-json-logo.jpg
baseURL: https://api.elevenlabs.io
humanURL: https://elevenlabs.io/docs/api-reference/voices/get
properties:
- url: https://elevenlabs.io/docs/api-reference/voices/get
type: Documentation
- url: openapi/elevenlabs-voices-openapi.yml
type: OpenAPI
description: >-
The ElevenLabs Voices API provides management capabilities for the voice library, including listing, retrieving,
creating, editing, and deleting voices. Developers can access a library of over 5,000 pre-built voices and manage
their own custom voices. The API also supports voice design, allowing creation of new AI voices from text
descriptions specifying desired characteristics such as accent, age, and tone.
- aid: elevenlabs:sound-effects
name: ElevenLabs Sound Effects API
tags:
- AI
- Audio Generation
- Sound Effects
image: https://kinlane-images.s3.amazonaws.com/shared/apis-json/apis-json-logo.jpg
baseURL: https://api.elevenlabs.io
humanURL: https://elevenlabs.io/docs/api-reference/sound-generation/create
properties:
- url: https://elevenlabs.io/docs/api-reference/sound-generation/create
type: Documentation
- url: openapi/elevenlabs-sound-effects-openapi.yml
type: OpenAPI
description: >-
The ElevenLabs Sound Effects API generates cinematic sound effects from text descriptions. Developers can describe
the desired sound in natural language and receive high-quality audio output. The API supports audio tags for
controlling delivery, emotion, emphasis, pauses, and specific sound effects, making it suitable for game
development, film production, and multimedia content creation.
- aid: elevenlabs:audio-isolation
name: ElevenLabs Audio Isolation API
tags:
- Audio Isolation
- Audio Processing
- Noise Removal
image: https://kinlane-images.s3.amazonaws.com/shared/apis-json/apis-json-logo.jpg
baseURL: https://api.elevenlabs.io
humanURL: https://elevenlabs.io/docs/api-reference/audio-isolation/audio-isolation
properties:
- url: https://elevenlabs.io/docs/api-reference/audio-isolation/audio-isolation
type: Documentation
- url: openapi/elevenlabs-audio-isolation-openapi.yml
type: OpenAPI
description: >-
The ElevenLabs Audio Isolation API removes background noise from audio recordings, isolating vocal tracks from
ambient sounds and interference. This is useful for cleaning up recordings, improving audio quality for podcasts
and interviews, and preparing audio files for further processing such as voice cloning or transcription. The API
processes audio files and returns cleaned versions with the vocal content preserved.
- aid: elevenlabs:dubbing
name: ElevenLabs Dubbing API
tags:
- Audio
- Dubbing
- Localization
- Translation
- Video
image: https://kinlane-images.s3.amazonaws.com/shared/apis-json/apis-json-logo.jpg
baseURL: https://api.elevenlabs.io
humanURL: https://elevenlabs.io/docs/api-reference/dubbing/resources/dub-segment
properties:
- url: https://elevenlabs.io/docs/api-reference/dubbing/resources/dub-segment
type: Documentation
- url: openapi/elevenlabs-dubbing-openapi.yml
type: OpenAPI
description: >-
The ElevenLabs Dubbing API enables automatic translation and voice-over of audio and video content into different
languages. It preserves the original speaker's voice characteristics while translating the spoken content,
supporting seamless localization of multimedia content. The API handles the full dubbing pipeline including
transcription, translation, and speech synthesis with lip-sync timing.
- aid: elevenlabs:voice-changer
name: ElevenLabs Voice Changer API
tags:
- Audio Processing
- Voice Changer
- Voice Conversion
image: https://kinlane-images.s3.amazonaws.com/shared/apis-json/apis-json-logo.jpg
baseURL: https://api.elevenlabs.io
humanURL: https://elevenlabs.io/docs/api-reference/speech-to-speech/convert
properties:
- url: https://elevenlabs.io/docs/api-reference/speech-to-speech/convert
type: Documentation
- url: openapi/elevenlabs-voice-changer-openapi.yml
type: OpenAPI
description: >-
The ElevenLabs Voice Changer API performs speech-to-speech conversion, replacing one voice with another while
preserving the original speech content, timing, and emotional delivery. Developers can transform audio recordings
to sound like a different speaker using any voice from the ElevenLabs library or a custom cloned voice. This is
useful for content creation, privacy protection, and character voice generation.
- aid: elevenlabs:music
name: ElevenLabs Music Generation API
tags:
- AI
- Audio Generation
- Music
image: https://kinlane-images.s3.amazonaws.com/shared/apis-json/apis-json-logo.jpg
baseURL: https://api.elevenlabs.io
humanURL: https://elevenlabs.io/docs/overview/capabilities/music
properties:
- url: https://elevenlabs.io/docs/overview/capabilities/music
type: Documentation
- url: openapi/elevenlabs-music-openapi.yml
type: OpenAPI
description: >-
The ElevenLabs Music Generation API creates music from text prompts, allowing developers to generate original
musical compositions programmatically. Users describe the desired genre, mood, tempo, and instrumentation in
natural language and receive generated audio output. The API is designed for applications that need background
music, jingles, or custom soundtracks without requiring manual composition.
- aid: elevenlabs:conversational-ai
name: ElevenLabs Conversational AI API
tags:
- AI
- Conversational AI
- Real-Time
- Voice Agents
image: https://kinlane-images.s3.amazonaws.com/shared/apis-json/apis-json-logo.jpg
baseURL: https://api.elevenlabs.io
humanURL: https://elevenlabs.io/docs/overview/capabilities/conversational-ai
properties:
- url: https://elevenlabs.io/docs/overview/capabilities/conversational-ai
type: Documentation
- url: openapi/elevenlabs-conversational-ai-openapi.yml
type: OpenAPI
- url: asyncapi/elevenlabs-conversational-ai-asyncapi.yml
type: AsyncAPI
- url: asyncapi/elevenlabs-webhooks-asyncapi.yml
type: AsyncAPI
description: >-
The ElevenLabs Conversational AI API enables developers to build interactive voice agents that can engage in
natural, real-time conversations. It combines speech recognition, language understanding, and speech synthesis
into a unified interface supporting multi-turn dialogue across 70+ languages. The API is designed for building
customer service agents, voice assistants, and interactive voice response systems with expressive, human-sounding
voices.
- aid: elevenlabs:studio
name: ElevenLabs Studio API
tags:
- Content Management
- Projects
- Studio
image: https://kinlane-images.s3.amazonaws.com/shared/apis-json/apis-json-logo.jpg
baseURL: https://api.elevenlabs.io
humanURL: https://elevenlabs.io/docs/overview/capabilities/projects
properties:
- url: https://elevenlabs.io/docs/overview/capabilities/projects
type: Documentation
- url: openapi/elevenlabs-studio-openapi.yml
type: OpenAPI
description: >-
The ElevenLabs Studio API provides programmatic access to the ElevenLabs Studio project management system.
Developers can create, manage, and render long-form audio projects through the API, organizing text into
chapters and assigning different voices to different sections. Studio is designed for producing audiobooks,
podcasts, and other long-form audio content at scale.
description: Converts text into speech using a voice of your choice and returns audio.