Amazon Polly

Amazon Polly is a cloud service that converts text into lifelike speech, enabling you to create applications that talk and build entirely new categories of speech-enabled products. Polly supports multiple voices, languages, and audio output formats including neural and generative engines for natural-sounding speech.

Amazon Polly publishes 5 APIs on the APIs.io network, including Lexicons API, Speech API, Speech Synthesis API, and 2 more. Tagged areas include AI, Machine Learning, Speech Synthesis, Text-To-Speech, and TTS.

The Amazon Polly catalog on APIs.io includes 1 JSON-LD context and 2 Spectral governance rulesets.

Amazon Polly’s developer surface includes authentication, developer portal, engineering blog, support, CLI, documentation, pricing, and 91 more developer resources.

🌐 Visit website 📡 Source on GitHub

65.6/100 strong ▼ -7.4 Agent 41/100 agent ready Full breakdown ↓
scored 2026-07-28 · rubric v0.6

AccessFreemiumSelf serve⚡ Free to try

5 APIs 8 Features 6 Use Cases

AIMachine LearningSpeech SynthesisText-To-SpeechTTSVoiceSSMLNeural EngineGenerative AI

Kin Score

Kin Score How this is scored →
scored 2026-07-28 · rubric v0.6

Composite quality — 65.6/100 · strong

Contract Quality 17.4 / 25

Developer Ergonomics 12.6 / 20

Commercial Clarity 15.8 / 20

Operational Transparency 6.8 / 13

Governance 8.3 / 12

Discoverability 7.4 / 10

Agent readiness — 41/100 · agent ready

Machine-Readable Contract 18 / 18

Agentic Access Contract 10 / 10

MCP Server 0 / 12

Machine-Readable Auth 10 / 10

Idempotency 0 / 9

Stable Error Semantics 8 / 8

Request/Response Examples 7 / 7

Rate-Limit Signaling 7 / 7

Typed Event Surface 0 / 6

Agent Skills 0 / 5

Well-Known Catalog 0 / 4

Consent & Bot Identity 0 / 3

A2A Agent Card 0 / 8

Dry-Run / Simulate Mode 0 / 4

Improve this rating by publishing the missing artifacts — every area above can be raised, and the full rubric is at apis.io/rating/. This rating is computed from github.com/api-evangelist/amazon-polly: open an issue to ask a question, or submit a pull request to add artifacts. Want it done for you? Prioritized profiling — $2,500 →

APIs 5

Individual APIs this provider publishes, each with its own machine-readable definition.

Scroll for all 9

Pricing Plans 1

Published pricing tiers and plan structures.

Amazon Polly Plans Pricing

3 plans

PLANS

Rate Limits 1

Documented rate limits and quota policies.

Amazon Polly Rate Limits

5 limits

RATE LIMITS

FinOps 1

Cost, billing, and metering signals for API financial operations.

Amazon Polly Finops

FINOPS

Features 8

Notable capabilities this provider offers.

Neural Text-to-Speech

Produce natural-sounding speech using neural network-based text-to-speech technology.

Generative Engine

New generative engine delivers the highest quality, most human-like speech synthesis.

Multiple Voices and Languages

Choose from 60+ voices across 30+ languages including male, female, and child voices.

SSML Support

Use Speech Synthesis Markup Language (SSML) to control pronunciation, volume, pitch, and speech rate.

Custom Lexicons

Create custom pronunciation lexicons to control how specific words and phrases are spoken.

Speech Marks

Generate speech marks metadata to synchronize spoken text with animations or visual highlights.

Asynchronous Synthesis Tasks

Process large text bodies asynchronously with S3 output for long-form content.

Multiple Audio Formats

Output audio in MP3, OGG, PCM, and JSON (speech marks) formats.

Scroll for all 8

JSON Schema 24

Standalone JSON Schema definitions for this provider's data models.

Scroll for all 24

JSON Structure 24

JSON Structure definitions describing this provider's data shapes.

Scroll for all 24

Examples 16

Example request and response payloads for these APIs.

Scroll for all 16

Use Cases 6

What developers build with this provider.

Voice Assistants

Build conversational interfaces that speak responses to users.

Accessibility Features

Add text-to-speech reading to applications for visually impaired users.

Podcast and Audio Content

Convert written articles and content into audio podcasts automatically.

E-Learning Narration

Add spoken narration to educational courses and training materials.

Call Center IVR

Create interactive voice response systems with natural-sounding speech.

Language Learning Apps

Provide native-speaker pronunciation examples for language education.

Integrations 5

Pre-built integrations with other platforms and tools.

Amazon S3

Store synthesized speech output from asynchronous synthesis tasks in S3 buckets.

Amazon Lex

Combine Polly speech synthesis with Lex conversational AI for voice chatbots.

AWS Lambda

Trigger speech synthesis from Lambda functions for event-driven voice applications.

Amazon Transcribe

Pair Polly text-to-speech with Transcribe speech-to-text for round-trip voice applications.

Amazon Connect

Power Amazon Connect contact center voice responses with Polly neural speech.

Resources

Get Started 3

Portal, sign-up, and the first successful call

Portal

Portal

GettingStarted

GettingStarted

Signup

Signup

Documentation 25

Reference material describing how the API behaves