OpenAI · Schema

CreateSpeechRequest

AIArtificial IntelligenceLarge Language ModelsT1

Properties

Name Type Description
model string One of the available TTS models. tts-1 is optimized for speed, tts-1-hd is optimized for quality, and gpt-4o-mini-tts supports advanced voice instructions.
input string The text to generate audio for. The maximum length is 4096 characters.
voice string The voice to use when generating the audio. Previews of the voices are available in the Text to Speech guide.
instructions string Control the voice of your generated audio with additional instructions. Only supported with gpt-4o-mini-tts.
response_format string The format to audio in. Supported formats are mp3, opus, aac, flac, wav, and pcm. Opus is recommended for internet streaming and communication, aac for digital audio compression, and flac for lossless
speed number The speed of the generated audio. Select a value from 0.25 to 4.0. 1.0 is the default.
View JSON Schema on GitHub