OpenAI · Schema

CreateSpeechRequest

AIArtificial IntelligenceLarge Language ModelsT1

Properties

Name	Type	Description
model	string	One of the available TTS models. tts-1 is optimized for speed, tts-1-hd is optimized for quality, and gpt-4o-mini-tts supports advanced voice instructions.
input	string	The text to generate audio for. The maximum length is 4096 characters.
voice	string	The voice to use when generating the audio. Previews of the voices are available in the Text to Speech guide.
instructions	string	Control the voice of your generated audio with additional instructions. Only supported with gpt-4o-mini-tts.
response_format	string	The format to audio in. Supported formats are mp3, opus, aac, flac, wav, and pcm. Opus is recommended for internet streaming and communication, aac for digital audio compression, and flac for lossless
speed	number	The speed of the generated audio. Select a value from 0.25 to 4.0. 1.0 is the default.

View JSON Schema on GitHub

JSON Schema

{
  "$schema": "https://json-schema.org/draft/2020-12/schema",
  "title": "CreateSpeechRequest",
  "type": "object",
  "properties": {
    "model": {
      "type": "string",
      "description": "One of the available TTS models. tts-1 is optimized for speed, tts-1-hd is optimized for quality, and gpt-4o-mini-tts supports advanced voice instructions."
    },
    "input": {
      "type": "string",
      "description": "The text to generate audio for. The maximum length is 4096 characters."
    },
    "voice": {
      "type": "string",
      "description": "The voice to use when generating the audio. Previews of the voices are available in the Text to Speech guide."
    },
    "instructions": {
      "type": "string",
      "description": "Control the voice of your generated audio with additional instructions. Only supported with gpt-4o-mini-tts."
    },
    "response_format": {
      "type": "string",
      "description": "The format to audio in. Supported formats are mp3, opus, aac, flac, wav, and pcm. Opus is recommended for internet streaming and communication, aac for digital audio compression, and flac for lossless audio compression."
    },
    "speed": {
      "type": "number",
      "description": "The speed of the generated audio. Select a value from 0.25 to 4.0. 1.0 is the default."
    }
  }
}