Text to Speech

Convert text to speech

{
  "voiceId": "v_8eelc901",
  "text": "سلام، آپ اِس وقت اوریٹر کی آواز سن رہے ہیں۔",
  "outputFormat": "MP3_22050_128",
}

"<string>"

POST

synthesis

text-to-speech

Convert text to speech

{
  "voiceId": "v_8eelc901",
  "text": "سلام، آپ اِس وقت اوریٹر کی آواز سن رہے ہیں۔",
  "outputFormat": "MP3_22050_128",
}

"<string>"

Authorizations

Authorization

string

header

required

API key with format "Bearer sk_api_..."

Body

application/json

Request for text-to-speech synthesis

voiceId

enum<string>

required

Identifier for the voice to use. Options include v_8eelc901 (Info/Edu), v_kwmp7zxt (Gen Z), v_yypgzenx (Dada Jee), v_30s70t3a (Nostalgic News)

Available options:

v_8eelc901,

v_kwmp7zxt,

v_yypgzenx,

v_30s70t3a

text

string

required

The text to synthesize

Maximum string length: 2500

outputFormat

enum<string>

required

Format of the output audio. Wav files are usually 10x larger, we recommend using MP3 or OGG for best compression results while maintaining quality.

Available options:

WAV_22050_16,

WAV_22050_32,

MP3_22050_32,

MP3_22050_64,

MP3_22050_128,

OGG_22050_16,

ULAW_8000_8

phraseReplacementConfigId

string

Optional ID of a phrase replacement configuration to apply

Response

Successful audio synthesis

The response is of type file.

Request Limits Stream Text to Speech

Getting Started

Core Concepts

Orator API Endpoints

Scribe API Endpoints

Authorizations

Body

Response