Skip to main content
POST
/
synthesis
/
text-to-speech
Convert text to speech
{
  "voiceId": "v_8eelc901",
  "text": "سلام، آپ اِس وقت اوریٹر کی آواز سن رہے ہیں۔",
  "outputFormat": "MP3_22050_128",
}
This response does not have an example.

Authorizations

Authorization
string
header
required

API key with format "Bearer sk_api_..."

Body

application/json

Request for text-to-speech synthesis

voiceId
enum<string>
required

Identifier for the voice to use. Options include v_8eelc901 (Info/Edu), v_kwmp7zxt (Gen Z), v_yypgzenx (Dada Jee), v_30s70t3a (Nostalgic News)

Available options:
v_8eelc901,
v_kwmp7zxt,
v_yypgzenx,
v_30s70t3a
text
string
required

The text to synthesize

Maximum length: 2500
outputFormat
enum<string>
required

Format of the output audio. Wav files are usually 10x larger, we recommend using MP3 or OGG for best compression results while maintaining quality.

Available options:
WAV_22050_16,
WAV_22050_32,
MP3_22050_32,
MP3_22050_64,
MP3_22050_128,
OGG_22050_16,
ULAW_8000_8
phraseReplacementConfigId
string

Optional ID of a phrase replacement configuration to apply

Response

Successful audio synthesis

The response is of type file.

I