This endpoint initiates text-to-speech synthesis and immediately returns a mediaId and token. The audio is generated asynchronously and can be retrieved using the returned credentials.
When to use this endpoint:
For best results with Urdu, use Urdu script. For English words within Urdu text, use ASCII characters. Example: “یہ ایک exerted force ہے”
The generated audio URL can be shared directly with end users or services without proxying through your server.
API key with format "Bearer sk_api_..."
Request for asynchronous text-to-speech synthesis
Successfully initiated audio synthesis
Response containing mediaId and token for retrieving synthesized audio