XTTS: Streaming text-to-speech

Streaming example

See this example script for how to stream the API response using Python: https://gist.github.com/reuben/70f856d79800e35222138b44232e2369
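As a rough sketch of what streaming the response can look like, the snippet below reads the HTTP body in small chunks and writes each chunk to disk as it arrives, using only the Python standard library. It is not the linked script. The endpoint URL is a placeholder (this page does not state it), and the helper names build_request, iter_chunks and stream_to_file are made up for illustration. The Bearer header follows the credentials type listed on this page.

```python
import json
import urllib.request

# Placeholder only: this page does not state the endpoint URL.
API_URL = "https://example.invalid/xtts/stream"


def build_request(token, payload, url=API_URL):
    """Build a POST request with a Bearer token and a JSON body."""
    data = json.dumps(payload).encode("utf-8")
    return urllib.request.Request(
        url,
        data=data,
        method="POST",
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/json",
        },
    )


def iter_chunks(response, chunk_size=4096):
    """Yield the response body piece by piece instead of reading it all at once."""
    while True:
        chunk = response.read(chunk_size)
        if not chunk:
            break
        yield chunk


def stream_to_file(request, out_path, opener=urllib.request.urlopen):
    """Write streamed audio bytes to out_path and return the byte count."""
    total = 0
    with opener(request) as response, open(out_path, "wb") as out:
        for chunk in iter_chunks(response):
            out.write(chunk)
            total += len(chunk)
    return total
```

In a player, the chunks from iter_chunks would be handed to an audio sink instead of a file, so playback can begin before synthesis finishes.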

Body Params
uuid
required

ID of a Speaker or an XTTS Voice object to use for rendering this sample.

string
length between 1 and 120

Name of sample. Optional, to be used for your own bookkeeping purposes.

string
required
length between 1 and 250

Text to be synthesized. Text length limit varies per language: en: 250, de: 253, fr: 273, es: 239, it: 213, pt: 203, pl: 224, zh-cn: 82, ar: 166, cs: 186, ru: 182, nl: 251, tr: 226, ja: 71, hu: 224, ko: 95.
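A client can check these limits before sending a request. The sketch below copies the limits from the description above. The function name check_text is hypothetical, and it assumes the limit counts characters, which this page does not confirm.

```python
# Per-language limits copied from the parameter description.
TEXT_LIMITS = {
    "en": 250, "de": 253, "fr": 273, "es": 239, "it": 213, "pt": 203,
    "pl": 224, "zh-cn": 82, "ar": 166, "cs": 186, "ru": 182, "nl": 251,
    "tr": 226, "ja": 71, "hu": 224, "ko": 95,
}


def check_text(text, language="en"):
    """Return text unchanged, or raise ValueError if it breaks the limit."""
    if language not in TEXT_LIMITS:
        raise ValueError(f"unsupported language: {language!r}")
    limit = TEXT_LIMITS[language]
    # Assumption: the limit is measured in characters (Python len),
    # not bytes or tokens.
    if not 1 <= len(text) <= limit:
        raise ValueError(
            f"text must be 1 to {limit} characters for {language!r}, "
            f"got {len(text)}"
        )
    return text
```

Longer passages would need to be split into pieces that each fit the limit for the chosen language and sent as separate requests.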

double
0 to 2
Defaults to 1

string
enum
Defaults to en
  • en - en
  • de - de
  • fr - fr
  • es - es
  • it - it
  • pt - pt
  • pl - pl
  • zh-cn - zh-cn
  • ar - ar
  • cs - cs
  • ru - ru
  • nl - nl
  • tr - tr
  • ja - ja
  • hu - hu
  • ko - ko
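The constraints listed under Body Params can be gathered into one request body. In the sketch below, the keys voice_id, name, text, speed and language are assumptions, because this page does not show the parameter names. The ranges are assumed to be inclusive, and build_payload is a hypothetical helper.

```python
# Language codes from the enum list above.
LANGUAGES = (
    "en", "de", "fr", "es", "it", "pt", "pl", "zh-cn",
    "ar", "cs", "ru", "nl", "tr", "ja", "hu", "ko",
)


def build_payload(voice_id, text, language="en", speed=1.0, name=None):
    """Assemble a JSON-serializable body; key names are assumed, not confirmed."""
    if language not in LANGUAGES:
        raise ValueError(f"language must be one of {LANGUAGES}")
    # Assumption: the documented range "0 to 2" includes both endpoints.
    if not 0 <= speed <= 2:
        raise ValueError("speed must be between 0 and 2")
    payload = {
        "voice_id": str(voice_id),
        "text": text,
        "language": language,
        "speed": float(speed),
    }
    # The name is optional and only kept for the caller's own bookkeeping.
    if name is not None:
        if not 1 <= len(name) <= 120:
            raise ValueError("name must be 1 to 120 characters long")
        payload["name"] = name
    return payload
```

Omitting speed and language reproduces the documented defaults of 1 and en.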
Responses

401

No response body
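Since a 401 carries no response body, a client has only the status code to go on. The sketch below turns it into an explicit error using the standard library. AuthError and open_checked are hypothetical names, and reading 401 as a missing or invalid Bearer token follows general HTTP convention, not a statement on this page.

```python
import urllib.error
import urllib.request


class AuthError(Exception):
    """Raised when the server answers 401 Unauthorized."""


def open_checked(request, opener=urllib.request.urlopen):
    """Open the request, converting a 401 into AuthError."""
    try:
        return opener(request)
    except urllib.error.HTTPError as err:
        if err.code == 401:
            # No body is returned with a 401, so the message is built locally.
            raise AuthError("401 Unauthorized: check the Bearer token") from err
        # Any other HTTP error is passed through unchanged.
        raise
```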

Credentials
Bearer