Fixed Headers

Accept
audio/mp3
required

Variable Parameters

speaker
string
required

Must be one of the voices listed in our documentation.

text
string
required

The text you’d like spoken

modelId
string

Choose mist for hyper-realistic conversational voices or v1 for Rime’s first-gen model (default: v1)

pauseBetweenBrackets
bool
default: "false"

When set to true, adds pauses between words enclosed in angle brackets. The number inside the brackets specifies the pause duration in milliseconds.
Example: “Hi. <200> I’d love to have a conversation with you.” adds a 200ms pause between the first and second sentences.

phonemizeBetweenBrackets
bool
default: "false"

When set to true, you can specify the phonemes for a word enclosed in curly brackets.
Example: “{h’El.o} World” will pronounce “Hello” as expected. More details on this feature are incoming!

inlineSpeedAlpha
string

Comma-separated list of speed values applied to words in square brackets. Values < 1.0 speed up speech, > 1.0 slow it down. Example: “This sentence is [really] [fast]” with inlineSpeedAlpha “0.5, 3” will make “really” slow and “fast” fast.

samplingRate
int

The value, if provided, must be between 4000 and 44100. Default: 22050

speedAlpha
float
default: "1.0"

Adjusts the speed of speech. Lower than 1.0 is faster than default. Higher than 1.0 is slower than default.

reduceLatency
bool
default: "false"

Reduces the latency of response, at the cost of some possible mispronunciation of digits and abbreviations.