WAV over JSON
Fixed Parameters
Variable Parameters
Must be one of the voices listed in our documentation.
The text you’d like spoken
Choose mist
for hyper-realistic conversational voices or v1
for Rime’s first-gen model (default: v1
)
When set to true, adds pauses between words enclosed in angle brackets. The number inside the brackets specifies the pause duration in milliseconds.
Example: “Hi. <200> I’d love to have a conversation with you.” adds a 200ms pause between the first and second sentences.
When set to true, you can specify the phonemes for a word enclosed in curly brackets.
Example: “{h’El.o} World” will pronounce “Hello” as expected. More details on this feature are incoming!
Comma-separated list of speed values applied to words in square brackets. Values < 1.0 speed up speech, > 1.0 slow it down. Example: “This sentence is [really] [fast]” with inlineSpeedAlpha “0.5, 3” will make “really” slow and “fast” fast.
The value, if provided, must be between 4000 and 44100. Default: 22050
Adjusts the speed of speech. Lower than 1.0 is faster than default. Higher than 1.0 is slower than default.
Reduces the latency of response, at the cost of some possible mispronunciation of digits and abbreviations.