Custom Pauses and Pronunciations
Rime’s TTS is unique among next-gen TTS offerings in allowing users to customize their output. Out of the box, Rime’s voices will speak fluently and correctly, but since strings of words can be pronounced differently for special effect, we allow users to add custom pauses and custom pronunciations. Let’s see how.
Custom Pauses
To insert a pause within a sentence, put the length of your desired pause in milliseconds inside angle brackets. For example, <750> inserts a pause of 750 milliseconds (0.75 seconds).
To hear the difference, compare the following:
input: "text": "wait, are you actually serious.",
input: "text": "wait, <750> are you actually serious.",
The API request would look like this:
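As a minimal sketch, the request body for the second input could be built in Python as follows. Only the `text` field comes from the examples above. The `pause` helper is a hypothetical convenience, and the endpoint, credentials, and any other required fields are left out because they are not specified here.

```python
import json


def pause(ms: int) -> str:
    """Return the pause marker for `ms` milliseconds, e.g. <750>.

    Hypothetical helper; rejecting non-positive values is an assumption.
    """
    if ms <= 0:
        raise ValueError("pause length must be a positive number of milliseconds")
    return f"<{ms}>"


# Only "text" is taken from the examples above; a real request would also
# need an endpoint, credentials, and any other required fields.
payload = {"text": f"wait, {pause(750)} are you actually serious."}
body = json.dumps(payload)
print(body)
```

The pause marker is part of the text string itself, so no separate field is used for it in this sketch.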
Custom Pronunciation
We can add words to our dictionary within about 24 hours, but if you want to stipulate a pronunciation yourself, you can input a specific pronunciation within curly brackets.
For example, the word custom would be represented as {k1Ast0xm}.
This works even for made-up words, like gorbulets as {g1orby0ul2Ets}.
The API request would look like this:
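As a rough sketch, a request body carrying a custom pronunciation could be assembled in Python like this. The `pronounce` helper and the example sentence are illustrative assumptions. Only the `text` field and the curly-bracket notation come from this page, and a real request may need further fields or options that are not shown here.

```python
import json


def pronounce(phonemes: str) -> str:
    """Wrap a phonetic spelling in curly brackets, e.g. {k1Ast0xm}.

    Hypothetical helper for illustration only.
    """
    return "{" + phonemes + "}"


# The sentence is made up for illustration; the bracketed spelling for
# "custom" is the one given above.
payload = {"text": f"this is a {pronounce('k1Ast0xm')} pronunciation."}
body = json.dumps(payload)
print(body)
```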
Below is a guide to the phonetic alphabet we use and how to input syllabic stress.
Vowels
symbol | use in an English word |
---|---|
@ | bat |
a | hot |
A | butt |
W | about |
x | comma |
Y | bite |
E | bet |
R | bird, foreword |
e | bait |
I | bit |
i | beat |
o | boat |
O | boy |
U | book |
u | boot |
N | button |
Consonants
symbol | use in an English word |
---|---|
b | buy |
C | China |
d | die |
D | thy |
f | fight |
g | guy |
h | high |
J | jive |
k | kite |
l | lie |
m | my |
n | nigh |
G | sing |
p | pie |
r | rye |
s | sigh |
S | shy |
t | tie |
T | thigh |
v | vie |
w | wise |
y | yacht |
z | zoo |
Z | pleasure |
Stress
For primary stress, put 1 before the relevant vowel. For example, comma would be {k1am0x}.
For secondary stress, put 2 in front of the relevant vowel. For example, auctioneer would be {2akS0In1ir}.
All other vowels should have a 0 in front of them.
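The rules above can be checked mechanically before a pronunciation is sent. The following is a minimal sketch, not part of Rime’s tooling: the `validate` function is hypothetical, and it only checks what this guide states, namely that every symbol is in the tables above and every vowel is preceded by a stress digit (0, 1, or 2).

```python
from typing import List

# Symbols copied from the vowel and consonant tables above.
VOWELS = set("@aAWxYEReIioOUuN")
CONSONANTS = set("bCdDfghJklmnGprsStTvwyzZ")
STRESS = set("012")


def validate(pron: str) -> List[str]:
    """Return a list of problems found in a bracketed pronunciation."""
    if not (pron.startswith("{") and pron.endswith("}")):
        return ["pronunciation must be wrapped in curly brackets"]
    body = pron[1:-1]
    problems = []
    i = 0
    while i < len(body):
        ch = body[i]
        if ch in STRESS:
            # A stress digit must be immediately followed by a vowel.
            if i + 1 < len(body) and body[i + 1] in VOWELS:
                i += 2
                continue
            problems.append(f"stress digit {ch!r} at position {i} is not followed by a vowel")
        elif ch in VOWELS:
            # Reaching a vowel here means no stress digit preceded it.
            problems.append(f"vowel {ch!r} at position {i} has no stress digit before it")
        elif ch not in CONSONANTS:
            problems.append(f"unknown symbol {ch!r} at position {i}")
        i += 1
    return problems


for example in ["{k1Ast0xm}", "{g1orby0ul2Ets}", "{2akS0In1ir}", "{k1amx}"]:
    print(example, validate(example))
```

The first three examples come from this page and produce no problems. The last one omits the 0 before the final vowel, so it is reported.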