Hi, thanks a lot for this awesome project!
I have a question about handling uncommon or custom words in realTimeTTS.
For example, suppose I want the model to pronounce a made-up brand name like "Xyloq".
By default, the phonemizer may guess something like ZAI-LOK, but I want it pronounced ZAI-LOH.
My questions are:
1. Is there a way to override the phonemizer and provide custom phonemes directly?
2. Does realTimeTTS support SSML tags or some kind of user lexicon?
3. If not, what’s the best practice for real-time scenarios? Should I rely on spelling variants (like writing “Xyloh”) to guide pronunciation?
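To make the spelling-variant workaround concrete, here is a rough sketch of what I have in mind: a small text pre-processing step that swaps known words for respellings before the text is handed to the TTS engine. This is plain Python and not a realTimeTTS API; the names `LEXICON` and `apply_lexicon` are my own, and the "Xyloh" respelling is just a guess at what the phonemizer would read correctly.

```python
import re

# Hypothetical user lexicon: written form -> respelling the phonemizer reads better.
LEXICON = {"Xyloq": "Xyloh"}


def apply_lexicon(text, lexicon=LEXICON):
    """Replace whole-word lexicon entries in text with their respellings."""
    if not lexicon:
        return text
    # Longest entries first so a longer entry wins over a shorter prefix of it.
    words = sorted(lexicon, key=len, reverse=True)
    pattern = re.compile(r"\b(" + "|".join(re.escape(w) for w in words) + r")\b")
    return pattern.sub(lambda m: lexicon[m.group(1)], text)


if __name__ == "__main__":
    # The returned string is what I would pass to the TTS engine instead of the raw text.
    print(apply_lexicon("Try Xyloq today."))
```

One thing I am unsure about with this approach is streaming input: if the text arrives in small chunks, a word could be split across two chunks and the substitution would miss it, so it probably has to run on complete sentences.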
Thanks in advance for any advice!