Skip to content

How to handle custom pronunciations (brand names / acronyms) in realTimeTTS? #344

@sienaiwun

Description

@sienaiwun

Hi, thanks a lot for this awesome project!

I have a question about handling uncommon or custom words in realTimeTTS.
For example, suppose I want the model to pronounce a made-up brand name like "Xyloq".
By default, the phonemizer may guess something like ZAI-LOK, but I’d actually want it to be pronounced ZAI-LOH.

My questions are:

Is there a way to override the phonemizer and provide custom phonemes directly?

Does realTimeTTS support SSML tags or some kind of user lexicon?

If not, what’s the best practice for real-time scenarios? Should I rely on spelling variants (like writing “Xyloh”) to guide pronunciation?

Thanks in advance for any advice!

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions