Skip to content

Latest commit

 

History

History
31 lines (19 loc) · 1.05 KB

File metadata and controls

31 lines (19 loc) · 1.05 KB

PhonoGlyphe

PhonoGlyphe is a G2P (grapheme to phoneme) 6M transformer model meant as a fallback method for the Misaki G2P engine. It currently only supports the English language.

Usage

from misaki import en
from PhonoGlyphe import G2PModel

fallback = G2PModel(device="cpu")	# Note: with its small size, PhonoGlyphe is often faster on CPU
g2p = en.G2P(trf=False, british=False, fallback=fallback)

text = "Misaki is a G2P engine designed for Kokoro models."
phonemes, tokens = g2p(text)

print(f"Phonemes: {phonemes}")

You can easily listen to the final result using [.](/<GENERATED PHONEMES>/) in this HF Space.

Training

Phonoglyphe was trained using the English dictionaries of the Misaki project. The training code can be found in the training folder.

Acknowledgements

Thanks to Noelle for her support and all her precious advice.