Hi! Thanks for sharing this great project. Do you have any recommendations for a text-to-speech model that works well for generating audio in this task?