-
Notifications
You must be signed in to change notification settings - Fork 495
Trim generated audio based on edge silence #22
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
Conversation
|
Really nice improvement! |
|
This works a bit better IMHO. I am swamped at work so it's kinda raw atm: |
It'll still always cut off 5000/24000 samples ~= 0.2 seconds from each end, no matter if there happens to be information there? 🤔 |
|
Noticed that with the latest model this seems no longer needed. |
|
@Mic92 what is the latest model and how can I get it? |
|
Copied from the README: m = KittenTTS("KittenML/kitten-tts-nano-0.2") Going to huggingface also should work. |
|
Unsubscribe
…On Thu., Sep. 4, 2025, 9:04 a.m. Jörg Thalheim ***@***.***> wrote:
*Mic92* left a comment (KittenML/KittenTTS#22)
<#22 (comment)>
Copied from the README:
m = KittenTTS("KittenML/kitten-tts-nano-0.2")
Going to huggingface also should work.
—
Reply to this email directly, view it on GitHub
<#22 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/A5AYBKENOBP6LLTUXILYPQ33RA2EHAVCNFSM6AAAAACDHQ23JWVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZTENJTGYYTEMZWGQ>
.
You are receiving this because you are subscribed to this thread.Message
ID: ***@***.***>
|
This fixes very short generations ("Hello, world!") being abruptly cut off. The threshold 0.01 was chosen empirically.