Text-To-Speech feature (new) #4

Jeronymous · 2025-03-25T15:45:59Z

This add TTS feature (function text -> audio waveform)

Jeronymous · 2025-03-25T15:48:37Z

ssak/utils/tts.py

+
+    text_tokens = tokenizer(text, return_tensors="pt").input_ids.to(device)
+    speaker_type_prompt_tokens = tokenizer(prompt, return_tensors="pt").input_ids.to(device)
+    audio_tensor = model.generate(input_ids=speaker_type_prompt_tokens, prompt_input_ids=text_tokens)


@hedhoud "speaker_type_prompt_tokens" represents here the type of speaker/speech (e.g. "A female speaker delivers an expressive and animated speech with a very high-pitch voice [...]" : see above).
But it turns out that it's not working. The gender is not respected. Do you see what can be wrong?

This was a problem with the "description tokenizer" that is in fact different from the other tokenizer (which tokenizes the text to pronounce).
Fixed by 060a85d

Jeronymous requested review from AudranBert and hedhoud March 25, 2025 15:45

Add TTS

0a18fc8

Jeronymous force-pushed the feature/tts branch from a706388 to 0a18fc8 Compare March 25, 2025 15:46

Jeronymous commented Mar 25, 2025

View reviewed changes

Jeronymous added 2 commits March 25, 2025 18:09

Fix description tokenizer

060a85d

Fix missing comma

f7c1e1b

linagora-labs deleted a comment from hedhoud Apr 1, 2025

Jeronymous added 2 commits April 3, 2025 15:11

Fix master/main confusion

489ba3e

cosm (ruff)

55755ae

Jeronymous merged commit df18fae into main Apr 3, 2025
1 check passed

Jeronymous deleted the feature/tts branch April 3, 2025 13:15

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Text-To-Speech feature (new) #4

Text-To-Speech feature (new) #4

Uh oh!

Jeronymous commented Mar 25, 2025

Uh oh!

Jeronymous Mar 25, 2025

Uh oh!

Jeronymous Mar 25, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Text-To-Speech feature (new) #4

Text-To-Speech feature (new) #4

Uh oh!

Conversation

Jeronymous commented Mar 25, 2025

Uh oh!

Jeronymous Mar 25, 2025

Choose a reason for hiding this comment

Uh oh!

Jeronymous Mar 25, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants