Add Small Gaps Between Audio Chunks to Avoid Rushed Speech

Hi,

I noticed that the repository handles long text in a really smart way: it splits the text into smaller chunks and generates audio for each chunk separately.

But there's a small issue: the points where the chunks are stitched together are very noticeable. The audio chunks are joined with no gap between them, which makes the resulting speech sound a bit rushed and unnatural.

Could you add a short pause between chunks in a future version? Making the gap length configurable via .env would also be nice.

Add Small Gaps Between Audio Chunks to Avoid Rushed Speech #53
