Vloggers often modulate the volume of their voice as they speak; it could be cool to add this as a feature.
An initial step would be tagging each sentence with one of 3 volume levels and applying the matching numpy mask to its audio.
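A minimal sketch of that initial step, assuming the audio is a mono float waveform in [-1, 1] and that sentence boundaries are already available in seconds. The function name, the tag names (`quiet`, `normal`, `loud`), and the gain values are hypothetical and only illustrate the idea:

```python
import numpy as np

# Hypothetical gain per volume tag; the values are illustrative only.
LEVEL_GAINS = {"quiet": 0.5, "normal": 1.0, "loud": 1.5}

def apply_volume_levels(audio, sample_rate, segments):
    """Scale each tagged sentence span of a mono waveform by its level's gain.

    segments: iterable of (start_seconds, end_seconds, level) tuples.
    Samples outside every segment keep a gain of 1.0.
    """
    audio = np.asarray(audio, dtype=np.float32)
    mask = np.ones_like(audio)
    for start_s, end_s, level in segments:
        # Convert sentence boundaries to sample indices, clamped to the array.
        start = max(0, int(round(start_s * sample_rate)))
        end = min(len(audio), int(round(end_s * sample_rate)))
        mask[start:end] = LEVEL_GAINS[level]
    # Clip so boosted segments stay within the [-1, 1] float range.
    return np.clip(audio * mask, -1.0, 1.0)
```

Calling `apply_volume_levels(audio, 22050, [(0.0, 1.8, "quiet"), (1.8, 4.0, "loud")])` would attenuate the first sentence and boost the second. Hard steps in gain at sentence boundaries may be audible, so a short crossfade could be worth adding later.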
Bigger steps:
- If we can tag word-to-word or phrase-to-phrase alignments in the translation, we can better connect volume changes in the input to the matching parts of the output.
- If we can quantify the amplitude of the input audio, we could make a continuous volume mask.
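One way the continuous mask could be sketched: measure a frame-wise RMS envelope of the input audio, normalize it around its mean, and stretch it to the length of the output audio. This assumes the input and output line up roughly proportionally in time, which would not hold without the alignment described above. The function names, frame sizes, and the `floor`/`ceil` gain limits are all hypothetical:

```python
import numpy as np

def rms_envelope(audio, frame_len=1024, hop=512):
    """Return the RMS amplitude of each frame of a mono waveform."""
    audio = np.asarray(audio, dtype=np.float32)
    if len(audio) < frame_len:
        # Pad short clips so at least one full frame exists.
        audio = np.pad(audio, (0, frame_len - len(audio)))
    starts = range(0, len(audio) - frame_len + 1, hop)
    return np.array([np.sqrt(np.mean(audio[s:s + frame_len] ** 2)) for s in starts])

def continuous_volume_mask(source_audio, target_len, frame_len=1024, hop=512,
                           floor=0.25, ceil=2.0):
    """Build a per-sample gain mask of length target_len from the source envelope."""
    env = rms_envelope(source_audio, frame_len, hop)
    mean = env.mean()
    if mean <= 0:
        # Silent input: leave the output volume unchanged.
        return np.ones(target_len, dtype=np.float32)
    gain = env / mean  # 1.0 corresponds to the source's average loudness
    # Stretch the envelope to the output length by linear interpolation
    # (assumption: source and output timing are roughly proportional).
    src_pos = np.linspace(0.0, 1.0, num=len(env))
    tgt_pos = np.linspace(0.0, 1.0, num=target_len)
    mask = np.interp(tgt_pos, src_pos, gain)
    # Limit the gain so quiet or loud passages are not exaggerated.
    return np.clip(mask, floor, ceil)
```

The mask would then be applied as `np.clip(output_audio * continuous_volume_mask(input_audio, len(output_audio)), -1.0, 1.0)`.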