Skip to content

Mimic input audio volume #6

@kpister

Description

@kpister

Listening to some vloggers, they often modulate the volume of their voice - could be cool to add this feature.

An initial step would be tagging sentences at 3 different volume levels and applying numpy masks.
Bigger steps:
if we can tag word to word or phrase to phrase translation we can better connect these.
if we can quantify the amplitude of the input audio, we could make a continuous volume mask.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions