'noisescalar' derivation in clean speech and noise mix 

Hi, 

Thanks for sharing this open-source dataset. I am trying to apply this code to generate synthetic noisy datasets for speech processing.  In my practice, I observed that the code-generated data has only half of SNR than the code nominated, which I tested from Audacity. After further checked the 'audiolib.py', I think the 'noisescalar' derivation (line 68) seems to be incorrect.   
 
In the 'audiolib.py' code, the original code is:
**noisescalar = np.sqrt(rmsclean / (10**(snr/20)) / rmsnoise)**

Where I think the square root shall not be used for the noise scalar since the SNR is calculated based on RMS in the derivation, and it shall be corrected as below in the scaling of the noise level.
**noisescalar = rmsclean / (10**(snr/20)) / rmsnoise**

In my test, I got the synthetic noisy data with the correct SNR level after this correction. So could you please correct it in the code?


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

'noisescalar' derivation in clean speech and noise mix #18

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

'noisescalar' derivation in clean speech and noise mix #18

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions