add a function--ApplyAddAdditiveNoise by LvHang · Pull Request #10 · pegahgh/kaldi

LvHang · 2016-11-29T07:26:25Z

@pegahgh
Hi Pegah,
I have already finished the function about add additive noise with the option "--add-noise".
Please check it.
Thank you for your guidance.

Hang

pegahgh · 2016-11-29T20:37:22Z

src/xvectorbin/nnet3-xvector-signal-perturb-egs.cc

+  // In the version, we ask the noise_cols >= input_cols. If mfcc, the cols are equal.
+  // If raw data, we ask the noise_cols > input_cols.
+  int32 input_rows = input_eg.NumRows(), input_cols = input_eg.NumCols();  
+  KALDI_ASSERT(noise_eg.NumCols() >= input_cols);


The dimension of noise eg and input should be equal. noise_eg.NumCols() == input_cols

@pegahgh
Hi, pegah.
I know the noise_eg.NumCols() == input_cols should be equal in feature domain, such as mfcc.
I let noise_eg.NumCols() > input_cols, just because I want to do something like you write in ApplyPerturbation(). It makes the dimensionality of the noise_eg will a little longer than input_eg. It maybe useful in raw-data situation.
I just want to make sure. Thanks a lot.
Hang

For now, we can focus on MFCC domain, and if it gives us improvement, we can switch to raw waveform.
We may need to write different Perturbation class for raw waveform as we have more flexibility in raw waveform domain.

LvHang · 2016-11-29T22:12:09Z

@pegahgh
Hi, Pegah
I modify it. Please check it and review others. Thanks!
Hang

pegahgh · 2016-11-30T16:32:24Z

src/xvectorbin/nnet3-xvector-signal-perturb-egs.cc

+// This function add the noise to the orginial signal. We should not normalize 
+// the signal level of the orginial signal. According to SNR, we rescale the noise
+// and add it. So that the perturbed signal is created. 
+void ApplyAddAdditiveNoise(const int32 &SNR,


Change the name to ApplyAdditiveNoise

pegahgh · 2016-11-30T16:40:35Z

src/xvectorbin/nnet3-xvector-signal-perturb-egs.cc

+                                                  start_col_ind, input_cols));
+  // compute the energy of noise and input
+  Matrix<BaseFloat> input_energy_mat(input_rows, input_cols);
+  input_energy_mat.AddMatMatElements(1.0, input_eg, input_eg, 1.0);


Although input_energy_mat initialized with zero, it should be AddMatMatElements(1.0, input_eg, input_eg, 0.0),

It is not a good idea to design the code like this. You should write this function in signal-distort.h and add-noise and snr should be added as options to XvectorPerturbationOptions struct.
The function should be ApplyAdditiveNoise(const VectorBase input, const VectorBase noise, BaseFloat snr, Vector *noisy_input)

In class PerturbXvectorSignal, you have applyDistortion which is a general function, which applies all type of distortions to input.
Then it applies distortions w.r.t opts_.

You need to add a function PerturbExamples(const XvectorOptions opts, const Matrix &input_egs, Matrix *perturbed_egs)
and this function called in nnet3-xvector-signal-perturb.cc and it generates object from PerturbXvectorSignal and vectorize the input and calls ApplyDistortion to apply different type of distortions on input.

LvHang · 2016-12-04T04:54:10Z

@pegahgh
Hi Pegah,
According to your suggestions, I modify the files. Maybe it also has some unsatisfying points.
Could you give me some suggestion?
Thank you very much for your patient and guidance.
Hang

pegahgh · 2016-12-05T16:09:30Z

src/feat/signal-distort.cc

+void PerturbXvectorSignal::ApplyAdditiveNoise(const MatrixBase<BaseFloat> &input_eg,
+                                              const Matrix<BaseFloat> &noise_eg,
+                                              const int32 &SNR,
+                                              Matrix<BaseFloat> *perturb_eg) {


the name should be perturbed_eg

pegahgh · 2016-12-05T16:14:12Z

src/feat/signal-distort.cc

+// and add it. So that the perturbed signal is created. 
+void PerturbXvectorSignal::ApplyAdditiveNoise(const MatrixBase<BaseFloat> &input_eg,
+                                              const Matrix<BaseFloat> &noise_eg,
+                                              const int32 &SNR,


You don't need to define snr. SNR is defined in XvectorPerturbOptions and you can use opts_.snr.
Also you should not use uppercase in defining function variables.
The names of variables (including function parameters) and data members are all lowercase, with underscores between words.

pegahgh · 2016-12-05T16:20:22Z

src/feat/signal-distort.cc

+    const kaldi::nnet3::NnetIo &noise_eg_io = noise_eg.io[0];
+    Matrix<BaseFloat> noise_eg_mat;
+    noise_eg_io.features.CopyToMat(&noise_eg_mat);
+    int32 SNR = opts_.snr;


Add these lines nnet3-perturb-egs binary.
I told you that You have to just call PerturbExamples function in nnet3-perturb-egs.cc. You should put loop for reading egs in nnet3 binary not here!
PerturbExamples should be defined as a separate function not a function of this class. The point is that in PertubEgs function, you create object from class PerturbXvectorSignal and call ApplyDistortion.

pegahgh · 2016-12-05T16:20:45Z

src/feat/signal-distort.h

 #include "feat/resample.h"
 #include "matrix/matrix-functions.h"
 #include "cudamatrix/cu-matrix.h"
+#include "nnet3/nnet-example.h"


remove it, it is a wrong dependency!

pegahgh · 2016-12-05T16:21:31Z

src/feat/signal-distort.h

+
+  void ApplyAdditiveNoise(const MatrixBase<BaseFloat> &input_eg,
+                          const Matrix<BaseFloat> &noise_eg,
+                          const int32 &SNR,


pegahgh · 2016-12-05T16:25:08Z

src/xvectorbin/nnet3-xvector-signal-perturb-egs.cc

+                    const Matrix<BaseFloat> &input_egs,
+                    Matrix<BaseFloat> *perturb_egs) {
+  //new a PerturbXvectorSignal object and call ApplyDistortion
+  PerturbXvectorSignal perturb_xvector(opts);


Change perturb_egs to perturbed_egs

Change perturb_xvector to perturb_egs.

LvHang · 2016-12-06T05:27:05Z

@pegahgh
Hi Pegah,
Accorading to our discussion, I modify the files.
I check some other option structures. In general, they will be the one-dimensional data type, such as int, double, string and so on. So I didn't add a matrix to XvectorPerturbOptions. I add a private point in class PerturbXvectorSignal. Maybe we can discuss and find a better solution.
Please check the binaries. Thank you very much for your guidance.
Hang

pegahgh · 2016-12-06T18:08:27Z

src/feat/signal-distort.cc

+void PerturbXvectorSignal::ApplyDistortion(const MatrixBase<BaseFloat> &input_egs,
+                                           Matrix<BaseFloat> *perturb_egs) {
+    // conduct ApplyAdditiveNoise
+  if (!opts_.add_noise_rspecifier.empty()) {


change option name to add_noise

I think the best strategy is to have a add-noise option in PertubXvectorOption as noise rspecifier not noise examples.
--add-noise=noise.scp, where noise.scp corresponds to features for different noises. You can randomly select different noises.
Then you no longer need to pass noise matrix to PerturbExample and you can easily pass noise rspecifier using --add-noise option.
You don't need to change ApplyAdditiveNoise class. You just need to check if add-noise is not empty in ApplyDistortion and the read matrix of noise using BaseFloatMatrixReader and pass it to ApplyAdditiveNoise.

LvHang · 2016-12-06T22:09:08Z

@pegahgh
Hi Pegah,
I modify it. Now, we pass a filename, such as noise.scp, to add-noise option. And we choose a noise matrix in ApplyDistortion function. I hope I understand your intention in a right way.
Please check it. Thanks a lot for your patience.
Hang

pegahgh · 2016-12-07T19:12:14Z

src/feat/signal-distort.cc

+  if (!opts_.add_noise.empty()) {
+    // choose a noise from the noise.scp/ark
+    // 1) we need to record the keys of noise_egs
+    std::vector<std::string> list_noise_egs;


It is no longer noise_egs, the name should be noise_list!

pegahgh · 2016-12-07T19:14:31Z

src/feat/signal-distort.cc

+    noise_seq_reader.Close();
+
+    // 2) we random choose an noise example
+    int32 num_noise_egs = list_noise_egs.size();


num_noises is better name for num_noise_egs!

pegahgh · 2016-12-07T19:16:56Z

src/feat/signal-distort.cc

+    ApplyAdditiveNoise(input_egs, *noise_egs_, perturb_egs);
+    // conduct others
+    // TODO
+  } else { // deal with the opts_.noise_egs situation


no need for else condition!
We can compose several different perturbation to generated perturbed_egs.

pegahgh · 2016-12-07T19:18:21Z

src/feat/signal-distort.cc

+  }
+}
+
+// This function is a entrance. It calls ApplyDistortion to apply different


Change the comment!

pegahgh · 2016-12-07T19:22:08Z

src/feat/signal-distort.h

    opts->Register("noise-egs", &noise_egs, "If supplied, the additive noise is added to input signal.");
    opts->Register("rand_distort", &rand_distort, "If true, the signal is slightly changes"
                   "using some designed FIR filter with no zeros.");
+    opts->Register("add-noise", &add_noise, "specify a file contains some noise egs");


change the definition! e.g. Noise rspecifier for additive noises, if nonempty, the additive noise randomly selected and added to input egs.

pegahgh · 2016-12-07T19:26:32Z

src/feat/signal-distort.h

  }
 };

 class PerturbXvectorSignal {


Add Comment about PerturbXvectorSignal class

pegahgh · 2016-12-07T19:27:27Z

src/feat/signal-distort.h

 public:
  PerturbXvectorSignal(XvectorPerturbOptions opts): opts_(opts) { };
-
+  inline void SetNoiseEgs(const Matrix<BaseFloat> &noise_egs) {


remove this. You don't need noise_egs_ as private member of class.

pegahgh · 2016-12-07T19:28:03Z

src/feat/signal-distort.h

  XvectorPerturbOptions opts_;
+  // if we want use many examples in once ApplyDistortion, we can expand the point
+  // to a point vector.
+  const Matrix<BaseFloat> *noise_egs_;


remove noise_egs_!

pegahgh · 2016-12-07T19:29:46Z

src/feat/signal-distort.cc

+    std::string key_noise_eg = list_noise_egs[index_noise_eg];
+    RandomAccessBaseFloatMatrixReader noise_random_reader(opts_.add_noise);
+    Matrix<BaseFloat> noise_eg_mat = noise_random_reader.Value(key_noise_eg);
+    SetNoiseEgs(noise_eg_mat);


remove this line, and also change noise_egs_mat to noise_mat and key_noise_eg to noise_name!

pegahgh · 2016-12-07T19:30:23Z

src/feat/signal-distort.cc

+    Matrix<BaseFloat> noise_eg_mat = noise_random_reader.Value(key_noise_eg);
+    SetNoiseEgs(noise_eg_mat);
+
+    ApplyAdditiveNoise(input_egs, *noise_egs_, perturb_egs);


You can directly use noise_mat, why do you use noise_egs_?

…hange

pegahgh · 2016-12-29T17:44:30Z