A nn model detacting an audio is Taiwanese or Chinese.
- ffmpeg
- youtube-dl
Requirement: Mno .wavfile with sample_rate = 8192.
ps: preprocess/collect.py, preprocess/convert.py and preprocess/convert.sh may be helpful.
Using audio in wav/ building dataset in feature/.
$ ./preprocess/build_feature
Set your data size in train.py.
Trining model:
$ ./train.py
Tensorboard:
tensorboard --logdir=logs/
$ ./eval.py