Skip to content
Open
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
241 commits
Select commit Hold shift + click to select a range
e8d1287
[src] Fix 'sausage-time' issue which occurs with disabled MBR decodin…
KarelVesely84 Jan 18, 2019
99dc4d8
[egs] Add scripts for yomdle Russian (OCR task) (#2953)
aarora8 Jan 21, 2019
7e529ed
[egs] Simplify lexicon preparation in Fisher callhome Spanish (#2999)
GoVivace Jan 21, 2019
25f09e8
[egs] Update GALE Arabic recipe (#2934)
aarora8 Jan 22, 2019
4338004
[egs] Remove outdated NN results from Gale Arabic recipe (#3002)
aarora8 Jan 22, 2019
05d9a3d
[egs] Add RESULTS file for the tedlium s5_r3 (release 3) setup (#3003)
huangruizhe Jan 23, 2019
1dcdf80
[src] Fixes to grammar-fst code to handle LM-disambig symbols properl…
danpovey Jan 26, 2019
6f56512
[src] Cosmetic change to mel computation (fix option string) (#3011)
boeddeker Jan 30, 2019
56cfb95
[src] Fix Visual Studio error due to alternate syntactic form of nore…
daanzu Feb 1, 2019
9e35898
[egs] Fix location of sequitur installation (#3017)
jybaek Feb 1, 2019
a51bd96
[src] Fix w/ ifdef Visual Studio error from alternate syntactic form …
daanzu Feb 3, 2019
41ea8cf
[egs] Some fixes to getting data in heroico recipe (#3021)
danpovey Feb 3, 2019
fb514dc
[egs] BABEL script fix: avoid make_L_align.sh generating invalid file…
jtrmal Feb 4, 2019
afc5e78
[src] Fix to older online decoding code in online/ (OnlineFeInput; wa…
jdieguez Feb 6, 2019
226cbf7
[script] Fix unset bash variable in make_mfcc.sh (#3030)
oplatek Feb 8, 2019
6fc4c60
[scripts] Extend limit_num_gpus.sh to support --num-gpus 0. (#3027)
oplatek Feb 8, 2019
2f92bd9
[scripts] fix bug in utils/add_lex_disambig.pl when sil-probs and pro…
Teddyang Feb 15, 2019
403c5ee
[egs] Fix path in Tedlium r3 rnnlm training script (#3039)
francoishernandez Feb 18, 2019
cbc8eeb
Spanish Gigaword LM recipe
Feb 19, 2019
e8aecbb
Some bug fixes
saikiranvalluri Feb 19, 2019
ece34bd
Update rnnlm.sh
saikiranvalluri Feb 19, 2019
0c4fe47
Combining lexicon words with pocolm wordslist for RNNLM training
Feb 19, 2019
92e241b
merge conflict resolved
Feb 19, 2019
abfbc56
[src] Thread-safety for GrammarFst (thx:armando.muscariello@gmail.com…
danpovey Feb 20, 2019
f09d48a
[scripts] Cosmetic fix to get_degs.sh (#3045)
Teddyang Feb 21, 2019
b0fc09d
[egs] Small bug fixes for IAM and UW3 recipes (#3048)
ChunChiehChang Feb 21, 2019
1439b0d
Integrated the 2 stage scientific method POCOLM training for Gigaword…
saikiranvalluri Feb 24, 2019
8ad0e01
Update train_pocolm.sh
saikiranvalluri Feb 26, 2019
4494a85
[scripts] Nnet3 segmentation: fix default params (#3051)
danpovey Feb 26, 2019
bf33f1f
[scripts] Allow perturb_data_dir_speed.sh to work with utt2lang (#3055)
igrinis Feb 26, 2019
5f05d59
[scripts] Make beam in monophone training configurable (#3057)
xiaohui-zhang Feb 27, 2019
f856ac2
Update run.sh
saikiranvalluri Feb 27, 2019
c0a555e
[scripts] Allow reverberate_data_dir.py to support unicode filenames …
rezame Feb 27, 2019
684f029
Text cleaning script for splitting Abbreviation words added
saikiranvalluri Feb 28, 2019
185da3a
Update clean_txt_dir.sh
saikiranvalluri Feb 28, 2019
cb393c8
Update clean_txt_dir.sh
saikiranvalluri Feb 28, 2019
18a9cb6
Update train_pocolm.sh
saikiranvalluri Feb 28, 2019
b023638
Update pocolm_cust.sh
saikiranvalluri Feb 28, 2019
46550f0
Cosmetic fixes
saikiranvalluri Feb 28, 2019
ce3c7d7
Update path.sh
saikiranvalluri Feb 28, 2019
deeaaa7
Bug fix in text normalisation script for gigaword corpus
saikiranvalluri Mar 1, 2019
633f21d
small Fix path.sh
saikiranvalluri Mar 1, 2019
8d6b14d
Update clean_abbrevs_text.py
saikiranvalluri Mar 1, 2019
8c9c37b
Added sparrowhawk installation script for text normalisation
saikiranvalluri Mar 1, 2019
2e26464
[scripts] Make some cleanup scripts work with python3 (#3054)
vimalmanohar Mar 1, 2019
c6b05d1
G2P training stage added into Spanish gigaword recipe
saikiranvalluri Mar 2, 2019
8c226cc
G2P seq2seq scripts added in steps/
saikiranvalluri Mar 2, 2019
7b67fc2
RNNLM scripts updated to UTF8 encoding
saikiranvalluri Mar 2, 2019
d21be2d
[scripts] bug fix to nnet2->3 conversion, fixes #886 (#3071)
jfainberg Mar 4, 2019
8fa9648
[src] Make copies occur in per-thread default stream (for GPUs) (#3068)
luitjens Mar 4, 2019
bd326dc
[src] Add GPU version of MergeTaskOutput().. relates to batch decodin…
luitjens Mar 4, 2019
17b7f3f
[src] Add device options to enable tensor core math mode. (#3066)
luitjens Mar 4, 2019
0a1f827
[src] Log nnet3 computation to VLOG, not std::cout (#3072)
kkm000 Mar 5, 2019
f2a89c2
[src] Allow upsampling in compute-mfcc-feats, etc. (#3014)
danpovey Mar 5, 2019
98b45c8
[src] fix problem with rand_r being undefined on Android (#3037)
keli78 Mar 5, 2019
197214d
[egs] Update swbd1_map_words.pl, fix them_1's -> them's (#3052)
Mar 5, 2019
991a75c
[src] Add const overload OnlineNnet2FeaturePipeline::IvectorFeature (…
kkm000 Mar 6, 2019
4432371
[src] Fix syntax error in egs/bn_music_speech/v1/local/make_musan.py …
antonstakhouski Mar 6, 2019
8460fa3
[src] Memory optimization for online feature extraction of long recor…
pzelasko Mar 6, 2019
b801b98
[build] fixed a bug in linux_configure_redhat_fat when use_cuda=no (#…
kan-bayashi Mar 7, 2019
ce97c47
[scripts] Add missing '. ./path.sh' to get_utt2num_frames.sh (#3076)
hhadian Mar 7, 2019
4d61452
[src,scripts,egs] Add count-based biphone tree tying for flat-start c…
hhadian Mar 7, 2019
4767c7c
Update pocolm_cust.sh
saikiranvalluri Mar 8, 2019
2cd5948
Update run.sh
saikiranvalluri Mar 8, 2019
01cef69
[scripts,egs] Remove sed from various scripts (avoid compatibility pr…
desh2608 Mar 8, 2019
2f95609
[src] Rework error logging for safety and cleanliness (#3064)
kkm000 Mar 8, 2019
bcfe3f8
[src] Change warp-synchronous to cub::BlockReduce (safer but slower) …
desh2608 Mar 10, 2019
1209c07
[src] Fix && and || uses where & and | intended, and other weird erro…
kkm000 Mar 11, 2019
5a5696f
[build] Some fixes to Makefiles (#3088)
kkm000 Mar 11, 2019
abd4869
[src] Fixed -Wreordered warnings in feat (#3090)
pzelasko Mar 12, 2019
9c8ba0f
[egs] Replace bc with perl -e (#3093)
entn-at Mar 12, 2019
8cbd582
[scripts] Fix python3 compatibility issue in data-perturbing script (…
nikhilm16 Mar 12, 2019
7435661
[doc] fix some typos in doc. (#3097)
csukuangfj Mar 12, 2019
5bdea69
[build] Make sure expf() speed probe times sensibly (#3089)
kkm000 Mar 12, 2019
b7a4fec
[scripts] Make sure merge_targets.py works in python3 (#3094)
XIAOYixuan Mar 12, 2019
94475d6
[src] ifdef to fix compilation failure on CUDA 8 and earlier (#3103)
desh2608 Mar 13, 2019
fc8c17b
[doc] fix typos and broken links in doc. (#3102)
csukuangfj Mar 13, 2019
3f8b6b2
[scripts] Fix frame_shift bug in egs/swbd/s5c/local/score_sclite_conf…
freewym Mar 13, 2019
633e61c
[src] Fix wrong assertion failure in nnet3-am-compute (#3106)
MartinKocour Mar 14, 2019
8cafd32
[src] Cosmetic changes to natural-gradient code (#3108)
danpovey Mar 14, 2019
b1b230c
[src,scripts] Python2 compatibility fixes and code cleanup for nnet1 …
KarelVesely84 Mar 14, 2019
9c875ef
[doc] Small documentation fixes; update on Kaldi history (#3031)
KarelVesely84 Mar 14, 2019
7a1908f
[src] Various mostly-cosmetic changes (copying from another branch) (…
danpovey Mar 15, 2019
fcd70d3
[scripts] Simplify text encoding in RNNLM scripts (now only support …
saikiranvalluri Mar 16, 2019
b4c7ab6
[egs] Add "formosa_speech" recipe (Taiwanese Mandarin ASR) (#2474)
yfliao Mar 16, 2019
461b50c
[egs] python3 compatibility in csj example script (#3123)
rickychanhoyin Mar 16, 2019
61637e6
[egs] python3 compatibility in example scripts (#3126)
danpovey Mar 17, 2019
1f068cd
[scripts] Bug-fix for removing deleted words (#3116)
psmit Mar 17, 2019
8d60ee3
[scripts] Add fix regarding num-jobs for segment_long_utterances*.sh(…
vimalmanohar Mar 17, 2019
6595b42
Added steps for generating POCOLM ARPA file
saikiranvalluri Mar 18, 2019
7fb716a
[src] Enable allow_{upsample,downsample} with online features (#3139)
jtrmal Mar 18, 2019
80c1437
[src] Fix bad assert in fstmakecontextsyms (#3142)
Mar 19, 2019
0d6ead5
[src] Fix to "Fixes to grammar-fst & LM-disambig symbols" (#3000) (#3…
daanzu Mar 19, 2019
338b586
[build] Make sure PaUtils exported from portaudio (#3144)
jtrmal Mar 19, 2019
73720e6
[src] cudamatrix: fixing a synchronization bug in 'normalize-per-row'…
KarelVesely84 Mar 20, 2019
f9276a5
[src] Fix typo in comment (#3147)
csukuangfj Mar 20, 2019
252690f
[src] Add binary that functions as a TCP server (#2938)
danijel3 Mar 20, 2019
6134c29
[scripts] Fix bug in comment (#3152)
Shujian2015 Mar 21, 2019
aead118
[scripts] Fix bug in steps/segmentation/ali_to_targets.sh (#3155)
saikiranvalluri Mar 21, 2019
213ae52
[scripts] Avoid holding out more data than the requested num-utts (du…
kkm000 Mar 21, 2019
1ac8c92
[src,scripts] Add support for two-pass agglomerative clustering. (#3058)
dogancan Mar 24, 2019
0902c9e
Update run.sh
saikiranvalluri Mar 24, 2019
d8a90ec
Merge branch 'master' into feature/Spanish_gigaword_LM
saikiranvalluri Mar 24, 2019
c10b0fe
Apply g2p part added to get extended lexicon
saikiranvalluri Mar 24, 2019
15a34e8
Merge branch 'feature/Spanish_gigaword_LM' of https://github.com/GoVi…
saikiranvalluri Mar 24, 2019
3df45ae
Small fix in run.sh rnnlm_wordlist
saikiranvalluri Mar 24, 2019
6bd9dad
[src] Disable unget warning in PeekToken (and other small fix) (#3163)
kkm000 Mar 24, 2019
37f4f44
[build] Add new nvidia tools to windows build (#3159)
btiplitz Mar 24, 2019
77ac79f
[doc] Fix documentation errors and add more docs for tcp-server decod…
danijel3 Mar 24, 2019
7e47695
Added sanity chack for Sparrowhawk normalizer in cleanup script
saikiranvalluri Mar 25, 2019
91a4611
Data preparation fixes
saikiranvalluri Mar 25, 2019
5f45dd1
Cosmetic options for gigaword textclean
saikiranvalluri Mar 26, 2019
27034a2
[scripts] Fix non-randomness in getting utt2uniq, introduced in #3142…
desh2608 Mar 27, 2019
f9828e9
[build] Don't build for Tegra sm_XX versions on x86/ppc and vice vers…
luitjens Mar 27, 2019
419e35c
[egs] Fixes Re encoding to IAM, uw3 recipes (#3012)
aarora8 Mar 29, 2019
2ebe976
[src] Efficiency improvement and extra checking for cudamarix, RE def…
luitjens Mar 30, 2019
abf7a8c
[egs] Fix small typo in tedlium download script (#3178)
Shujian2015 Mar 30, 2019
7691d00
[github] Add GitHub issue templates (#3187)
Mar 31, 2019
9ef700f
[build] Add missing dependency to Makefile (#3191)
danpovey Mar 31, 2019
5845334
[src] Fix bug in pruned lattice rescoring when input lattice has epsi…
hainan-xv Apr 1, 2019
e711d30
Some fixes in rnnlm training
saikiranvalluri Apr 1, 2019
8d521c6
Moved s5_gigaword directory to s5
saikiranvalluri Apr 1, 2019
be019cd
[scripts] Fix bug in extend_lang.sh regarding extra_disambig.txt (#3195)
armusc Apr 2, 2019
c57ed95
Merge branch 'master' into feature/Spanish_gigaword_LM
saikiranvalluri Apr 2, 2019
f610470
removed s5_gigaword folder
saikiranvalluri Apr 2, 2019
f810119
Small cleanup for scripts format
saikiranvalluri Apr 2, 2019
ffbe16b
[egs] Update Tedlium s5_r3 example with more up-to-date chain TDNN co…
jyhnnhyj Apr 3, 2019
b180707
[scripts] Fix bug in extend_lang.sh causing validation failure w/ ext…
lucas-jo Apr 3, 2019
7093dfa
[scripts] Bug-fix in make_lexicon_fst.py, which failed when --sil-pro…
armusc Apr 4, 2019
6f0a3a2
[egs] Fix very small typo in run_tdnn_1b.sh (#3207)
Shujian2015 Apr 4, 2019
ddeac98
[build] Tensorflow version update (#3204)
langep Apr 4, 2019
dc8a56e
Cosmetic fix
saikiranvalluri Apr 5, 2019
beb0151
[src] Optimizations to CUDA kernels (#3209)
kangshiyin Apr 6, 2019
a3a190b
[src] Move curand handle out of CuRand class and into CuDevice. (#3196)
luitjens Apr 7, 2019
faa7ff8
[build] Make MKL the default BLAS library, add installation scripts (…
Apr 7, 2019
76bdf20
[build] check for i686 as a valid prefix for Android triplets (#3213)
Dr-Desty-Nova Apr 7, 2019
4ae4bb0
[build] Fix configure breakage from #3194 (MKL default)
Apr 9, 2019
b96cab7
[build] Add missing line continuation '\' in tfrnnlmbin/Makefile (#3218)
teinhonglo Apr 10, 2019
9b730e0
[src] Fix nnet2 DctComponent test failure (#3225)
huangruizhe Apr 12, 2019
4cfbd21
[src] Update CUDA code to avoid synchronization errors on compute ca…
kangshiyin Apr 12, 2019
df41d4c
[src] fix nnet2 DCTCompnent test failure -- removing anther dct_keep_…
huangruizhe Apr 12, 2019
ec0edc5
Merge branch 'master' into feature/Spanish_gigaword_LM
saikiranvalluri Apr 12, 2019
ebfa3cb
[build] Remove references to deprecated MKL libs in gst_plugin (#3229)
Apr 14, 2019
4e8164c
[scripts] Fix default params in nnet3 segmentation script (#3230)
rezame Apr 14, 2019
0bfc307
[src] Correct sanity check in nnet-example-utils.cc (nnet3) (#3232)
KarelVesely84 Apr 16, 2019
f8021d7
Revert "[src] Update CUDA code to avoid synchronization errors on co…
danpovey Apr 16, 2019
06a21b1
[build] .gitignore autogenerated /tools/python/ (#3241)
mcalahan Apr 17, 2019
a2d0270
[scripts] Enhance argument checks in nnet3/align_lats.sh (#3243)
Apr 18, 2019
299b111
[egs] invoke 'python2.7' not 'python' when using mmseg (#3244)
naxingyu Apr 18, 2019
8b8222e
Remove virtenv dependency
saikiranvalluri Apr 18, 2019
4ff77c5
[scripts] Make getting nnet3 model context more robust (#3247)
KarelVesely84 Apr 18, 2019
b3a6e17
[egs] Fix hkust_data_prep.sh w.r.t. iconv mac compatibility issue (#3…
zh794390558 Apr 19, 2019
84ecd0e
[egs] Update RM chain recipe with more recent configuration (#3237)
indra622 Apr 19, 2019
0e7afa8
Update path.sh
saikiranvalluri Apr 19, 2019
56d2db9
Update install_sparrowhawk.sh
saikiranvalluri Apr 19, 2019
c3260f2
[egs] Make voxceleb recipe work with latest version of the dataset (…
sunshines14 Apr 19, 2019
fb6693e
Set lang to ESP
saikiranvalluri Apr 20, 2019
f107cdb
[egs] Improve chain example script for Resource Management (RM) (#3252)
indra622 Apr 21, 2019
2c25629
[src] GPU-related changes for speed and correctness on newer arch's. …
luitjens Apr 22, 2019
5a34a0a
[egs] Update voxceleb v1 preparation scripts (#3255)
jyhnnhyj Apr 23, 2019
ce0f420
Set pocolm option - --limit-unk-history=true
saikiranvalluri Apr 23, 2019
9487ce1
Removed unused code
saikiranvalluri Apr 23, 2019
96e7b0a
[build] Note default=MKL; cosmetic fix (#3257)
nshmyrev Apr 23, 2019
d47e36c
[egs] Fix to hkust_data_prep.sh w.r.t. how mmseg is checked for (#3240)
zh794390558 Apr 23, 2019
286e8af
[egs] In WSJ run_ivector_common.sh, expose i-vector #jobs config to r…
KarelVesely84 Apr 23, 2019
25609c5
Fix in checking for empty space lines in lexicon
saikiranvalluri Apr 23, 2019
e3a9844
[egs] Add Spanish dimex100 example (#3254)
alx741 Apr 23, 2019
0cc941f
[build] Build and configure OpenBLAS; default to it on non-x64 machin…
Apr 25, 2019
9e9ae13
[scripts] Fix of a bug in segmentation.pl (#3256)
songyf Apr 25, 2019
510db0f
Fix in RNNLM rescoring decode stage
saikiranvalluri Apr 25, 2019
f8cb5cc
[src] Fixes to cuda unit tests. (#3268)
luitjens Apr 25, 2019
b8a35fd
[src] Adding GPU/CUDA lattice batched decoder + binary (#3114)
hugovbraun Apr 26, 2019
9894f4c
Update run.sh
saikiranvalluri Apr 26, 2019
da4e2b8
[src] Fix unit-test failure UnitTestCuMatrixSetRandn() (#3274)
DongjiGao Apr 27, 2019
203ce37
[src,build] Removed cusolver for now (not needed yet; caused build p…
huangruizhe Apr 27, 2019
1da8ebd
[scripts] Make fix_data_dir.sh remove utterances which have bad durat…
hhadian Apr 30, 2019
939faf8
[scripts] Make generate_plots.py python3-compatible (#3280)
May 1, 2019
212474e
[scripts] Add --one-based option to split_scp.pl (#3279)
xsawyerx May 1, 2019
a045314
[scripts] Allow UTF utterance-ids by removing unnecessary assert (#3283)
rezame May 1, 2019
b1569db
[src] Keep nnet output in the [-30,30] range required by chain denomi…
danpovey May 2, 2019
b17fc84
[scripts] Clean up filehandle usage in split_scp.pl (#3285)
xsawyerx May 2, 2019
230992f
[src] Fix to bug in online-feature.cc that caused crash at end of utt…
danpovey May 2, 2019
f7117db
[scripts] Use correct compile-time regex syntax in split_scp.pl (#3287)
xsawyerx May 2, 2019
df1ebbc
[scripts] Fix a typo in steps/dict/learn_lexicon_bayesian.sh (#3288)
xiaohui-zhang May 2, 2019
155c658
[egs,scripts] Scripts and an example of BPE-based sub-word decoding (…
DongjiGao May 5, 2019
f2670c3
[scripts] Add trainer option --trainer.optimization.num-jobs-step (#3…
May 7, 2019
9702cbc
[egs] Add MGB-5 recipe; https://arabicspeech.org/mgb5 (#3299)
May 8, 2019
5ae3c19
Revert "[scripts] Clean up filehandle usage in split_scp.pl (#3285)" …
danpovey May 9, 2019
20fb648
[src] Fix bug in GeneralMatrix::Uncompress() (#3304)
bringtree May 9, 2019
a5695e9
[src] nnet1: lstm training, introducing cursors when slicing the trai…
KarelVesely84 May 9, 2019
9424f7a
[doc] add an omission in Doxyfile (#3309)
May 10, 2019
ba165c8
[scripts] Fix utils/split_scp.pl breakage (#3308)
May 10, 2019
4d7fe3b
[egs] Bug-fix to shebang in fisher_callhome_spanish (#3312)
saikiranvalluri May 11, 2019
19c88ac
[scripts] Fix error messages in run.pl (#3314)
May 11, 2019
e922333
[egs] New chime-5 recipe (#2893)
vimalmanohar May 12, 2019
a861e56
[scripts,egs] Made changes to the augmentation script to make it work…
phanisankar-nidadavolu May 13, 2019
cec8958
[egs] updated local/musan.sh to steps/data/make_musan.sh in speaker i…
phanisankar-nidadavolu May 13, 2019
d40222e
[src] Fix sample rounding errors in extract-segments (#3321)
May 14, 2019
35f96db
[src,scripts]Store frame_shift, utt2{dur,num_frames}, .conf with feat…
May 14, 2019
a2e7ba3
[build] Initial version of Docker images for (CPU and GPU versions) (…
mdoulaty May 15, 2019
91609c7
[scripts] fix typo/bug in make_musan.py (#3327)
May 15, 2019
95e81c0
[scripts] Fixed misnamed variable in data/make_musan.py (#3324)
phanisankar-nidadavolu May 15, 2019
c5aa3a9
[scripts] Trust frame_shift and utt2num_frames if found (#3313)
May 16, 2019
0ff318b
[scripts] typo fix in augmentation script (#3329)
May 16, 2019
62ebb44
[scripts] handle frame_shit and utt2num_frames in utils/ (#3323)
May 16, 2019
c8b93bc
[scripts] Extend combine_ali_dirs.sh to combine alignment lattices (#…
May 17, 2019
528e072
[src] Fix rare case when segment end rounding overshoots file end in …
alumae May 17, 2019
3bdb541
Update clean_txt_dir.sh
saikiranvalluri May 20, 2019
8397e05
[scripts] Change --modify-spk-id default to False; back-compatibility…
phanisankar-nidadavolu May 20, 2019
8b54ef8
[build] Add easier configure option in failure message of configure (…
danpovey May 20, 2019
ce8798b
[scripts,minor] Fix typo in comment (#3338)
Shujian2015 May 22, 2019
9e0a7f6
[src,egs] Add option for applying SVD on trained models (#3272)
saikiranvalluri May 23, 2019
0e5e07b
[src] Add interfaces to nnet-batch-compute that expects device input.…
luitjens May 23, 2019
52e7ecf
[build] Update GCC support check for CUDA toolkit 10.1 (#3345)
entn-at May 27, 2019
29f3c14
[egs] Fix to aishell1 v1 download script (#3344)
naxingyu May 27, 2019
a5dd6bd
[scripts] Support utf-8 files in some scripts (#3346)
vimalmanohar May 28, 2019
8c6cd31
[src] Fix potential underflow bug in MFCC, RE energy floor, thx: Zolt…
huangruizhe May 28, 2019
e643c73
[scripts]: add warning to nnet3/chain/train.py about ineffective opti…
bringtree May 28, 2019
8706f06
[scripts] Fix regarding UTF handling in cleanup script (#3352)
vimalmanohar May 29, 2019
800924d
[scripts] Change encoding to utf-8 in data augmentation scripts (#3360)
hhadian Jun 1, 2019
eedd9fa
[src] Add CUDA accelerated MFCC computation. (#3348)
luitjens Jun 3, 2019
0b443bd
[src] Optimizations for batch nnet3. The issue fixed here is that (#…
luitjens Jun 3, 2019
16097b4
[scripts,minor] Remove outdated comment (#3361)
Shujian2015 Jun 3, 2019
ced53e1
[egs] A kaldi recipe based on the corpus named "aidatatang_200zh". (#…
DatatangAI Jun 4, 2019
f8a4376
[src] nnet1: changing end-rule in 'nnet-train-multistream', (#3358)
KarelVesely84 Jun 4, 2019
9c734a5
[scripts] Fix how the empty (faulty?) segments are handled in data-cl…
jtrmal Jun 4, 2019
b276d70
[src] Fix to bug in ivector extraction causing assert failure, thx: s…
danpovey Jun 4, 2019
de4a3e3
[src] Fix to bug in ivector extraction causing assert failure, thx: s…
danpovey Jun 4, 2019
1a4aa52
[scripts] add script to compute dev PPL on kaldi-rnnlm (#3340)
hainan-xv Jun 4, 2019
1735003
[scripts,egs] Small fixes to diarization scripts (#3366)
HuangZiliAndy Jun 4, 2019
338cc58
[egs] Modify split_scp.pl usage to match its updated code (#3371)
danpovey Jun 5, 2019
254d636
[src] Fix non-cuda `make depend` build by putting compile guards arou…
luitjens Jun 6, 2019
3648df5
[build] Docker docs update and minor changes to the Docker files (#3…
mdoulaty Jun 6, 2019
0071003
[egs] Scripts for MATERIAL ASR (#2165)
mahsa7823 Jun 6, 2019
acff3f6
[src] Batch nnet3 optimizations. Batch some of the copies in and cop…
luitjens Jun 6, 2019
23ba982
[build] Widen cuda guard in cudafeat makefile. (#3379)
langep Jun 7, 2019
6636557
Update run.sh
saikiranvalluri Jun 9, 2019
69b1bca
Merge branch 'master' into feature/Spanish_gigaword_LM
saikiranvalluri Jun 9, 2019
36499a7
Update run.sh
saikiranvalluri Jul 7, 2019
8da5c3e
Reverse the order of Abbreviation process after punct syms
saikiranvalluri Jul 13, 2019
510b415
Update run_norm.sh
saikiranvalluri Aug 21, 2019
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
The table of contents is too big for display.
Diff view
Diff view
  •  
  •  
  •  
18 changes: 18 additions & 0 deletions .github/ISSUE_TEMPLATE/bug_report.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,18 @@
---
name: Bug report
about: Create a report to help us improve
title: ''
labels: bug
assignees: ''

---

<!--
WARNING: THE KALDI ISSUE TRACKER IS **ONLY** USED FOR KALDI DEVELOPMENT!

If you have a question about using Kaldi, please use the kald-help discussion group:

https://groups.google.com/forum/#!forum/kaldi-help

Instructions for joining are available at: http://kaldi-asr.org/forums.html
-->
18 changes: 18 additions & 0 deletions .github/ISSUE_TEMPLATE/feature-proposal-discussion.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,18 @@
---
name: Feature proposal or discussion
about: Suggest an idea for Kaldi
title: ''
labels: discussion
assignees: ''

---

<!--
WARNING: THE KALDI ISSUE TRACKER IS **ONLY** USED FOR KALDI DEVELOPMENT!

If you have a question about using Kaldi, please use the kald-help discussion group:

https://groups.google.com/forum/#!forum/kaldi-help

Instructions for joining are available at: http://kaldi-asr.org/forums.html
-->
2 changes: 2 additions & 0 deletions .gitignore
Original file line number Diff line number Diff line change
Expand Up @@ -83,6 +83,7 @@ GSYMS
/tools/ATLAS/
/tools/atlas3.8.3.tar.gz
/tools/irstlm/
/tools/mitlm/
/tools/openfst
/tools/openfst-1.3.2.tar.gz
/tools/openfst-1.3.2/
Expand Down Expand Up @@ -147,3 +148,4 @@ GSYMS
/tools/cub-1.8.0.zip
/tools/cub-1.8.0/
/tools/cub
/tools/python/
30 changes: 30 additions & 0 deletions docker/README.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,30 @@
# Kaldi Docker images

Kaldi offers two set of images: CPU-based images and GPU-based images. Daily builds of the latest version of the master branch (both CPU and GPU images) are pushed daily to [DockerHub](https://hub.docker.com/r/kaldiasr/kaldi).

## Using pre-built images
Sample usage of the CPU based images:
```bash
docker run -it kaldiasr/kaldi:latest bash
```

Sample usage of the GPU based images:

Note: use [nvidia-docker](https://github.com/NVIDIA/nvidia-docker) to run the GPU images.

```bash
docker run -it --runtime=nvidia kaldiasr/kaldi:gpu-latest bash
```

## Building images locally
For building the CPU-based image:
```bash
cd docker/debian9.8-cpu
docker build --tag kaldiasr/kaldi:latest .
```

and for GPU-based image:
```bash
cd docker/ubuntu16.04-gpu
docker build --tag kaldiasr/kaldi:gpu-latest .
```
40 changes: 40 additions & 0 deletions docker/debian9.8-cpu/Dockerfile
Original file line number Diff line number Diff line change
@@ -0,0 +1,40 @@

FROM debian:9.8
LABEL maintainer="mdoulaty@gmail.com"

RUN apt-get update && \
apt-get install -y --no-install-recommends \
g++ \
make \
automake \
autoconf \
bzip2 \
unzip \
wget \
sox \
libtool \
git \
subversion \
python2.7 \
python3 \
zlib1g-dev \
ca-certificates \
patch \
ffmpeg \
vim && \
rm -rf /var/lib/apt/lists/*

RUN ln -s /usr/bin/python2.7 /usr/bin/python

RUN git clone --depth 1 https://github.com/kaldi-asr/kaldi.git /opt/kaldi && \
cd /opt/kaldi && \
cd /opt/kaldi/tools && \
./extras/install_mkl.sh && \
make -j $(nproc) && \
cd /opt/kaldi/src && \
./configure --shared && \
make depend -j $(nproc) && \
make -j $(nproc)

WORKDIR /opt/kaldi/

40 changes: 40 additions & 0 deletions docker/ubuntu16.04-gpu/Dockerfile
Original file line number Diff line number Diff line change
@@ -0,0 +1,40 @@

FROM nvidia/cuda:10.0-cudnn7-devel-ubuntu16.04
LABEL maintainer="mdoulaty@gmail.com"

RUN apt-get update && \
apt-get install -y --no-install-recommends \
g++ \
make \
automake \
autoconf \
bzip2 \
unzip \
wget \
sox \
libtool \
git \
subversion \
python2.7 \
python3 \
zlib1g-dev \
ca-certificates \
patch \
ffmpeg \
vim && \
rm -rf /var/lib/apt/lists/*

RUN ln -s /usr/bin/python2.7 /usr/bin/python

RUN git clone --depth 1 https://github.com/kaldi-asr/kaldi.git /opt/kaldi && \
cd /opt/kaldi && \
cd /opt/kaldi/tools && \
./extras/install_mkl.sh && \
make -j $(nproc) && \
cd /opt/kaldi/src && \
./configure --shared --use-cuda && \
make depend -j $(nproc) && \
make -j $(nproc)

WORKDIR /opt/kaldi/

21 changes: 21 additions & 0 deletions egs/aidatatang_200zh/README.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,21 @@
Aidatatang_200zh is a free Chinese Mandarin speech corpus provided by Beijing DataTang Technology Co., Ltd under Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International Public License.

**About the aidatatang_200zh corpus:**

- The corpus contains 200 hours of acoustic data, which is mostly mobile recorded data.
- 600 speakers from different accent areas in China are invited to participate in the recording.
- The transcription accuracy for each sentence is larger than 98%.
- Recordings are conducted in a quiet indoor environment.
- The database is divided into training set, validation set, and testing set in a ratio of 7: 1: 2.
- Detail information such as speech data coding and speaker information is preserved in the metadata file.
- Segmented transcripts are also provided.

You can get the corpus from [here](https://www.datatang.com/webfront/opensource.html).

DataTang is a community of creators-of world-changers and future-builders. We're invested in collaborating with a diverse set of voices in the AI world, and are excited about working on large-scale projects. Beyond speech, we're providing multiple resources in image, and text. For more details, please visit [datatang](<https://www.datatang.com/>).

**About the recipe:**

To demonstrate that this corpus is a reasonable data resource for Chinese Mandarin speech recognition research, a baseline recipe is provided here for everyone to explore their own systems easily and quickly.

In this directory, each subdirectory contains the scripts for a sequence of experiments. The recipe in subdirectory "s5" is based on the hkust s5 recipe and aishell s5 recipe. It generates an integrated phonetic lexicon with CMU dictionary and cedit dictionary. This recipe follows the Mono+Triphone+SAT+fMLLR+DNN pipeline. In addition, this directory will be extended as scripts for speaker diarization and so on are created.
17 changes: 17 additions & 0 deletions egs/aidatatang_200zh/s5/RESULTS
Original file line number Diff line number Diff line change
@@ -0,0 +1,17 @@
%WER 37.09 [ 173936 / 468933, 4868 ins, 31143 del, 137925 sub ] exp/mono/decode_test/cer_10_0.0
%WER 17.98 [ 84305 / 468933, 4724 ins, 12637 del, 66944 sub ] exp/tri1/decode_test/cer_13_0.0
%WER 17.94 [ 84149 / 468933, 5025 ins, 12427 del, 66697 sub ] exp/tri2/decode_test/cer_13_0.0
%WER 17.26 [ 80945 / 468933, 4421 ins, 12958 del, 63566 sub ] exp/tri3a/decode_test/cer_14_0.0
%WER 14.16 [ 66424 / 468933, 4567 ins, 10224 del, 51633 sub ] exp/tri4a/decode_test/cer_14_0.0
%WER 12.22 [ 57304 / 468933, 4799 ins, 8197 del, 44308 sub ] exp/tri5a/decode_test/cer_14_0.0
%WER 5.59 [ 26232 / 468933, 1701 ins, 4377 del, 20154 sub ] exp/chain/tdnn_1a_sp/decode_test/cer_10_0.0

# nnet3 tdnn with online pitch, local/nnet3/tuning/run_tdnn_2a.sh
%WER 7.21 [ 33797 / 468933, 2141 ins, 6117 del, 25539 sub ] exp/nnet3/tdnn_sp/decode_test/cer_13_0.0
%WER 7.44 [ 34878 / 468933, 2252 ins, 5854 del, 26772 sub ] exp/nnet3/tdnn_sp_online/decode_test/cer_12_0.0
%WER 7.79 [ 36542 / 468933, 2527 ins, 5674 del, 28341 sub ] exp/nnet3/tdnn_sp_online/decode_test_per_utt/cer_12_0.0

# chain with online pitch, local/chain/tuning/run_tdnn_2a.sh
%WER 5.61 [ 26311 / 468933, 1773 ins, 4789 del, 19749 sub ] exp/chain/tdnn_2a_sp/decode_test/cer_11_0.0
%WER 5.69 [ 26661 / 468933, 1723 ins, 4724 del, 20214 sub ] exp/chain/tdnn_2a_sp_online/decode_test/cer_11_0.0
%WER 5.98 [ 28046 / 468933, 2031 ins, 4527 del, 21488 sub ] exp/chain/tdnn_2a_sp_online/decode_test_per_utt/cer_11_0.0
14 changes: 14 additions & 0 deletions egs/aidatatang_200zh/s5/cmd.sh
Original file line number Diff line number Diff line change
@@ -0,0 +1,14 @@
# you can change cmd.sh depending on what type of queue you are using.
# If you have no queueing system and want to run on a local machine, you
# can change all instances 'queue.pl' to run.pl (but be careful and run
# commands one by one: most recipes will exhaust the memory on your
# machine). queue.pl works with GridEngine (qsub). slurm.pl works
# with slurm. Different queues are configured differently, with different
# queue names and different ways of specifying things like memory;
# to account for these differences you can create and edit the file
# conf/queue.conf to match your queue's configuration. Search for
# conf/queue.conf in http://kaldi-asr.org/doc/queue.html for more information,
# or search for the string 'default_config' in utils/queue.pl or utils/slurm.pl.

export train_cmd="queue.pl --mem 2G"
export decode_cmd="queue.pl --mem 4G"
39 changes: 39 additions & 0 deletions egs/aidatatang_200zh/s5/conf/cmu2pinyin
Original file line number Diff line number Diff line change
@@ -0,0 +1,39 @@
AA A
AE A
AH A
AO UO
AW U
AY AI
B B
CH CH
D D
DH S I
EH AI
ER E
EY AI
F F
G G
HH H
IH I
IY I
JH ZH
K K
L L
M M
N N
NG N
OW UO
OY UO
P P
R R
S S
SH SH
T T
TH S
UH U
UW U
V W
W W
Y Y
Z Z
ZH X
5 changes: 5 additions & 0 deletions egs/aidatatang_200zh/s5/conf/decode.config
Original file line number Diff line number Diff line change
@@ -0,0 +1,5 @@
beam=11.0 # beam for decoding. Was 13.0 in the scripts.
first_beam=8.0 # beam for 1st-pass decoding in SAT.



2 changes: 2 additions & 0 deletions egs/aidatatang_200zh/s5/conf/mfcc.conf
Original file line number Diff line number Diff line change
@@ -0,0 +1,2 @@
--use-energy=false # only non-default option.
--sample-frequency=16000
10 changes: 10 additions & 0 deletions egs/aidatatang_200zh/s5/conf/mfcc_hires.conf
Original file line number Diff line number Diff line change
@@ -0,0 +1,10 @@
# config for high-resolution MFCC features, intended for neural network training.
# Note: we keep all cepstra, so it has the same info as filterbank features,
# but MFCC is more easily compressible (because less correlated) which is why
# we prefer this method.
--use-energy=false # use average of log energy, not energy.
--sample-frequency=16000 # Switchboard is sampled at 8kHz
--num-mel-bins=40 # similar to Google's setup.
--num-ceps=40 # there is no dimensionality reduction.
--low-freq=40 # low cutoff frequency for mel bins
--high-freq=-200 # high cutoff frequently, relative to Nyquist of 8000 (=3800)
1 change: 1 addition & 0 deletions egs/aidatatang_200zh/s5/conf/online_cmvn.conf
Original file line number Diff line number Diff line change
@@ -0,0 +1 @@
# configuration file for apply-cmvn-online, used when invoking online2-wav-nnet3-latgen-faster.
4 changes: 4 additions & 0 deletions egs/aidatatang_200zh/s5/conf/online_pitch.conf
Original file line number Diff line number Diff line change
@@ -0,0 +1,4 @@
--sample-frequency=16000
--simulate-first-pass-online=true
--normalization-right-context=25
--frames-per-chunk=10
58 changes: 58 additions & 0 deletions egs/aidatatang_200zh/s5/conf/pinyin2cmu
Original file line number Diff line number Diff line change
@@ -0,0 +1,58 @@
A AA
AI AY
AN AE N
ANG AE NG
AO AW
B B
CH CH
C T S
D D
E ER
EI EY
EN AH N
ENG AH NG
ER AA R
F F
G G
H HH
IA IY AA
IANG IY AE NG
IAN IY AE N
IAO IY AW
IE IY EH
I IY
ING IY NG
IN IY N
IONG IY UH NG
IU IY UH
J J
K K
L L
M M
N N
O AO
ONG UH NG
OU OW
P P
Q Q
R R
SH SH
S S
T T
UAI UW AY
UANG UW AE NG
UAN UW AE N
UA UW AA
UI UW IY
UN UW AH N
UO UW AO
U UW
UE IY EH
VE IY EH
V IY UW
VN IY N
W W
X X
Y Y
ZH JH
Z Z
23 changes: 23 additions & 0 deletions egs/aidatatang_200zh/s5/conf/pinyin_initial
Original file line number Diff line number Diff line change
@@ -0,0 +1,23 @@
B
C
CH
D
F
G
H
J
K
L
M
N
P
Q
R
S
SH
T
W
X
Y
Z
ZH
1 change: 1 addition & 0 deletions egs/aidatatang_200zh/s5/conf/pitch.conf
Original file line number Diff line number Diff line change
@@ -0,0 +1 @@
--sample-frequency=16000
Loading