Conversation

@csukuangfj
Collaborator

TODOs

  • Add tests and documentation to transformer.py and conformer.py; fix their style issues.

@danpovey
Collaborator

That was fast! Thanks!

)

# TODO: Use eos_id as ignore_id.
# tgt_key_padding_mask = decoder_padding_mask(ys_in_pad, ignore_id=eos_id)
@csukuangfj
Collaborator Author

It is commented out because the existing models were trained with it disabled; if it is enabled, the WER becomes worse. We should enable it when we train a new model.
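
For reference, a minimal sketch of what a helper like decoder_padding_mask presumably does (the actual implementation in transformer.py may differ in details): it marks every position equal to ignore_id as padding, producing a bool mask suitable for use as tgt_key_padding_mask in the decoder.

```python
import torch


def decoder_padding_mask(ys_in_pad: torch.Tensor, ignore_id: int = -1) -> torch.Tensor:
    """Return a bool mask that is True at padded positions.

    Sketch only; the real helper in transformer.py may differ.

    Args:
      ys_in_pad: Padded decoder inputs of shape (batch, max_len).
      ignore_id: Token id that marks padding positions.
    """
    return ys_in_pad == ignore_id


# Hypothetical usage, mirroring the commented-out line above:
# tgt_key_padding_mask = decoder_padding_mask(ys_in_pad, ignore_id=eos_id)
```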

@csukuangfj changed the title from WIP: Refactoring to Refactoring on Aug 3, 2021
@csukuangfj
Collaborator Author

csukuangfj commented Aug 3, 2021

The following is the WER of the model trained with #3 and decoded with this pull request (with n-gram LM rescoring and attention-decoder rescoring; the model was trained for 26 epochs). Each setting name below encodes one point of a grid search over the two rescoring scales; a sketch of that search is given after the tables.

For test-clean, WER of different settings are:
ngram_lm_scale_0.7_attention_scale_0.6  2.96    best for test-clean
ngram_lm_scale_0.9_attention_scale_0.5  2.96
ngram_lm_scale_0.7_attention_scale_0.5  2.97
ngram_lm_scale_0.7_attention_scale_0.7  2.97
ngram_lm_scale_0.9_attention_scale_0.6  2.97
ngram_lm_scale_0.9_attention_scale_0.7  2.97
ngram_lm_scale_0.9_attention_scale_0.9  2.97
ngram_lm_scale_1.0_attention_scale_0.7  2.97
ngram_lm_scale_1.0_attention_scale_0.9  2.97
ngram_lm_scale_1.0_attention_scale_1.0  2.97
ngram_lm_scale_1.0_attention_scale_1.1  2.97
ngram_lm_scale_1.0_attention_scale_1.2  2.97
ngram_lm_scale_1.0_attention_scale_1.3  2.97
ngram_lm_scale_1.1_attention_scale_0.9  2.97

---

For test-other, WER of different settings are:
ngram_lm_scale_1.0_attention_scale_0.9  6.65    best for test-other
ngram_lm_scale_1.1_attention_scale_1.1  6.65
ngram_lm_scale_0.9_attention_scale_0.7  6.66
ngram_lm_scale_1.0_attention_scale_1.0  6.66
ngram_lm_scale_1.0_attention_scale_1.1  6.66
ngram_lm_scale_0.9_attention_scale_1.0  6.67
ngram_lm_scale_1.0_attention_scale_0.7  6.67
ngram_lm_scale_1.0_attention_scale_1.2  6.67
ngram_lm_scale_1.0_attention_scale_1.3  6.67
ngram_lm_scale_0.9_attention_scale_0.5  6.68
ngram_lm_scale_0.9_attention_scale_0.6  6.68
ngram_lm_scale_0.9_attention_scale_0.9  6.68
ngram_lm_scale_0.9_attention_scale_1.1  6.68
ngram_lm_scale_0.9_attention_scale_1.3  6.68
ngram_lm_scale_0.9_attention_scale_1.5  6.68
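
As mentioned above, each setting name encodes one point of a grid search over the two rescoring scales. A minimal sketch of how such a grid could be enumerated, assuming the total score is formed roughly as am + ngram_lm_scale * lm + attention_scale * attn (the exact combination is defined by the decoding code, not here):

```python
# Hypothetical sketch of the grid search implied by the setting names above.
# Plain floats stand in for per-utterance scores; the real decode.py works on
# k2 lattices and n-best lists.
ngram_lm_scales = [0.5, 0.6, 0.7, 0.9, 1.0, 1.1, 1.2, 1.3, 1.5]
attention_scales = [0.5, 0.6, 0.7, 0.9, 1.0, 1.1, 1.2, 1.3, 1.5]


def total_score(am: float, lm: float, attn: float,
                lm_scale: float, attn_scale: float) -> float:
    # Assumed combination of acoustic, n-gram LM and attention-decoder scores.
    return am + lm_scale * lm + attn_scale * attn


settings = {
    f"ngram_lm_scale_{ls}_attention_scale_{attn}": (ls, attn)
    for ls in ngram_lm_scales
    for attn in attention_scales
}
# Each key is decoded separately and its WER reported, which yields the tables above.
```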

Epochs 14-26 are used in model averaging.
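
Averaging epochs 14 through 26 corresponds to the --avg 13 option used in the decode command below. A minimal sketch of such checkpoint averaging, assuming each epoch-*.pt file stores the model's state_dict under the key "model" (the real icefall helper may differ):

```python
import torch

# Sketch of averaging the parameters of epochs 14..26 (13 checkpoints).
ckpt_paths = [f"conformer_ctc/exp/epoch-{i}.pt" for i in range(14, 27)]

avg = None
for path in ckpt_paths:
    state = torch.load(path, map_location="cpu")["model"]  # assumed checkpoint layout
    if avg is None:
        avg = {k: v.clone().float() for k, v in state.items()}
    else:
        for k, v in state.items():
            avg[k] += v.float()

for k in avg:
    avg[k] /= len(ckpt_paths)

# `avg` can now be loaded with model.load_state_dict(avg).
```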


I have uploaded the above checkpoints to
https://huggingface.co/csukuangfj/conformer_ctc/tree/main

To reproduce the decoding result:

  1. Clone the above repo containing the checkpoints and put it into conformer_ctc/exp/.
  2. After step 1, you should have conformer_ctc/exp/epoch-{14,15,...,26}.pt.
  3. Run
./prepare.sh
./conformer_ctc/decode.py --epoch 26 --avg 13 --max-duration=50
  4. You should get the above results.

The results are expected to improve when the model is trained for more epochs.
I will rerun the training with the bug in k2-fsa/snowfall#242 fixed.

@danpovey
Collaborator

danpovey commented Aug 3, 2021 via email

@pzelasko
Collaborator

pzelasko commented Aug 3, 2021

Nice! I'm curious -- did you ever try to run the same thing but with MMI instead of CTC?

@csukuangfj
Collaborator Author

Nice! I'm curious -- did you ever try to run the same thing but with MMI instead of CTC?

yes, I am planning to do that with a pretrained P. All the related code can be found in snowfall.

@csukuangfj
Collaborator Author

Merging it to avoid conflicts.

@csukuangfj merged commit 5a0b9bc into k2-fsa:master on Aug 4, 2021
@csukuangfj deleted the refactor branch on August 4, 2021 06:53
@wwxm0523 mentioned this pull request on Jan 30, 2022
baileyeet referenced this pull request in reazon-research/icefall Jul 16, 2025
* Fix an error in TDNN-LSTM training.

* WIP: Refactoring

* Refactor transformer.py

* Remove unused code.

* Minor fixes.