
Conversation

@glynpu (Contributor) commented Jun 21, 2021

This PR releases a snowfall-trained model together with the related decoding code.
The WER on test-clean is lower than that of the previously trained snowfall model; a detailed comparison follows:

| avg epoch 26-30 | test-clean (no rescore) | test-other (no rescore) | test-clean (4-gram lattice rescore) | test-other (4-gram lattice rescore) |
| --- | --- | --- | --- | --- |
| before (with LF-MMI loss) | 4.14 | 8.41 | 3.69 | 7.68 |
| current | 3.97 | 9.78 | * | * |
INFO:root:[test-clean] %WER 3.97% [2087 / 52576, 220 ins, 166 del, 1701 sub ]
INFO:root:[test-other] %WER 9.78% [5121 / 52343, 535 ins, 439 del, 4147 sub ]

One thing worth mentioning: the current no-rescore result (3.97 on test-clean) is obtained WITHOUT a 3-gram LM.
The result may improve further by composing the current ctc_topo with a 3-gram FST (I am working on this).
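For reference, a minimal sketch (an assumption, not the code in this PR) of what that composition could look like with k2; `build_ctc_topo`, `max_token_id`, and the path to `G.fst.txt` are placeholders:

```python
import k2

# Load a 3-gram G in OpenFst text format (the path is hypothetical).
with open('data/lang_nosp/G.fst.txt') as f:
    G = k2.Fsa.from_openfst(f.read(), acceptor=False)
G = k2.arc_sort(G)

# CTC topology over the token ids (0 reserved for blank); `build_ctc_topo`
# stands in for whatever helper builds it in snowfall.
ctc_topo = k2.arc_sort(build_ctc_topo(list(range(max_token_id + 1))))

# Compose the topology with the 3-gram LM to obtain the decoding graph.
decoding_graph = k2.connect(k2.compose(ctc_topo, G))
```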

Another baseline for this model is an espnet-released model; a detailed comparison follows.
num_paths = 100 is used when doing n-best rescoring for row 2;
the results in row 2 are obtained with techniques similar to those in #201, by loading the espnet-released model with snowfall code.

| decoding algorithm | training tool | encoder + k2 ctc decode, no rescore | encoder + k2 ctc decode + decoder nbest rescore | encoder + k2 ctc decode + transformer lm nbest rescore | encoder + k2 ctc decode + decoder nbest rescore + transformer lm nbest rescore |
| --- | --- | --- | --- | --- | --- |
| decoder algorithm in espnet | espnet | * | * | * | 2.1% |
| k2 ctc decode in this PR | espnet | 2.97 | 2.64 | 2.43 | 2.35 |
| k2 ctc decode in this PR | snowfall | 3.97 | * | * | * |
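For context, a rough sketch of the n-best sampling step behind the rescoring columns above (a sketch only; `lattice` is assumed to be the k2 FsaVec from first-pass decoding, and the subsequent rescoring with the attention decoder / transformer LM is omitted):

```python
import k2

num_paths = 100

# Sample up to `num_paths` paths per utterance from the first-pass lattice.
# Each sampled path is then rescored (decoder nbest rescore and/or
# transformer LM nbest rescore) and the best-scoring path is kept.
paths = k2.random_paths(lattice, use_double_scores=True, num_paths=num_paths)
```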

Conclusions:

  1. A better snowfall-trained model is obtained before rescoring.
  2. The current training pipeline is still inferior to its espnet counterpart; if that is fixed, the current WER of 3.97% on test-clean should get close to 2.97%
    (the related training code will be submitted later this week; I am making this promise here to force myself to do it quickly).

@danpovey (Contributor)

This was trained on 960 hours, I assume?
I'm surprised that your "k2 ctc decode" number was obtained without the 3-gram LM; I had thought you were using that.
What do you think are the differences between our trained model and the espnet-trained model? Is it possible to compare the diagnostics from training?

@glynpu (Contributor, Author) commented Jun 21, 2021

> This was trained on 960 hours, I assume?

Yes, with the full 960-hour LibriSpeech.

> What do you think are the differences between our trained model and the espnet-trained model?

A known big difference is the learning-rate schedule: espnet uses its warm-up scheduler, while I use the Noam optimizer in snowfall.
With warmup_step = 40000 and model_size = 512, espnet's learning rate at each step is around 10 times that used in this experiment.
So I am going to retrain the model with lr-factor changed from 1.0 to 10.0 after the current experiment finishes.
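For context, a minimal sketch of the Noam schedule being discussed (the standard formula from "Attention Is All You Need"; lr-factor is the knob mentioned above and scales the whole curve linearly):

```python
def noam_lr(step: int, model_size: int = 512, warmup_steps: int = 40000,
            lr_factor: float = 1.0) -> float:
    """Learning rate of the Noam scheduler at a given optimizer step."""
    return (lr_factor
            * model_size ** -0.5
            * min(step ** -0.5, step * warmup_steps ** -1.5))

# With the same model_size and warmup_steps, lr_factor=10.0 yields exactly
# 10x the learning rate of lr_factor=1.0 at every step.
```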

> Is it possible to compare the diagnostics from training?

Yes, I have reproduced the espnet result and obtained its detailed training log, which will be used to diagnose my training process.

@pzelasko (Collaborator)

You might want to check what data augmentation techniques and settings they are using and compare them with our setup. If we’re missing some techniques in Lhotse we can add them.
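For example, a rough sketch of the kind of setup to compare against espnet's recipe (speed perturbation plus SpecAugment); the manifest path and the exact parameter names are assumptions and may differ across Lhotse versions:

```python
from lhotse import combine, load_manifest
from lhotse.dataset import SpecAugment

# Training cuts (the manifest path is hypothetical).
cuts = load_manifest('exp/data/cuts_train-960.json.gz')

# 3-way speed perturbation, as in typical LibriSpeech recipes.
cuts = combine(cuts, cuts.perturb_speed(0.9), cuts.perturb_speed(1.1))

# On-the-fly SpecAugment applied to feature batches during training.
spec_augment = SpecAugment(num_frame_masks=2, num_feature_masks=2)
```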

@danpovey (Contributor)

So I guess this is ready to merge?

@glynpu (Contributor, Author) commented Jun 22, 2021

Maybe @csukuangfj is going to review this afternoon.

ys_out_pad = pad_list(ys_in, -1)

else:
raise VAlueError("Invalid input for decoder self attetion")
Collaborator


VAlueError -> ValueError

Contributor Author


fixed.

@csukuangfj (Collaborator)

+2

@csukuangfj (Collaborator)

Thanks! Merging
