[new recipe] Add Cosyvoice TTS GRPO training recipe based on veRL. #2615

yuekaizhang · 2025-07-18T05:55:39Z

This PR introduces a recipe that uses veRL to run RL training experiments on CosyVoice2 LLM models.

Specifically, we ran GRPO experiments and obtained the following results:

| Model | Seed-TTS `test_zh` CER (%) ⬇️ | Cosyvoice3 `zero_shot_zh` CER (%) ⬇️ | Comment |
|---|---|---|---|
| SFT (initialized from Qwen2-0.5B-Instruct) | 1.81 | 4.83 | See PR #1887 |
| GRPO (this project, trained on AIShell-3) | 1.06 | 4.03 | See here |
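
For readers unfamiliar with GRPO, its core idea is to score each sampled output relative to the other samples drawn for the same prompt. The sketch below illustrates that group-relative normalization only. It is not taken from this recipe or from veRL: the function name `group_relative_advantages`, the `eps` value, and the use of the population standard deviation are all assumptions made for illustration.

```python
from statistics import mean, pstdev
from typing import List, Sequence


def group_relative_advantages(rewards: Sequence[float], eps: float = 1e-6) -> List[float]:
    """Normalize rewards within one group of samples for the same prompt.

    Hypothetical helper: subtracts the group mean and divides by the group
    standard deviation (population std is an assumption of this sketch).
    """
    mu = mean(rewards)
    sigma = pstdev(rewards)
    # eps keeps the division finite when every sample gets the same reward.
    return [(r - mu) / (sigma + eps) for r in rewards]


# Four hypothetical rewards for four samples of the same text prompt.
advantages = group_relative_advantages([0.9, 0.5, 0.7, 0.3])
```

Samples with above-average reward in their group get a positive advantage and the rest get a negative one, so no separate value model is needed.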

Features:

- Uses a PyTriton-based SenseVoice ASR server for fast reward calculation.
- Uses phoneme error rate (PER) as the reward metric.
- Supports both the pretrained CosyVoice2 LLM and custom SFT versions of CosyVoice2.
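
As a rough illustration of a PER-based reward, the sketch below computes the edit distance between a reference phoneme sequence and the phoneme sequence of an ASR transcript, then maps PER to a reward. It is a minimal sketch and not the recipe's actual code: the function names, the phoneme tokens, and the `1 - min(PER, 1)` mapping are assumptions, and the text-to-phoneme conversion and the ASR call are left out.

```python
from typing import Sequence


def edit_distance(ref: Sequence[str], hyp: Sequence[str]) -> int:
    """Levenshtein distance between two token sequences."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(
                min(
                    prev[j] + 1,         # drop a reference token
                    curr[j - 1] + 1,     # insert a hypothesis token
                    prev[j - 1] + cost,  # match or substitute
                )
            )
        prev = curr
    return prev[-1]


def phoneme_error_rate(ref: Sequence[str], hyp: Sequence[str]) -> float:
    """PER = edit distance divided by the reference length."""
    if not ref:
        return 0.0 if not hyp else 1.0
    return edit_distance(ref, hyp) / len(ref)


def per_reward(ref: Sequence[str], hyp: Sequence[str]) -> float:
    """Assumed mapping for this sketch: clamp PER to [0, 1], then invert."""
    return 1.0 - min(phoneme_error_rate(ref, hyp), 1.0)


# Hypothetical phoneme tokens; one tone error out of four phonemes.
reward = per_reward(["n", "i3", "h", "ao3"], ["n", "i3", "h", "ao4"])
```

Scoring at the phoneme level means homophones that the ASR transcribes with different characters are not penalized, which is one plausible reason to prefer PER over CER as a reward.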

gemini-code-assist

Code Review

This pull request adds a new project, CosyVoice-TTS-GRPO, to the list of projects using verl in the main README.md. The change is straightforward, but I've pointed out a minor formatting inconsistency that should be addressed to maintain consistency with the rest of the document.

README.md

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>

eric-haibin-lin · 2025-07-24T21:09:39Z

nice! do you plan to contribute the full recipe, or just the readme? BTW please resolve conflict with main branch. thanks!

yuekaizhang · 2025-07-25T01:53:23Z

> nice! do you plan to contribute the full recipe, or just the readme? BTW please resolve conflict with main branch. thanks!

I'd love to contribute the recipe here. Let me update the recipe into this PR.

CLAassistant · 2025-07-25T03:37:09Z

All committers have signed the CLA.

yuekaizhang · 2025-07-25T03:40:32Z

@eric-haibin-lin Updated. Would you mind checking it again? Many thanks.

update readme

bf71d0a

gemini-code-assist bot reviewed Jul 18, 2025


Update README.md

dea6a72

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>

yuekaizhang added 2 commits July 25, 2025 09:55

Merge branch 'main' into tts

c382b31

update recipe

0c27fe3

yuekaizhang changed the title ~~[doc] Add Cosyvoice TTS GRPO training project based on veRL.~~ [new recipe] Add Cosyvoice TTS GRPO training recipe based on veRL. Jul 25, 2025
