support rl vit lora with vLLM by hjh0119 · Pull Request #147 · modelscope/twinkle

hjh0119 · 2026-04-09T09:48:51Z

No description provided.

gemini-code-assist

Code Review

This pull request refactors LoRA configurations across grpo.py, grpo_mm.py, and short_math_grpo.py to better handle text-only and multimodal training scenarios, including enabling tower_connector_lora for multimodal setups. It also includes a minor logical reordering in megatron.py for checking model_keys. The review feedback suggests improving the conciseness and PEP 8 compliance of an inline comment in grpo_mm.py.

cookbook/rl/grpo_mm.py

hjh0119 added 3 commits April 9, 2026 17:39

lint

7689583

Merge branch 'main' into vit-lora

9b613a0

more comment

b873e2f

gemini-code-assist bot reviewed Apr 9, 2026

View reviewed changes

cookbook/rl/grpo_mm.py Show resolved Hide resolved

tastelikefeet approved these changes Apr 9, 2026

View reviewed changes

tastelikefeet merged commit 9b4d0f0 into modelscope:main Apr 9, 2026
1 of 3 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

support rl vit lora with vLLM#147

support rl vit lora with vLLM#147
tastelikefeet merged 3 commits intomodelscope:mainfrom
hjh0119:vit-lora

hjh0119 commented Apr 9, 2026

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

hjh0119 commented Apr 9, 2026

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants