[Question] Guidance on integrating MiMo-V2-Flash with VeRL (v0.6.1/v0.7.0): Should I use the mbridge/mcore path? #69

@Eisenhower

Description

I am currently using verl versions 0.6.1 and 0.7.0. My goal is to integrate the MiMo-V2-Flash model for RL training.

Questions

Should I use the mbridge (Megatron-Core integration) scheme for this model?

Do you have any specific suggestions or best practices for integrating a model with this architecture?

Context

VeRL Version: 0.6.1 / 0.7.0

Target Model: MiMo-V2-Flash

Current Plan: Exploring the mbridge / Megatron-Core path.

Any advice would be appreciated. Thanks!
@ISEEKYAN
