Skip to content
View kxfan2002's full-sized avatar

Block or report kxfan2002

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Popular repositories Loading

  1. SophiaVL-R1 SophiaVL-R1 Public

    SophiaVL-R1: Reinforcing MLLMs Reasoning with Thinking Reward

    Python 91 3

  2. Reagent Reagent Public

    Agent-RRM: Exploring Reasoning Reward Model for Agents

    Python 42 4

  3. R1-Collection R1-Collection Public

    A collection of R1-based repos.

    2

  4. examPapers examPapers Public

  5. kxfan.github.io kxfan.github.io Public template

    Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes

    JavaScript

  6. EasyR1 EasyR1 Public

    Forked from hiyouga/EasyR1

    EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

    Python