Skip to content

Commit 2c3a20e

Browse files
authored
stand-alone rl (#42)
1 parent 2af73f2 commit 2c3a20e

26 files changed

Lines changed: 3871 additions & 1322 deletions

cookbook/legacy/grpo/lora.py

Lines changed: 196 additions & 417 deletions
Large diffs are not rendered by default.

cookbook/legacy/grpo/lora_npu.py

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -315,6 +315,7 @@ def create_dataset():
315315

316316

317317
def train():
318+
raise NotImplementedError("Not implemented")
318319
nproc_per_node, actor_ranks, ref_ranks = parse_device_config()
319320

320321
device_groups = create_device_groups(actor_ranks, ref_ranks)

0 commit comments

Comments
 (0)