Skip to content

Commit f8ae876

Browse files
committed
Merge branch 'dev' of github.com:modelscope/twinkle into dev
2 parents 5890afa + 46cd9ad commit f8ae876

File tree

36 files changed

+930
-997
lines changed

36 files changed

+930
-997
lines changed

README.md

Lines changed: 154 additions & 102 deletions
Large diffs are not rendered by default.

cookbook/client/tinker/megatron/server.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -9,7 +9,7 @@
99
import os
1010

1111
# Enable Ray debug mode for verbose logging during development
12-
os.environ['RAY_DEBUG'] = '1'
12+
os.environ['TWINKLE_TRUST_REMOTE_CODE'] = '1'
1313

1414
from twinkle.server import launch_server
1515

cookbook/client/tinker/transformer/grpo.py

Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -34,14 +34,14 @@
3434
logger = get_logger()
3535

3636
# ========== Configuration ==========
37-
BASE_MODEL = 'Qwen/Qwen2.5-0.5B-Instruct'
37+
BASE_MODEL = 'Qwen/Qwen2.5-3B-Instruct'
3838
NUM_GENERATIONS = 4
3939
MAX_NEW_TOKENS = 1024
4040
LEARNING_RATE = 1e-5
41-
MAX_STEPS = 10
42-
BATCH_SIZE = 1
41+
MAX_STEPS = 100
42+
BATCH_SIZE = 2
4343
TEMPERATURE = 1.0
44-
SYNC_INTERVAL = 5 # Save weights for sampler every N steps
44+
SYNC_INTERVAL = 2 # Save weights for sampler every N steps
4545
LORA_RANK = 8
4646

4747

0 commit comments

Comments
 (0)