Skip to content

Commit f47dbc7

Browse files
committed
Merge remote-tracking branch 'origin/dev' into kernels_unittest_fix_ljl
2 parents e344ba2 + 3ed9edd commit f47dbc7

File tree

5 files changed

+5
-8
lines changed

5 files changed

+5
-8
lines changed

cookbook/client/tinker/megatron/server_config.yaml

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -48,6 +48,7 @@ applications:
4848
max_model_len: 16000 # Maximum sequence length the engine supports
4949
gpu_memory_utilization: 0.85 # Fraction of GPU memory to use (0.0-1.0)
5050
enable_lora: true # Allow loading LoRA adapters during inference
51+
max_loras: 5 # Max allowed loras working on vLLM at the same time
5152
device_group: # Logical device group for the sampler
5253
name: sampler
5354
gpus_per_worker: 1

tests/sampler/test_30b_weight_sync.py

Lines changed: 1 addition & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -17,11 +17,10 @@
1717
import argparse
1818
import datetime
1919
import os
20+
import pytest
2021
import sys
2122
import time
2223

23-
import pytest
24-
2524
os.environ['VLLM_WORKER_MULTIPROC_METHOD'] = 'spawn'
2625
os.environ['VLLM_LOGGING_LEVEL'] = 'WARNING'
2726
os.environ['NCCL_CUMEM_ENABLE'] = '0'

tests/sampler/test_megatron_weight_sync.py

Lines changed: 1 addition & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -30,11 +30,10 @@
3030
import argparse
3131
import logging
3232
import os
33+
import pytest
3334
import sys
3435
import time
3536

36-
import pytest
37-
3837
# Must set before importing anything
3938
os.environ['VLLM_WORKER_MULTIPROC_METHOD'] = 'spawn'
4039
os.environ['VLLM_LOGGING_LEVEL'] = 'WARNING'

tests/sampler/test_sampler_e2e.py

Lines changed: 1 addition & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -20,11 +20,10 @@
2020

2121
import argparse
2222
import os
23+
import pytest
2324
import sys
2425
import traceback
2526

26-
import pytest
27-
2827
# Set environment variables before imports
2928
os.environ.setdefault('TRUST_REMOTE_CODE', '1')
3029

tests/sampler/test_weight_sync.py

Lines changed: 1 addition & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -26,11 +26,10 @@
2626
import argparse
2727
import logging
2828
import os
29+
import pytest
2930
import sys
3031
import time
3132

32-
import pytest
33-
3433
# Must set before importing anything
3534
os.environ['VLLM_WORKER_MULTIPROC_METHOD'] = 'spawn'
3635
os.environ['VLLM_LOGGING_LEVEL'] = 'WARNING'

0 commit comments

Comments
 (0)