Skip to content

Commit 43ed0a7

Browse files
committed
Merge branch 'dev' of github.com:modelscope/twinkle into dev
2 parents 80fa6de + af9ee69 commit 43ed0a7

File tree

1 file changed

+3
-3
lines changed

1 file changed

+3
-3
lines changed

src/twinkle/sampler/vllm_sampler/vllm_engine.py

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -47,12 +47,12 @@ def __init__(
4747
model_id: str,
4848
*,
4949
tensor_parallel_size: int = 1,
50-
gpu_memory_utilization: float = 0.9,
50+
gpu_memory_utilization: float = 0.7,
5151
max_model_len: Optional[int] = None,
5252
max_num_seqs: int = 256,
5353
enable_lora: bool = True,
54-
max_loras: int = 64,
55-
max_lora_rank: int = 64,
54+
max_loras: int = 5,
55+
max_lora_rank: int = 32,
5656
enable_sleep_mode: bool = False,
5757
enable_prefix_caching: bool = False,
5858
enforce_eager: bool = False,

0 commit comments

Comments
 (0)