GitHub - MisterBrookT/Scorpio: SCORPIO is a system-algorithm co-designed LLM serving engine that prioritizes heterogeneous Service Level Objectives (SLOs) like TTFT and TPOT across all scheduling stages.

SLO-Oriented LLM Serving for Heterogeneous Workloads

🔥 What's SCORPIO?

SCORPIO is a system-algorithm co-designed LLM serving engine that prioritizes heterogeneous Service Level Objectives (SLOs) like TTFT and TPOT across all scheduling stages. It improves both goodput and SLO adherence through adaptive queueing, batching, and rejection mechanisms.

✨ Key Features

🕒 TTFT Guard: Least-Deadline-First (LDF) scheduling and rejection of unattainable requests.
⚖️ TPOT Guard: VBS-based admission + credit-based batching for fine-grained control.
🔮 Lightweight Predictor: Sequence length prediction with calibrated bucketing.
🚀 Built on vLLM: Extends vLLM with SLO-oriented scheduling logic.
📊 Up to 14.4× Goodput and 46.5% SLO Improvement vs state-of-the-art.

🛠️ Installation

Create the environment and install the SCORPIO engine:

conda create -n scorpio python=3.12
conda activate scorpio

export VLLM_COMMIT=635b897246da121238454ed4b2bbc87cb4d4166b
export VLLM_PRECOMPILED_WHEEL_LOCATION=https://wheels.vllm.ai/${VLLM_COMMIT}/vllm-1.0.0.dev-cp38-abi3-manylinux1_x86_64.whl

pip install --editable .

📥 Download Datasets and Models

Datasets

mkdir datasets && cd datasets
huggingface-cli download --repo-type dataset --resume-download Brookseeworld/Scropio-dataset --local-dir .

Models

mkdir MODELS && cd MODELS
huggingface-cli download --resume-download Brookseeworld/Scropio-seq-len-predictor --local-dir .

⚙️ Quickstart

Note: Ensure all paths and configurations are correct before launching.

1. Launch Sequence Length Predictor

conda activate scorpio
python benchmarks/script/entry_predict.py --dataset sharegpt --model 8b

2. Start the Inference Engine (SCORPIO)

conda activate scorpio
python benchmarks/script/entry_serving.py --config benchmarks/config/llama8b-sharegpt/minitest.json

🧠 Citation

If you use SCORPIO, please cite us:

@article{tang2025scorpio,
  title={SCORPIO: Serving the Right Requests at the Right Time for Heterogeneous SLOs in LLM Inference},
  author={Tang, Yinghao and Lan, Tingfeng and Huang, Xiuqi and Lu, Hui and Chen, Wei},
  journal={arXiv preprint arXiv:2505.23022},
  year={2025}
}

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
.buildkite		.buildkite
benchmarks		benchmarks
cmake		cmake
csrc		csrc
draw		draw
examples		examples
misc		misc
predictor		predictor
script		script
tests		tests
tools		tools
vllm		vllm
.clang-format		.clang-format
.dockerignore		.dockerignore
.gitignore		.gitignore
.readthedocs.yaml		.readthedocs.yaml
.shellcheckrc		.shellcheckrc
.yapfignore		.yapfignore
CMakeLists.txt		CMakeLists.txt
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
DCO		DCO
Dockerfile		Dockerfile
Dockerfile.ppc64le		Dockerfile.ppc64le
LICENSE		LICENSE
MANIFEST.in		MANIFEST.in
README.md		README.md
SECURITY.md		SECURITY.md
__init__.py		__init__.py
collect_env.py		collect_env.py
find_cuda_init.py		find_cuda_init.py
format.sh		format.sh
pyproject.toml		pyproject.toml
python_only_dev.py		python_only_dev.py
requirements-build.txt		requirements-build.txt
requirements-common.txt		requirements-common.txt
requirements-cuda.txt		requirements-cuda.txt
requirements-dev.txt		requirements-dev.txt
requirements-lint.txt		requirements-lint.txt
requirements-neuron.txt		requirements-neuron.txt
requirements-openvino.txt		requirements-openvino.txt
requirements-rocm.txt		requirements-rocm.txt
requirements-test.in		requirements-test.in
requirements-test.txt		requirements-test.txt
setup.py		setup.py
use_existing_torch.py		use_existing_torch.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

SLO-Oriented LLM Serving for Heterogeneous Workloads

🔥 What's SCORPIO?

✨ Key Features

🛠️ Installation

📥 Download Datasets and Models

Datasets

Models

⚙️ Quickstart

1. Launch Sequence Length Predictor

2. Start the Inference Engine (SCORPIO)

🧠 Citation

About

Uh oh!

Releases

Packages

Contributors 2

Uh oh!

Languages

License

MisterBrookT/Scorpio

Folders and files

Latest commit

History

Repository files navigation

SLO-Oriented LLM Serving for Heterogeneous Workloads

🔥 What's SCORPIO?

✨ Key Features

🛠️ Installation

📥 Download Datasets and Models

Datasets

Models

⚙️ Quickstart

1. Launch Sequence Length Predictor

2. Start the Inference Engine (SCORPIO)

🧠 Citation

About

Topics

Resources

License

Code of conduct

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Uh oh!

Languages

Packages