LLM Serving on Heterogeneous Spot Cluster

Currently used version of vllm : v0.8.1

Install build essential & cmake

sudo apt update
sudo apt install -y build-essential
sudo apt install -y cmake

gcc --version   # gcc (Ubuntu 13.3.0-6ubuntu2~24.04) 13.3.0
cmake --version # cmake version 3.28.3

Install cuda toolkit (12.8) & GPU driver (570)

You can check comparability from https://docs.nvidia.com/cuda/cuda-toolkit-release-notes/index.html

wget https://developer.download.nvidia.com/compute/cuda/repos/ubuntu2404/x86_64/cuda-keyring_1.1-1_all.deb
sudo dpkg -i cuda-keyring_1.1-1_all.deb
sudo apt-get update
sudo apt-get -y install cuda-toolkit-12-8

sudo apt-get install -y nvidia-open-570

echo 'export PATH=/usr/local/cuda/bin${PATH:+:${PATH}}' >> ~/.bashrc
echo 'export LD_LIBRARY_PATH=/usr/local/cuda/lib64${LD_LIBRARY_PATH:+:${LD_LIBRARY_PATH}}' >> ~/.bashrc
sudo reboot

nvcc --version
nvidia-smi --version

Install nccl (https://developer.nvidia.com/nccl/nccl-download)

keyring 은 위에서 이미 받았으므로 과정에서 제외

sudo apt update
sudo apt install libnccl2=2.26.2-1+cuda12.8 libnccl-dev=2.26.2-1+cuda12.8

I installed miniconda from : https://www.anaconda.com/docs/getting-started/miniconda/install#macos-linux-installation

conda create -n vllm-example python=3.12 -y && conda activate vllm-example

Install vLLM

cd HeteroSpotLLMServe
git submodule update --init --recursive
cd submodules/vLLM
VLLM_USE_PRECOMPILED=1 pip install --editable .

I am using vscode. If your vscode can't detect vllm package, add "python.analysis.extraPaths": ["./submodules/vllm"] to settings.json for debugging.

I don't use V1 Engine. So I commanded export VLLM_USE_V1=0

Name		Name	Last commit message	Last commit date
Latest commit History 195 Commits
Estimator		Estimator
GlobalServer		GlobalServer
IaC		IaC
InferenceServer		InferenceServer
TensorStore		TensorStore
benchmark		benchmark
profiling		profiling
submodules		submodules
.gitignore		.gitignore
.gitmodules		.gitmodules
README.md		README.md
protocols.py		protocols.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

LLM Serving on Heterogeneous Spot Cluster

Install build essential & cmake

Install cuda toolkit (12.8) & GPU driver (570)

Install nccl (https://developer.nvidia.com/nccl/nccl-download)

Install vLLM

About

Uh oh!

Releases

Packages

Contributors 2

Uh oh!

Languages

ddps-lab/HeteroSpotLLMServe

Folders and files

Latest commit

History

Repository files navigation

LLM Serving on Heterogeneous Spot Cluster

Install build essential & cmake

Install cuda toolkit (12.8) & GPU driver (570)

Install nccl (https://developer.nvidia.com/nccl/nccl-download)

Install vLLM

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Uh oh!

Languages

Packages