haic0 · haic0 · Aug 6, 2025 · Aug 7, 2025 · Aug 7, 2025 · Aug 7, 2025
diff --git a/Qwen/AMD/AMD-Qwen3-Next-Usage.md b/Qwen/AMD/AMD-Qwen3-Next-Usage.md
@@ -0,0 +1,30 @@
+#### Step by Step Guide
+Please follow the steps here to install and run Qwen3-Next-80B-A3B-Instruct models on AMD MI300X GPU.
+#### Step 1
+Pull the latest vllm docker:
+```shell
+docker pull rocm/vllm-dev:nightly
+```
+Launch the Rocm-vllm docker: 
+```shell
+docker run -d -it --ipc=host --network=host --privileged --cap-add=CAP_SYS_ADMIN --device=/dev/kfd --device=/dev/dri --device=/dev/mem --group-add video --cap-add=SYS_PTRACE --security-opt seccomp=unconfined -v /:/work -e SHELL=/bin/bash  --name Qwen3-next rocm/vllm-dev:nightly
+```
+#### Step 2
+  Huggingface login
+```shell
+   huggingface-cli login 
+```   
+#### Step 3
+##### FP8
+
+Run the vllm online serving
+Sample Command
+```shell
+VLLM_ALLOW_LONG_MAX_MODEL_LEN=1 vllm serve Qwen/Qwen3-Next-80B-A3B-Instruct --tensor-parallel-size 4 --max-model-len 32768  --no-enable-prefix-caching 
+```
+#### Step 4 
+Open a new terminal, enter into the running docker and run the following benchmark script.
+```shell
+docker exec -it Qwen3-next /bin/bash 
+python3 /vllm-workspace/benchmarks/benchmark_serving.py --model Qwen/Qwen3-Next-80B-A3B-Instruct --dataset-name random --ignore-eos --num-prompts 500  --max-concurrency 128 --random-input-len 3200 --random-output-len 800  --percentile-metrics ttft,tpot,itl,e2el
+```
diff --git a/README.md b/README.md
@@ -18,6 +18,9 @@ This repo intends to host community maintained common recipes to run vLLM answer
 ### Qwen <img src="https://qwenlm.github.io/favicon.png" alt="Qwen" width="16" height="16" style="vertical-align:middle;">
 - [Qwen3-Coder-480B-A35B](Qwen/Qwen3-Coder-480B-A35B.md)
 
+### AMD GPU Support
+For the user guide,kindly review the AMD-GPU repository within the model directory.
+
 ## Contributing
 Please feel free to contribute by adding a new recipe or improving an existing one, just send us a PR!
 
@@ -31,4 +34,4 @@ uv run mkdocs serve
 ```
 
 ## License
-This project is licensed under the Apache License 2.0 - see the [LICENSE](LICENSE) file for details.
+This project is licensed under the Apache License 2.0 - see the [LICENSE](LICENSE) file for details.