Update main #4

Gastron · 2025-07-15T13:05:32Z

No description provided.

…oad for inference on Llama 3.1 8B and 70B * added the newest version on llm-inference-megatron-lm combined * Update workloads/llm-inference-megatron-lm/helm/mount/run_megatron.sh * precommit fixed * Resolve the last comments in the code * resolving and testing tokenizer path * adding the kaiwo comments * precommits made --------- Co-authored-by: aivanni <4340981+aivanni@users.noreply.github.com>

* Initial commit, skeleton, data prep Signed-off-by: Robert Talling <rtalling@amd.com> * Add Megatron checkpoint conversion details to the tutorial; Add override file for llama-3.1-8B in workloads/download-huggingface-model-to-bucket Signed-off-by: Robert Talling <rtalling@amd.com> * Rename tutorial and fix Signed-off-by: Robert Talling <rtalling@amd.com> * Fix line splits Signed-off-by: Robert Talling <rtalling@amd.com> * fixes and llama 70 overrides for model delivery Signed-off-by: Robert Talling <rtalling@amd.com> * Update readme Signed-off-by: Robert Talling <rtalling@amd.com> * Add multinode instructions + values Signed-off-by: Robert Talling <rtalling@amd.com> * Refine inference workload instructions for Llama-3.1-8B model and update k9s commands Signed-off-by: Robert Talling <rtalling@amd.com> * places were Values.yaml were adjusted for testing were reverted Signed-off-by: Robert Talling <rtalling@amd.com> * Update workloads/download-huggingface-model-to-bucket/helm/values.yaml Signed-off-by: Robert Talling <rtalling@amd.com> * Update docs/tutorials/tutorial-03-deliver-resources-and-run-megatron-cpt.md Signed-off-by: Robert Talling <rtalling@amd.com> * Add 16ddp template for multinode, remove explicit namespace mentions Signed-off-by: Robert Talling <rtalling@amd.com> * Update tutorial template for llama 8b inference Signed-off-by: Robert Talling <rtalling@amd.com> * Fix pre-commit errors Signed-off-by: Robert Talling <rtalling@amd.com> * Remove unused overrides and update readme Signed-off-by: Robert Talling <rtalling@amd.com> * Update readme Signed-off-by: Robert Talling <rtalling@amd.com> * Replace set up section with tutorial-0-prerequisites. * correction the helm template path for llm-inference-megatron-lm * resolved override path inside helm, and update README * Update docs/tutorials/tutorial-03-deliver-resources-and-run-megatron-cpt.md --------- Signed-off-by: Robert Talling <rtalling@amd.com> Co-authored-by: Saroosh Shabbir <saroosh.shabbir@amd.com> Co-authored-by: Robert Talling <rtalling@amd.com> Co-authored-by: eliecer diaz <eliecerecology@gmail.com>

* jupyterlab: improve documentation with finding correct url and reminding of namespace * jupyterlab docs: fix namespace

* vllm0.9 best-known config update * update api benchmarking scripts to match vllm version * Add benchmark configuration options for input/output lengths and QPS * Update container environment variable handling to properly support secret references * Remove deprecated image references from model configuration files in the LLM inference Helm overrides for DeepSeek, Google Gemma, Meta Llama, Mistral, and Qwen models. * revert NUMA config for it is read-only * Remove extra sleep --------- Co-authored-by: Aku Rouhe <akurouhe@gmail.com>

Brednas

Approving

eliecerecology and others added 3 commits July 10, 2025 18:01

Jupyterlab Documentation (#377)

9b1e6fd

* jupyterlab: improve documentation with finding correct url and reminding of namespace * jupyterlab docs: fix namespace

Gastron requested review from Brednas and aivanni July 15, 2025 13:10

alexander-aurell-amd self-requested a review July 15, 2025 13:12

alexander-aurell-amd previously approved these changes Jul 15, 2025

View reviewed changes

Gastron dismissed alexander-aurell-amd’s stale review via 5110bc8 July 15, 2025 13:36

Gastron requested a review from alexander-aurell-amd July 15, 2025 13:36

aivanni approved these changes Jul 15, 2025

View reviewed changes

Brednas approved these changes Jul 15, 2025

View reviewed changes

Gastron merged commit 7844f4d into main Jul 15, 2025
1 check passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Update main #4

Update main #4

Uh oh!

Gastron commented Jul 15, 2025

Uh oh!

Brednas left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

7 participants

Update main #4

Update main #4

Uh oh!

Conversation

Gastron commented Jul 15, 2025

Uh oh!

Brednas left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

7 participants