Skip to content

Conversation

@Gastron
Copy link
Collaborator

@Gastron Gastron commented Jul 8, 2025

No description provided.

emeirola and others added 12 commits June 26, 2025 13:12
* Add model and data loading from minio

* Add deepspeed config example

* Validate starting of sync process, escape and quote path argument

* Wait 1s before checking if sync process started

* Fix for quotes in checkpointsRemote

* Update readme and other edits for clarity

* Update workloads/llm-finetune-llama-factory/helm/README.md

Co-authored-by: Aku Rouhe <akurouhe@amd.com>

---------

Co-authored-by: Aku Rouhe <akurouhe@amd.com>
* veRL GRPO finetuning ROCm example workload

* Refactor and complete VeRL workload

* Fix comments and typos

---------

Co-authored-by: Emil Eirola <emil.eirola@amd.com>
* Add basic MLFlow export

* Upgrade ROCm image.

* Fix nested folders for artifacts on MLFlow EVEN BETTER!

* Fix extra f-string

---------

Co-authored-by: Sander Bijl de Vroe <Sander.BijldeVroe@amd.com>
…rmonise model names. Harmonise arg parser. (#354)

* Fix erroneously removed LLM client URL prefix.
* Quote paths and escape chars in mc mirror

* Fix handling of minio paths

* Fix handling of quotes in echo statements
* Add on-boarding documentation for pre-commit

* clarify cd in docs and fix <br />

* small edit
* WandB downloader

* Make it work

* Correct override name, always mount

* No ephemeral storage, just emptyDir
@Gastron Gastron requested a review from Brednas July 8, 2025 10:22
@Gastron Gastron merged commit a77b46c into main Jul 8, 2025
2 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

7 participants