Skip to content

Conversation

@Gastron
Copy link
Collaborator

@Gastron Gastron commented Oct 10, 2025

Besides the data format documentation, now this also fixes the github action checks that lead to a segmentation fault. Because of lower prioritization and inability to reproduce the segmentation fault elsewhere, I've simply switched out the tiny-random-MistralForCausalLM that lead to the issue.

@Gastron Gastron requested a review from emeirola October 10, 2025 13:56
@emeirola
Copy link
Contributor

The indentation here was intentional, to make it interpreted as a code block in markdown.

@Gastron Gastron merged commit 1ce7866 into main Oct 15, 2025
2 checks passed
Gastron added a commit that referenced this pull request Oct 20, 2025
* Fix the data format documentation

* Try to avoid segmentation fault on GHA

* Try to avoid segmentation fault on GHA - trigger fix

* Use gemma2 instead of mistral

* gemma2 loss margins lower

* gemma2 loss margins even lower

* Add the indentation back
Gastron added a commit that referenced this pull request Oct 21, 2025
* Make SFT and DPO config documentation

* Fix the data format documentation (#13)

* Fix the data format documentation

* Try to avoid segmentation fault on GHA

* Try to avoid segmentation fault on GHA - trigger fix

* Use gemma2 instead of mistral

* gemma2 loss margins lower

* gemma2 loss margins even lower

* Add the indentation back

* Generate with DPO-specific text, handle versions in links

* Fix typo

---------

Co-authored-by: Emil Eirola <emil.eirola@amd.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants