Update paper URL #15

Merged
andresgr96 merged 3 commits into main from development
Aug 14, 2025

@andresgr96
Owner

  • updated paper link and authorship

  • fixed toml error

* Added LICENSE file
* start implementation of rlhf inspired dataset generation methods

* started implementing the human in the loop dataset generation using langgraph

* small change

* added the tree validator node

* added simulator node

* added ui for simulation with metrics by the side

* datapoint saving works, need to save in proper json format

* dataset now saves as json, main datapoint creation loop works, onto fail cases and feedback

* added UI to saver node to decide to continue generating or exit generation

* added human feedback node, now users can give feedback on previous attempt

* added proper class-dependent system prompt builder

* start router node

* added failure message for trees not passing validator

* added proper dataset size tracking

* started unified implementation with persistent ui

* added functionality to ui to explore current dataset

* added functionality for displaying agent capabilities

* added agent class as param and improved node listing visibility

* clean up, started checking code tools

* parameterized environment and simulation config for generalization

* separated args for environment and sim config for clarity

* fixed leftover mypy errors for agent control package

* fixed ui workflow

* fixed all pyflakes errors

* fixed pydocstyle errors

* cleaned up code and fixed most errors, will build example

* first version of polished rlhf dataset gen
* updated paper link and authorship

* fixed toml error
andresgr96 merged commit d2b5d6b into main on Aug 14, 2025
12 checks passed