Merged
Conversation
Owner
andresgr96
commented
Aug 2, 2025
- Implemented RLHF inspired dataset gen method
- Added UI to control the process
- Added example and modified the README
- Added LICENSE file
* start implementation of rlhf inspired dataset generation methods * started implementing the human in the loop dataset generation using langgraph * small change * added the tree validator node * added simulator node * added ui for simulation with metrics by the side * datapoint saving works, need to save in proper json format * dataset now saves as json, main datapoint creation loop works, onto fail cases and feedback * added UI to saver node to decide to continue generating or exit generation * added human feedback node, now users can give feedback on previus attempt * added proper class dependant system prompt builder * start router node * added failure message for trees not passing validator * added proper dataset size tracking * started unified implementation with persistent ui * added functionality to ui to explore current dataset * added functionality for displaying agent capabilities * added agent class as param and imporved node listing visibility * clean up, started checking code tools * parameterized environment and simulation config for generelization * separated args for environment and sim config for clarity * fixed leftover mypy errors for agent control package * fixed ui workflow * fixed all pyflakes errors * fixed pydocstyle errors * cleaned up code and fixed msot errors, will build example * first version of polished rlhf dataset gen
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.