-
Notifications
You must be signed in to change notification settings - Fork 0
Readme #15
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
Conversation
Added variables from run_experiments.sh and eval.sh, i added short descriptions for the ones I know
README.md
Outdated
| - all_tasks | ||
| - list of tasks to be evaluated | ||
| - models | ||
| - Models being used to evaluate results |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This actually describes the eval model
README.md
Outdated
| - | ||
| - all_tasks | ||
| - list of tasks to be evaluated | ||
| - models |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The models that we are evaluating on above tasks
README.md
Outdated
| - log_dir | ||
| - directory that the llm placed the experiment logs | ||
| - json_folder | ||
| - |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The path in which the evaluation results will be placed
README.md
Outdated
| - edit_script_model | ||
|
|
||
| - fast_llm |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Using a different model to do smaller tasks as in editing scripts and understanding file contents rather than using the same main agent model is optional.
For our experiment we are using the same model to do all the work
README.md
Outdated
| How well can an LLM agent improve the training script to achieve high fairness metrics. | ||
|
|
||
| ## Fairness Metrics: | ||
|
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@surbhir08 could you please just put the names of the metrics here?
No description provided.