Skip to content

Conversation

@AymanBx
Copy link
Collaborator

@AymanBx AymanBx commented Jul 9, 2025

No description provided.

@AymanBx AymanBx requested review from brownsarahm and lanij21 July 9, 2025 16:49
@AymanBx AymanBx changed the title Initial draft for readme.md Readme Jul 9, 2025
Added variables from run_experiments.sh and eval.sh, i added short descriptions for the ones I know
README.md Outdated
- all_tasks
- list of tasks to be evaluated
- models
- Models being used to evaluate results
Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This actually describes the eval model

README.md Outdated
-
- all_tasks
- list of tasks to be evaluated
- models
Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The models that we are evaluating on above tasks

README.md Outdated
- log_dir
- directory that the llm placed the experiment logs
- json_folder
-
Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The path in which the evaluation results will be placed

README.md Outdated
Comment on lines 47 to 49
- edit_script_model

- fast_llm
Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Using a different model to do smaller tasks as in editing scripts and understanding file contents rather than using the same main agent model is optional.
For our experiment we are using the same model to do all the work

README.md Outdated
How well can an LLM agent improve the training script to achieve high fairness metrics.

## Fairness Metrics:

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@surbhir08 could you please just put the names of the metrics here?

@AymanBx AymanBx linked an issue Aug 12, 2025 that may be closed by this pull request
@AymanBx AymanBx requested review from brownsarahm and removed request for brownsarahm August 16, 2025 15:36
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Create a beatiful README

3 participants