Readme #15

AymanBx · 2025-07-09T16:49:27Z

No description provided.

Added variables from run_experiments.sh and eval.sh. I added short descriptions for the ones I know.

AymanBx · 2025-07-22T10:52:58Z

README.md

+- all_tasks
+  - list of tasks to be evaluated
+- models
+  - Models being used to evaluate results


This actually describes the eval model

AymanBx · 2025-07-22T10:53:26Z

README.md

+  - 
+- all_tasks
+  - list of tasks to be evaluated
+- models


The models that we are evaluating on the tasks above.

AymanBx · 2025-07-22T10:53:49Z

README.md

+- log_dir
+  - directory that the llm placed the experiment logs
+- json_folder
+  - 


The path where the evaluation results will be placed.

AymanBx · 2025-07-22T10:56:41Z

README.md

+- edit_script_model
+
+- fast_llm


Using a different model for smaller tasks, such as editing scripts and understanding file contents, instead of the main agent model is optional.
For our experiment we are using the same model for all of the work.
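
One way to express this optional split in a shell script is to fall back to the main model when no helper model is given. This is only a sketch: `main_model` and its value are hypothetical placeholders, and the real run_experiments.sh may set `edit_script_model` and `fast_llm` differently.

```shell
#!/usr/bin/env bash
# Hypothetical placeholder: only edit_script_model and fast_llm appear in the README diff.
main_model="example-model"

# If no separate helper model is set, reuse the main agent model for both roles.
edit_script_model="${edit_script_model:-$main_model}"
fast_llm="${fast_llm:-$main_model}"

echo "edit_script_model=$edit_script_model fast_llm=$fast_llm"
```

Exporting `edit_script_model` or `fast_llm` before the script runs would override the fallback for that role only.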

AymanBx · 2025-07-22T13:24:55Z

README.md

+How well can an LLM agent improve the training script to achieve high fairness metrics.
+
+## Fairness Metrics:
+


@surbhir08 could you please just put the names of the metrics here?

README.md

Initial draft for readme.md

c89a1bb

AymanBx requested review from brownsarahm and lanij21 July 9, 2025 16:49

AymanBx changed the title ~~Initial draft for readme.md~~ Readme Jul 9, 2025

added variables/filled some descriptions

2ce2187

Added variables from run_experiments.sh and eval.sh. I added short descriptions for the ones I know.

AymanBx commented Jul 22, 2025


Update README.md

dd0ed63

AymanBx commented Jul 22, 2025


AymanBx linked an issue Aug 12, 2025 that may be closed by this pull request

Create a beatiful README #21

Open

Update README.md with more details

f68d9a4

AymanBx requested review from brownsarahm and removed request for brownsarahm August 16, 2025 15:36

3 participants
