Refactor evaluators #107

MaHaWo · 2025-12-12T13:28:49Z

Refactor the evaluators to be more general and compatible with configs
Add tests
Fix some smaller errors

depends on #101

Copilot

Pull request overview

This PR refactors the evaluation system to be more flexible and config-driven, removing the Julia integration, adding new model architectures (Sequential, GPSTransformer), and expanding test coverage. Key changes include:

Renamed evaluators to Evaluator, Validator, Tester with configurable monitor tasks
Made early stopping optional and updated Trainer schema to support both constructor and evaluator config formats
Added comprehensive tests for evaluators, sequential models, and GPS transformers
Removed Julia worker integration and tests
Updated dataset handling for Zarr stores without root groups
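
To illustrate what "optional early stopping" can look like in a training loop, here is a minimal sketch. It is not the Trainer code from this PR: `run_epochs` and `PatienceStopper` are hypothetical names chosen for illustration, and the patience-based rule is an assumption about how a stopper might behave.

```python
from typing import Callable, Iterable, Optional


class PatienceStopper:
    """Hypothetical early stopper: stop after `patience` epochs without improvement."""

    def __init__(self, patience: int = 2):
        self.patience = patience
        self.best = float("inf")
        self.bad_epochs = 0

    def __call__(self, loss: float) -> bool:
        if loss < self.best:
            # New best validation loss: reset the counter.
            self.best = loss
            self.bad_epochs = 0
        else:
            self.bad_epochs += 1
        return self.bad_epochs >= self.patience


def run_epochs(
    val_losses: Iterable[float],
    early_stopping: Optional[Callable[[float], bool]] = None,
) -> int:
    """Consume per-epoch validation losses and return the number of epochs run.

    When `early_stopping` is None the loop simply runs to completion.
    """
    epochs_run = 0
    for loss in val_losses:
        epochs_run += 1
        if early_stopping is not None and early_stopping(loss):
            break
    return epochs_run


losses = [1.0, 0.8, 0.9, 0.95, 0.7]
epochs_without_stopper = run_epochs(losses)
epochs_with_stopper = run_epochs(losses, PatienceStopper(patience=2))
```

With the stopper omitted, all five epochs run; with a patience of 2, the loop ends after the second consecutive epoch that fails to improve on the best loss.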

Reviewed changes

Copilot reviewed 34 out of 36 changed files in this pull request and generated 2 comments.


File	Description
`evaluate.py`	Refactored evaluators with DataFrame-based results and configurable monitor tasks
`train.py`, `train_ddp.py`	Updated trainer to use new evaluator names and optional early stopping
`test_evaluate.py`	Added comprehensive evaluator tests with custom monitors
`test_sequential.py`, `test_gps_transformer.py`	New tests for Sequential and GPSTransformer models
`test_trainer.py`	Updated tests for default evaluators and trainer configuration
`test_tune.py`	Refactored fixture to use `yaml_config` fixture
`conftest.py`	Replaced Julia-based fixtures with pure Python data generation
`config_utils.py`	Added support for list sweeps and numeric string conversion
`dataset_base.py`, `dataset_ondisk.py`	Updated to handle Zarr stores without root groups
`gnn_model.py`	Fixed pooling layer validation logic
`sequential.py`, `gps_transformer.py`	New configurable model architectures
`models/__init__.py`	Exported new Sequential model
`utils.py`	Added `maybe_number` utility
`pyproject.toml`	Removed `juliacall`, added `huggingface-hub`
Documentation files	Updated guides and API docs for new evaluator system
Config files	Added example configs for training
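
The table mentions a `maybe_number` utility and numeric string conversion in `config_utils.py`, but the implementation is not shown on this page. The sketch below is one plausible reading, under the assumption that the helper turns numeric-looking strings into `int` or `float` and leaves everything else untouched. The name `maybe_number_sketch` is deliberately different from the real function to make clear that this is a guess at the behavior, not the code in `utils.py`.

```python
from typing import Any


def maybe_number_sketch(value: Any) -> Any:
    """Hypothetical helper: convert numeric-looking strings to int or float.

    Non-string inputs and strings that do not parse as numbers are
    returned unchanged.
    """
    if not isinstance(value, str):
        return value
    # Try int first so that "3" becomes 3 rather than 3.0.
    for cast in (int, float):
        try:
            return cast(value)
        except ValueError:
            continue
    return value


converted = [maybe_number_sketch(v) for v in ["3", "1e-3", "adam", 5]]
```

A conversion like this matters for YAML configs because some loaders read values such as `1e-3` as strings rather than floats. Note that `float` also accepts strings like `"nan"` and `"inf"`, so a stricter version might reject those explicitly.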


Copilot · 2025-12-29T20:05:12Z

QuantumGravPy/test/test_trainer.py

+
+    original_weights = [param.clone() for param in trainer.model.parameters()]
+
+    training_data, valid_data = trainer.run_training(


Variable training_data is not used.

Copilot · 2025-12-29T20:05:13Z

QuantumGravPy/test/test_trainer.py

+
+    original_weights = [param.clone() for param in trainer.model.parameters()]
+
+    training_data, valid_data = trainer.run_training(


Variable valid_data is not used.
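
Both findings concern the same tuple unpacking, and there are two common ways to resolve them: assert on the returned values, or bind them to `_` when only the side effect on the model matters. The sketch below shows both. `DummyTrainer` is a hypothetical stand-in that mimics only the return shape of `run_training`; it is not the Trainer class from this repository, and the real test compares cloned parameter tensors rather than plain lists.

```python
class DummyTrainer:
    """Hypothetical stand-in for the real Trainer; only the return shape matters."""

    def __init__(self):
        self.weights = [0.0, 0.0]

    def run_training(self):
        # Pretend one optimisation step changed the weights.
        self.weights = [w + 0.1 for w in self.weights]
        training_data = [{"epoch": 0, "loss": 1.0}]
        valid_data = [{"epoch": 0, "loss": 1.2}]
        return training_data, valid_data


def test_training_uses_returned_results():
    trainer = DummyTrainer()
    original_weights = list(trainer.weights)

    # Option 1: keep the names and assert on them, so they are no longer unused.
    training_data, valid_data = trainer.run_training()

    assert len(training_data) > 0
    assert len(valid_data) > 0
    assert trainer.weights != original_weights


def test_training_discards_returned_results():
    trainer = DummyTrainer()
    original_weights = list(trainer.weights)

    # Option 2: the results are irrelevant to this test, so discard them explicitly.
    _, _ = trainer.run_training()

    assert trainer.weights != original_weights
```

Option 1 is usually preferable when the returned data is part of the contract under test, since it adds coverage instead of only silencing the warning.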

MaHaWo added 30 commits November 25, 2025 14:00

put model defs into a separate dir

7b59d22

add SkipConnection

89d7388

adjust imports

ce1d70f

correct errors, exports

275527c

fix gnn_block tests

feaba08

adjust fixtures and some tests

09bbde0

add jsonschema and new way to initialize it to GNNBlock

0b56787

fix linearsequential model tests

6ffc6a2

fix utils tests

ae30e45

adjust model exports

c48e0ca

fix problem with type vs string

ac75cb7

don't allow additional props in config for skipconnections

23f7103

write skip connection test

92c8d23

make skip connection tests work

f888875

don't allow additional stuff in config

a45b8a0

rework gnnmodel init, work on json schema

4e6d7c2

work on gnn model

7fc110a

fix type annotation

59697f6

work on gnn_model tests

4e412e9

remove verify_config

ad31b3e

work on gnn_model tests

ba91ee2

remove to_config requirement

69551e5

add missing fixture

664cac4

add pre-commit stuff,

172a77b

add arbitrary model test.

4e52ed9

correct some docstrings, imports

8f0d3c1

make eval tests work again

8bc428b

adjust trainer code for new model config

e7c26b3

finish test adjustments

1336a4e

remove old early stopper

8865e19

MaHaWo added 23 commits December 16, 2025 17:04

finish general sequential model

aba9099

add docstrings

cf4b01d

add correction to config sweep system

539958d

add name to trainer

769f4e8

add sweep_targets member to confighandler

ccf6373

add random seed for numpy, learning rate scheduler

bc4a792

config correction

4fb3212

remove some unnecessarily strict requirements

bd70340

rework the docs to accommodate the current code

ff77017

add huggingface hub dependency

2eebf95

add config tests

bdf663b

add from_config tests and corrections

f6ed221

Merge branch 'main' into refactor-evaluators

1c030ed

fix trainer and tune tests

17f3444

add transformer model

9650937

add fromconfig

917e37d

add tests

1a4162c

small corrections

1eeb23b

add transformer tests

9378643

rename test

07c29db

exclude tests from coverage report

b1252a3

remove julia worker

ac1ef00

remove superfluous fixtures, old juliacall import

27b1329

MaHaWo marked this pull request as ready for review December 29, 2025 20:02

Copilot AI review requested due to automatic review settings December 29, 2025 20:02

Copilot started reviewing on behalf of MaHaWo December 29, 2025 20:02

Copilot AI reviewed Dec 29, 2025


MaHaWo added 3 commits January 2, 2026 17:54

add input sanitization for factories

e0b3a98

fix mistakes in cset factories

b0b91bd

fix some problems

48a544a


2 participants

