@MaHaWo (Collaborator) commented Dec 12, 2025

  • refactor evaluators to be more general and compatible with configs
  • add tests
  • some smaller error corrections

depends on #101

@MaHaWo MaHaWo marked this pull request as ready for review December 29, 2025 20:02
Copilot AI review requested due to automatic review settings December 29, 2025 20:02
Copilot AI (Contributor) left a comment

Pull request overview

This PR refactors the evaluation system to be more flexible and config-driven, removing the Julia integration, adding new model architectures (Sequential, GPSTransformer), and expanding test coverage. Key changes include:

  • Renamed evaluators to Evaluator, Validator, Tester with configurable monitor tasks
  • Made early stopping optional and updated Trainer schema to support both constructor and evaluator config formats
  • Added comprehensive tests for evaluators, sequential models, and GPS transformers
  • Removed Julia worker integration and tests
  • Updated dataset handling for Zarr stores without root groups
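
To make the first two bullets concrete, here is a minimal sketch of an evaluator that runs a configurable set of monitor tasks and a loop in which early stopping is optional. Every name in it (`SketchEvaluator`, `run_epochs`, the monitor functions) is hypothetical and is not taken from this PR; the real `Evaluator`, `Validator`, and `Tester` classes may be structured quite differently, and the PR reports results as DataFrames rather than the plain dicts used here.

```python
from typing import Callable, Dict, List, Optional, Sequence

# Assumption: a monitor task maps (predictions, targets) to a single number.
MonitorTask = Callable[[Sequence[float], Sequence[float]], float]


def mean_absolute_error(preds, targets):
    return sum(abs(p - t) for p, t in zip(preds, targets)) / len(preds)


def max_error(preds, targets):
    return max(abs(p - t) for p, t in zip(preds, targets))


class SketchEvaluator:
    """Illustrative stand-in, not the project's Evaluator class."""

    def __init__(self, monitors: Dict[str, MonitorTask]):
        self.monitors = monitors
        self.records: List[dict] = []  # one row per evaluation call

    def evaluate(self, epoch, preds, targets):
        # Run every configured monitor task and store the results as one row.
        row = {"epoch": epoch}
        for name, task in self.monitors.items():
            row[name] = task(preds, targets)
        self.records.append(row)
        return row


def run_epochs(evaluator, batches,
               early_stopping: Optional[Callable[[dict], bool]] = None):
    """Evaluate once per epoch; stop early only if a stopping callable is given."""
    for epoch, (preds, targets) in enumerate(batches):
        row = evaluator.evaluate(epoch, preds, targets)
        if early_stopping is not None and early_stopping(row):
            break
    return evaluator.records
```

With `early_stopping=None` the loop always runs to completion, which is the behaviour one would expect from making early stopping optional.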

Reviewed changes

Copilot reviewed 34 out of 36 changed files in this pull request and generated 2 comments.

| File | Description |
| --- | --- |
| evaluate.py | Refactored evaluators with DataFrame-based results and configurable monitor tasks |
| train.py, train_ddp.py | Updated trainer to use new evaluator names and optional early stopping |
| test_evaluate.py | Added comprehensive evaluator tests with custom monitors |
| test_sequential.py, test_gps_transformer.py | New tests for Sequential and GPSTransformer models |
| test_trainer.py | Updated tests for default evaluators and trainer configuration |
| test_tune.py | Refactored fixture to use yaml_config fixture |
| conftest.py | Replaced Julia-based fixtures with pure Python data generation |
| config_utils.py | Added support for list sweeps and numeric string conversion |
| dataset_base.py, dataset_ondisk.py | Updated to handle Zarr stores without root groups |
| gnn_model.py | Fixed pooling layer validation logic |
| sequential.py, gps_transformer.py | New configurable model architectures |
| models/__init__.py | Exported new Sequential model |
| utils.py | Added maybe_number utility |
| pyproject.toml | Removed juliacall, added huggingface-hub |
| Documentation files | Updated guides and API docs for new evaluator system |
| Config files | Added example configs for training |
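
The summary mentions a `maybe_number` utility and numeric string conversion in the config handling, but does not show either. The following is only a guess at what such a helper might do, written as a self-contained sketch; the body and its exact behaviour are assumptions, not the code in `utils.py`.

```python
def maybe_number(value):
    """Return an int or float if `value` is a numeric string, else `value` unchanged.

    Assumed behaviour for illustration; the helper in this PR may differ.
    """
    if not isinstance(value, str):
        return value  # already a number (or some other type): leave as-is
    text = value.strip()
    for cast in (int, float):  # try int first so "3" stays an integer
        try:
            return cast(text)
        except ValueError:
            continue
    return value  # not numeric: hand back the original string
```

A helper like this can be useful when config values such as `"1e-3"` arrive as strings. Note that under this sketch strings like `"inf"` or `"nan"` would also be converted to floats, which may or may not be desired.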



original_weights = [param.clone() for param in trainer.model.parameters()]

training_data, valid_data = trainer.run_training(
Copilot AI Dec 29, 2025

Variable training_data is not used.

Copilot uses AI. Check for mistakes.

original_weights = [param.clone() for param in trainer.model.parameters()]

training_data, valid_data = trainer.run_training(
Copilot AI Dec 29, 2025

Variable valid_data is not used.
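
Both unused-variable comments point at the same assignment, so they can be handled together. The sketch below shows two common options. `StubTrainer` is a hypothetical stand-in that only mimics a two-value return; the arguments to the real `run_training` call are cut off in the snippet above and are deliberately not reproduced, and the meaning of the two return values is an assumption.

```python
class StubTrainer:
    """Hypothetical stand-in for the trainer used in the test."""

    def __init__(self):
        self.weights = [0.0, 0.0]

    def run_training(self):
        # Pretend one training step changes the weights.
        self.weights = [w + 1.0 for w in self.weights]
        # Assumption: two sequences come back (training and validation results).
        return [0.9, 0.5], [1.0, 0.7]


trainer = StubTrainer()
original_weights = list(trainer.weights)

# Option 1: the test only cares about the weight update, so discard the results.
_, _ = trainer.run_training()
assert trainer.weights != original_weights

# Option 2: keep the names and assert on them, so they are no longer unused.
training_data, valid_data = trainer.run_training()
assert len(training_data) > 0
assert len(valid_data) > 0
```

Option 2 is usually preferable in a test when the returned values have a checkable shape, since it adds coverage instead of only silencing the warning.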

