Add new category containing interstitial benchmarks by zhonganr · Pull Request #337 · ddmms/ml-peg

zhonganr · 2026-02-04T10:33:58Z

Add a new category interstitial to assess the models' predictive performance for interstitial defect properties. Two benchmarks FE1SIA and Relastab are included:

FE1SIA evaluates the formation energy of a single self-interstitial atom (SIA) in a host lattice for distinct configurations.
Relastab evaluates the ability of models to correctly rank the stability of different interstitial configurations.
Related to Interstitial benchmarks #339

ElliottKasoar · 2026-02-20T13:10:58Z

ml_peg/calcs/interstitial/FE1SIA/calc_FE1SIA.py

+    calc = model.get_calculator()
+
+    data_path = download_s3_data(
+        key="inputs/interstitial/FE1SIA/DB.zip",


Is there a reason to keep the data for the two tests together?

Ideally, we'd have inputs/interstitial/FE1SIA/FE1SIA.zip and inputs/interstitial/relative_stability/relative_stability.zip unless there's anything connecting them?

This also reduces the chance of clashes between directories on unzipping in the cache.

Also note: normally we'd want relative_stability to be consistent with the test name, so if Relastab is the name you're happy with, then actually I'd go with inputs/interstitial/Relastab/Relastab.zip

If you're happy with this, I'm happy to upload zipped versions of the two individual folders within DB.zip in this form?

ElliottKasoar · 2026-02-20T13:12:23Z

ml_peg/calcs/interstitial/FE1SIA/calc_FE1SIA.py

+            ref_formation_energy = energy_ref_raw - (n_config / n_bulk) * energy_bulk
+
+        # Read structure
+        atoms = read(poscar_path, format="vasp")


Can you add a default spin (multiplicity) and charge? See similar changes here: https://github.com/ddmms/ml-peg/pull/384/changes

I very recently discovered Orb's omol model always requires both to be set, annoyingly

ElliottKasoar · 2026-02-20T13:17:08Z

ml_peg/calcs/interstitial/FE1SIA/calc_FE1SIA.py

+        except (ValueError, IndexError):
+            print("Skipping ref.poscar: distinct energy value not found in header.")
+            energy_bulk = 0.0  # Fallback or break


If there is only a single reference file, shouldn't we be quite confident that we can read it?

ElliottKasoar · 2026-02-20T13:18:39Z

ml_peg/calcs/interstitial/Relastab/calc_Relastab.py

+    calc = model.get_calculator()
+
+    data_path = download_s3_data(
+        key="inputs/interstitial/relative_stability/DB.zip",


ElliottKasoar · 2026-02-20T13:19:49Z

ml_peg/calcs/interstitial/Relastab/calc_Relastab.py

+            except (IndexError, ValueError) as e:
+                print(f"Warning: Could not extract energy from header '{header}': {e}")
+                atoms.info["ref"] = None


Does this happen? Given we only have a few files, we probably don't need to keep files with missing references?

ElliottKasoar · 2026-02-20T13:20:05Z

ml_peg/calcs/interstitial/Relastab/calc_Relastab.py

+
+        # Calculate
+        atoms.calc = calc
+


See above regarding spin/charge

ElliottKasoar · 2026-02-20T13:20:29Z

.gitignore

+calculate_rmsd.py
+DB/
+DB.zip


Suggested change

calculate_rmsd.py

DB/

DB.zip

I don't think we should need any of these?

ElliottKasoar · 2026-02-20T16:12:59Z

Thanks for adding this and sharing the data! It's looking great so far!

zhonganr and others added 2 commits February 3, 2026 19:17

Add new category containing two interstitial benchmarks

fc06321

Merge branch 'main' into benchmark_interstitial

9947aa6

ElliottKasoar added the new benchmark Proposals and suggestions for new benchmarks label Feb 6, 2026

Merge branch 'main' into benchmark_interstitial

e526ac1

ElliottKasoar reviewed Feb 20, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add new category containing interstitial benchmarks#337

Add new category containing interstitial benchmarks#337
zhonganr wants to merge 3 commits intoddmms:mainfrom
zhonganr:benchmark_interstitial

zhonganr commented Feb 4, 2026 •

edited

Loading

Uh oh!

ElliottKasoar Feb 20, 2026

Uh oh!

ElliottKasoar Feb 20, 2026

Uh oh!

ElliottKasoar Feb 20, 2026

Uh oh!

ElliottKasoar Feb 20, 2026

Uh oh!

ElliottKasoar Feb 20, 2026

Uh oh!

ElliottKasoar Feb 20, 2026

Uh oh!

ElliottKasoar Feb 20, 2026

Uh oh!

ElliottKasoar commented Feb 20, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

zhonganr commented Feb 4, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ElliottKasoar Feb 20, 2026

Choose a reason for hiding this comment

Uh oh!

ElliottKasoar Feb 20, 2026

Choose a reason for hiding this comment

Uh oh!

ElliottKasoar Feb 20, 2026

Choose a reason for hiding this comment

Uh oh!

ElliottKasoar Feb 20, 2026

Choose a reason for hiding this comment

Uh oh!

ElliottKasoar Feb 20, 2026

Choose a reason for hiding this comment

Uh oh!

ElliottKasoar Feb 20, 2026

Choose a reason for hiding this comment

Uh oh!

ElliottKasoar Feb 20, 2026

Choose a reason for hiding this comment

Uh oh!

ElliottKasoar commented Feb 20, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

zhonganr commented Feb 4, 2026 •

edited

Loading