Feature/sp4 deo flops #14

RChrHill · 2025-11-18T01:49:10Z

Add Sp(4) versions of DeoFlops benchmarks.
There is only a double-precision version due to a bug in the Sp(4) Grid implementation causing FP32 code to call into FP64 template specialisations.

(Thanks to Ed and Alexis for providing the implementation -- marked as draft since I want to do a small amount of harmonisation with the rest of the codebase)

Co-authored-by: Ed Bennett <e.j.bennett@swansea.ac.uk>

Co-authored-by: Ryan Hill <rchrys.hill@gmail.com>

…ructure

Sp4 benchmarks

aportelli · 2025-11-19T04:00:44Z

Hi @RChrHill thanks! I am just appreciating the only FP64 aspect. On can live with it but it is quite awkward to have different benchmarks for different precisions. How much time would be needed to correct this in Grid?

We could have the benchmark logic pretending this will be fixed and having a warning meanwhile.

RChrHill · 2025-11-19T19:53:17Z

Hi @aportelli, I've submitted a fix for the FP32 version to Grid. While we wait for it to be merged, I've committed a workaround in 49ff1bd.

The general problem is the Sp<4>::HotConfiguration doesn't work for FP32 gauge fields, due to the issue in the Grid PR. So, I propose that we work around this by detecting an Sp4 single-precision instantiation and generate the gauge field in FP64, then just cast it to FP32.

Also caught two FP32 templates instantiated as FP64, thanks to logging the Action's precision with the compile-time metadata structs...

Attached is an example output JSON for the flops results (hacked to only run on 8^4 and 12^4 locally): result.json

aportelli · 2025-11-20T05:30:51Z

I understand. The random gauge field is essentially irrelevant to the benchmarking, so this should not be a constraint here. For the moment I am happy with the workaround you proposed, any other way to initialised the field would be fine as well.

Alexis Provatas and others added 8 commits November 6, 2025 21:34

first draft of Sp(4) benchmarks

1c31a1a

Co-authored-by: Ed Bennett <e.j.bennett@swansea.ac.uk>

use delete, not free, for objects created with new

1ae65fe

first attempt to implement code review feedback

0b99be9

Co-authored-by: Ryan Hill <rchrys.hill@gmail.com>

make strings for new classes constexpr

99ce8df

adjust running and output of Sp(4) results to match new refactored st…

c67cb09

…ructure

Merge pull request #1 from edbennett/sp4

b9f1e11

Sp4 benchmarks

Refactor DeoFlops helpers to naturally integrate Sp4 benchmark

7b5340e

Fix merge conflicts

7624640

RChrHill force-pushed the feature/sp4-deo-flops branch from d07d7b7 to 7624640 Compare November 19, 2025 02:15

RChrHill marked this pull request as ready for review November 19, 2025 02:17

Add Sp(4) FP32 benchmarks

49ff1bd

RChrHill force-pushed the feature/sp4-deo-flops branch from 254a2a1 to 49ff1bd Compare November 19, 2025 18:12

aportelli merged commit edce6ac into aportelli:main Nov 20, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Feature/sp4 deo flops #14

Feature/sp4 deo flops #14

Uh oh!

RChrHill commented Nov 18, 2025

Uh oh!

aportelli commented Nov 19, 2025

Uh oh!

RChrHill commented Nov 19, 2025 •

edited

Loading

Uh oh!

aportelli commented Nov 20, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Feature/sp4 deo flops #14

Feature/sp4 deo flops #14

Uh oh!

Conversation

RChrHill commented Nov 18, 2025

Uh oh!

aportelli commented Nov 19, 2025

Uh oh!

RChrHill commented Nov 19, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

aportelli commented Nov 20, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

RChrHill commented Nov 19, 2025 •

edited

Loading