-
Notifications
You must be signed in to change notification settings - Fork 3
Feature/sp4 deo flops #14
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
Co-authored-by: Ed Bennett <e.j.bennett@swansea.ac.uk>
Co-authored-by: Ryan Hill <rchrys.hill@gmail.com>
Sp4 benchmarks
d07d7b7 to
7624640
Compare
|
Hi @RChrHill thanks! I am just appreciating the only FP64 aspect. On can live with it but it is quite awkward to have different benchmarks for different precisions. How much time would be needed to correct this in Grid? We could have the benchmark logic pretending this will be fixed and having a warning meanwhile. |
254a2a1 to
49ff1bd
Compare
|
Hi @aportelli, I've submitted a fix for the FP32 version to Grid. While we wait for it to be merged, I've committed a workaround in 49ff1bd. The general problem is the Also caught two FP32 templates instantiated as FP64, thanks to logging the Action's precision with the compile-time metadata structs... Attached is an example output JSON for the flops results (hacked to only run on 8^4 and 12^4 locally): result.json |
|
I understand. The random gauge field is essentially irrelevant to the benchmarking, so this should not be a constraint here. For the moment I am happy with the workaround you proposed, any other way to initialised the field would be fine as well. |
Add Sp(4) versions of DeoFlops benchmarks.
There is only a double-precision version due to a bug in the Sp(4) Grid implementation causing FP32 code to call into FP64 template specialisations.
(Thanks to Ed and Alexis for providing the implementation -- marked as draft since I want to do a small amount of harmonisation with the rest of the codebase)