Conversation

@LinlinCui-NOAA
Collaborator

This PR makes two changes to run_graphcast.py:

  • Added a timing log for each step
  • Removed the conversion to bfloat16. bfloat16 should not be used for prediction: the reduced precision greatly increases run-to-run variance.

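For context on the second point: bfloat16 keeps only 7 explicit mantissa bits, so small perturbations in the model state can be lost entirely, which is one plausible source of the run-to-run variance mentioned above. A minimal sketch simulating bfloat16 truncation in pure NumPy (the helper name is hypothetical; real code would cast via JAX or the ml_dtypes package):

```python
import numpy as np

def truncate_to_bfloat16(x):
    """Simulate bfloat16 by zeroing the low 16 bits of float32 values.

    Hypothetical illustration only: bfloat16 is the top 16 bits of a
    float32, so this mimics its loss of mantissa precision.
    """
    bits = np.asarray(x, dtype=np.float32).view(np.uint32)
    return (bits & np.uint32(0xFFFF0000)).view(np.float32)

state = np.array([1.001, 250.3], dtype=np.float32)
print(truncate_to_bfloat16(state))  # small increments vanish: [1.0, 250.0]
```

Two states that differ only below the bfloat16 resolution become identical after the cast, so the trajectories diverge from rounding noise rather than from the inputs.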
Contributor

@aerorahul aerorahul left a comment


Looks good.
One could also introduce timing stats in rollout.chunked_predictions as well as converter.save_grib2 to get information on computation and I/O, respectively, through the model integration.
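One hedged way to wrap those calls without editing their bodies is a small timing decorator (a sketch; `timed` is a hypothetical helper, and the wrapped call names follow the suggestion above):

```python
import functools
import time

def timed(label):
    """Return a decorator that logs wall-clock time for each call.

    Hypothetical helper, not part of this PR.
    """
    def wrap(fn):
        @functools.wraps(fn)
        def inner(*args, **kwargs):
            start = time.perf_counter()
            result = fn(*args, **kwargs)
            elapsed = time.perf_counter() - start
            print(f"Elapsed time for {label}: {elapsed} seconds", flush=True)
            return result
        return inner
    return wrap

# Hypothetical usage around the calls mentioned above:
# predictions = timed("rollout")(rollout.chunked_predictions)(...)
# timed("grib2 output")(converter.save_grib2)(predictions)
```

This keeps the computation/I/O split visible in the log without touching the wrapped functions themselves.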


@RussellManser-NCO RussellManser-NCO left a comment


The additional time logging looks good, but I have concerns about the change to float casting if this is intended for the current production code.

@RussellManser-NCO RussellManser-NCO dismissed their stale review December 10, 2025 18:40

Requested changes were made. Thank you.

@RussellManser-NCO

I will run a test for this on WCOSS this afternoon.

@RussellManser-NCO

The print statements are not being written to output while run_graphcast.py is executing. Could you please modify the shebang to the following?

#!/usr/bin/env -S python3 -u

@aerorahul
Contributor

Could also add flush=True to the print statements. However, it seems like there is something else at play.
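For completeness, the flush=True variant (a sketch of the pattern, not the PR's actual print statements):

```python
import time

# Flushing explicitly forces each timing line to appear immediately,
# without changing the shebang or the runtime environment.
start = time.perf_counter()
result = sum(range(1_000_000))  # stand-in for a model step
print(f"Elapsed time for step: {time.perf_counter() - start:.2f} seconds",
      flush=True)
```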

@aerorahul
Contributor

If you don't want to make code changes, add export PYTHONUNBUFFERED=1 to the runtime environment.
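A third in-code option (a sketch, not part of this PR) is to force line buffering on stdout at startup, which is similar in effect to `python3 -u` or PYTHONUNBUFFERED=1 for line-oriented output:

```python
import sys

# Reconfigure stdout so each completed line is flushed immediately.
# Requires Python 3.7+ (io.TextIOWrapper.reconfigure).
sys.stdout.reconfigure(line_buffering=True)
print("timing message appears immediately")
```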

@LinlinCui-NOAA
Collaborator Author

I can switch to absl.logging, which writes to output while the script is executing, e.g.:

[2025-12-10 20:13:39,365] absl INFO: Elapsed time for loading input: 41.77620339393616 seconds
[2025-12-10 20:13:48,159] absl INFO: Elapsed time for extracting inputs, targets, and forcings: 8.793529987335205 seconds
[2025-12-10 20:13:49,123] absl INFO: Elapsed time for normalization: 0.9637486934661865 seconds

@RussellManser-NCO Please let me know which one you prefer.

@RussellManser-NCO

absl.logging works. It's nice to have the timestamps and logging info.

@RussellManser-NCO

Unfortunately, output was still buffered even with the modified shebang. I also tried export PYTHONUNBUFFERED=1, which did not work either. The latest push does work.

@LinlinCui-NOAA
Collaborator Author

OK. Thanks for testing.

@LinlinCui-NOAA LinlinCui-NOAA merged commit a03a127 into production/mlglobal.v1 Dec 10, 2025