fix(framework): Move Token deletion from PushAppOutputs into UpdateRunStatus by msheller · Pull Request #6924 · flwrlabs/flower

msheller · 2026-04-01T21:16:02Z

Issue

Fixes a run-finalization sequencing bug in ServerAppIo by keeping app tokens alive until terminal status is successfully persisted.

Prior to this fix, PushAppOutputs deleted the token before the final UpdateRunStatus(FINISHED, ...) call. If that terminal status update failed during shutdown, the token was already gone, and the heartbeat-expiry fallback could not finalize the run.
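To make the sequencing problem concrete, here is a toy model of the two orderings. It is only a sketch: the function name, the dictionary layout, and the assumption that the expiry sweep acts only on runs that still hold a token are illustrative and not taken from the Flower code base.

```python
def finalize(delete_token_early: bool, final_update_fails: bool) -> str:
    """Return the persisted run status after shutdown plus one expiry sweep.

    Hypothetical model: a run is a dict with a status and a token.
    """
    run = {"status": "RUNNING", "token": "tok"}

    # Step 1: PushAppOutputs. The old behavior deleted the token here.
    if delete_token_early:
        run["token"] = None

    # Step 2: UpdateRunStatus(FINISHED, ...). It may fail during shutdown.
    if not final_update_fails:
        run["status"] = "FINISHED"
        run["token"] = None

    # Step 3: heartbeat-expiry sweep. Assumption: it can only finalize
    # runs that are not yet finished and still hold a token.
    if run["status"] != "FINISHED" and run["token"] is not None:
        run["status"] = "FINISHED"
        run["token"] = None

    return run["status"]


# Old ordering with a failed final update: the run is never finalized.
print(finalize(delete_token_early=True, final_update_fails=True))
# New ordering with a failed final update: the fallback finalizes the run.
print(finalize(delete_token_early=False, final_update_fails=True))
```

In this model the only stuck case is the combination of early token deletion and a failed terminal update, which is the case the PR description targets.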

Proposal

Changes:

ServerAppIo.PushAppOutputs no longer deletes the token.
ServerAppIo.UpdateRunStatus now:
- Checks the result of state.update_run_status(...).
- Returns FAILED_PRECONDITION if the transition is rejected.
- Deletes the token only after FINISHED has been successfully persisted (after the existing object-store cleanup).
Added concise comments clarifying the sequencing rationale.
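The handler changes listed above can be sketched with an in-memory stand-in for the state. Everything here is an assumption for illustration: InMemoryState, the Code enum, the transition table, and the free functions are hypothetical and do not mirror the real servicer or LinkState signatures, and object-store cleanup is left out.

```python
from dataclasses import dataclass, field
from enum import Enum


class Code(Enum):
    """Hypothetical stand-in for gRPC status codes."""

    OK = "OK"
    FAILED_PRECONDITION = "FAILED_PRECONDITION"


# Hypothetical transition table; the real rules are richer.
VALID_TRANSITIONS = {
    ("PENDING", "STARTING"),
    ("STARTING", "RUNNING"),
    ("RUNNING", "FINISHED"),
}


@dataclass
class InMemoryState:
    """Toy state holding one status and one token per run."""

    status: dict = field(default_factory=dict)
    tokens: dict = field(default_factory=dict)

    def update_run_status(self, run_id: int, new_status: str) -> bool:
        """Persist the new status; return False if the transition is invalid."""
        if (self.status.get(run_id), new_status) not in VALID_TRANSITIONS:
            return False
        self.status[run_id] = new_status
        return True

    def delete_token(self, run_id: int) -> None:
        self.tokens.pop(run_id, None)


def push_app_outputs(state: InMemoryState, run_id: int) -> Code:
    """Store outputs (elided here) and leave the token untouched."""
    return Code.OK


def update_run_status(state: InMemoryState, run_id: int, new_status: str) -> Code:
    """Reject invalid transitions; delete the token only after FINISHED persists."""
    if not state.update_run_status(run_id, new_status):
        return Code.FAILED_PRECONDITION
    if new_status == "FINISHED":
        state.delete_token(run_id)
    return Code.OK
```

The point of the ordering is that the token is removed only on the path where the terminal status write has already succeeded, so every failure path leaves it in place for the fallback.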

Updated serverappio_servicer_test.py with focused regressions:

Token is retained after successful PushAppOutputs.
Invalid UpdateRunStatus transition returns FAILED_PRECONDITION and keeps token.
Successful RUNNING -> FINISHED(COMPLETED) deletes token.
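The three regressions could look roughly like the following, written against a mocked state object. This is not the content of serverappio_servicer_test.py: the handler functions, return values, and test names are hypothetical placeholders that only mirror the behavior described above.

```python
import unittest
from unittest.mock import MagicMock

OK = "OK"
FAILED_PRECONDITION = "FAILED_PRECONDITION"


def push_app_outputs(state, run_id):
    """Hypothetical handler: stores outputs (elided), never touches the token."""
    return OK


def update_run_status(state, run_id, new_status):
    """Hypothetical handler: token deletion follows a successful FINISHED write."""
    if not state.update_run_status(run_id, new_status):
        return FAILED_PRECONDITION
    if new_status == "FINISHED":
        state.delete_token(run_id)
    return OK


class TokenLifecycleTest(unittest.TestCase):
    def setUp(self):
        # MagicMock records calls, so token deletion can be asserted directly.
        self.state = MagicMock()

    def test_push_app_outputs_keeps_token(self):
        self.assertEqual(push_app_outputs(self.state, 1), OK)
        self.state.delete_token.assert_not_called()

    def test_invalid_transition_keeps_token(self):
        self.state.update_run_status.return_value = False
        result = update_run_status(self.state, 1, "FINISHED")
        self.assertEqual(result, FAILED_PRECONDITION)
        self.state.delete_token.assert_not_called()

    def test_finished_deletes_token(self):
        self.state.update_run_status.return_value = True
        self.assertEqual(update_run_status(self.state, 1, "FINISHED"), OK)
        self.state.delete_token.assert_called_once_with(1)
```

Asserting on delete_token calls rather than on stored state keeps each case focused on the token lifecycle and independent of how the state backend validates transitions.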

Checklist

Implement proposed change
Write tests
Update documentation
Address LLM-reviewer comments, if applicable (e.g., GitHub Copilot)
Make CI checks pass
Ping maintainers on Slack (channel #contributions)

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 93f4a9570e

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

Copilot

Pull request overview

Fixes a run-finalization sequencing issue by retaining ServerApp tokens until a terminal run status (FINISHED) has been successfully persisted, enabling heartbeat-expiry fallback finalization if shutdown status updates fail.

Changes:

Stop deleting the run token in ServerAppIoServicer.PushAppOutputs.
In ServerAppIoServicer.UpdateRunStatus, check the result of state.update_run_status(...), return FAILED_PRECONDITION on invalid transitions, and delete the token after a successful FINISHED update.
Add regression tests to validate token retention after PushAppOutputs, token retention on invalid status transitions, and token deletion on successful FINISHED transitions.

Reviewed changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 2 comments.

File	Description
framework/py/flwr/server/superlink/serverappio/serverappio_servicer.py	Moves token deletion responsibility to the FINISHED status update path and adds transition failure handling.
framework/py/flwr/server/superlink/serverappio/serverappio_servicer_test.py	Adds/updates tests to enforce the new token lifecycle and invalid-transition behavior.


Copilot

Pull request overview

Copilot reviewed 2 out of 2 changed files in this pull request and generated 2 comments.


chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 581c77b7f3

chatgpt-codex-connector · 2026-04-02T04:55:25Z

+        # Keep token until terminal status is committed. If shutdown finalization fails,
+        # heartbeat expiry still needs the token to trigger run finalization fallback.
        return PushAppOutputsResponse()


Delete token before entering shutdown cleanup

Keeping the token alive in PushAppOutputs creates a new failure window during normal shutdown: on_exit in framework/py/flwr/server/serverapp/app.py stops the heartbeat thread before it calls UpdateRunStatus, so if log upload/join or network teardown delays finalization for ~60s, token cleanup can mark the still-successful run as FINISHED/FAILED before UpdateRunStatus(FINISHED/COMPLETED) arrives. At that point abort_if(...[Status.FINISHED]...) rejects the completion update, so a successful run is persisted as failed under slow shutdown conditions.
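The window described in this comment can be illustrated with a small timing model. It is a sketch under stated assumptions: the 60-second timeout is taken from the "~60s" figure above, and the function name and the single-sweep behavior are hypothetical simplifications rather than the actual SuperLink logic.

```python
HEARTBEAT_TIMEOUT_S = 60.0  # assumed from the "~60s" figure in the comment


def shutdown_outcome(delay_before_final_update_s: float) -> str:
    """Return the persisted sub-status of a run that actually succeeded.

    Model: the heartbeat thread stops at t=0 while the token is retained,
    and UpdateRunStatus(FINISHED/COMPLETED) arrives after the given delay.
    """
    sub_status = None

    # If the delay reaches the timeout, token cleanup finalizes the run first.
    if delay_before_final_update_s >= HEARTBEAT_TIMEOUT_S:
        sub_status = "FAILED"

    # The completion update is accepted only if the run is not yet finished;
    # otherwise it is rejected and the earlier FAILED result stays.
    if sub_status is None:
        sub_status = "COMPLETED"

    return sub_status


print(shutdown_outcome(5.0))   # fast shutdown
print(shutdown_outcome(75.0))  # slow shutdown past the heartbeat timeout
```

Under these assumptions a successful run is recorded as failed whenever shutdown takes longer than the heartbeat timeout, which is the trade-off the reviewer is pointing at.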

Token deletion moved from PushAppOutputs into update run status

93f4a95

msheller marked this pull request as ready for review April 1, 2026 21:16

msheller requested review from danieljanes and tanertopal as code owners April 1, 2026 21:16

Copilot AI review requested due to automatic review settings April 1, 2026 21:16

Copilot started reviewing on behalf of msheller April 1, 2026 21:16 View session

chatgpt-codex-connector bot reviewed Apr 1, 2026

Comment thread framework/py/flwr/server/superlink/serverappio/serverappio_servicer.py Outdated

Copilot AI reviewed Apr 1, 2026

Comment thread framework/py/flwr/server/superlink/serverappio/serverappio_servicer.py Outdated

Comment thread framework/py/flwr/server/superlink/serverappio/serverappio_servicer.py Outdated

github-actions bot added the Maintainer Used to determine what PRs (mainly) come from Flower maintainers. label Apr 1, 2026

Address copilot comments

b5d2416

msheller requested a review from Copilot April 2, 2026 03:17

Copilot started reviewing on behalf of msheller April 2, 2026 03:18 View session

Copilot AI reviewed Apr 2, 2026

Comment thread framework/py/flwr/server/superlink/serverappio/serverappio_servicer.py Outdated

Comment thread framework/py/flwr/server/superlink/serverappio/serverappio_servicer.py

Updated to call delete token prior to delete objects

581c77b

chatgpt-codex-connector bot reviewed Apr 2, 2026

msheller wants to merge 3 commits into main from fix-token-deletion-for-reliable-run-finalization
