Skip to content

Conversation

@sarmuru2
Copy link

@sarmuru2 sarmuru2 commented Nov 2, 2023

When multiple task updates done on the same task at the same time, its possible that a racing condition could occur. When IN_PROGRESS and COMPLETED task updates are processed at the same time, its possible that IN_PROGRESS state is updated to task even after conductor updating COMPLETED state. Ideally if there is IN_PROGRESS notification, after COMPLETED notification, it should fail and log error. But in this case, IN_PROGRESS update is through after COMPLETED update and after some time it throws task TIMED_OUT error.

Introduced locking to avoid parallel task updates for the same task. We faced an issue in conductor when two
threads are updating the same task status as COMPLETED and IN_PROGRESS. Both these updates went successful
where IN_PROGRESS after COMPLETED should fail. As IN_PROGRESS went successful, task is TIMED_OUT after sometime
as there are no further updates from fusion. Workflow TIMED_OUT notification sent to fusion which makes the workflow as

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

let the comment be generic and not add any fusion details here

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants