fix(repos): Have repo sync batch up the work #113131
Conversation
Follow up to #113131. This fires off the tasks rather than calling them directly.
We've had a few task timeouts with the sync, caused by extremely large batches. This switches it over to limit batch sizes to 100 and fire parallel tasks instead. This PR just creates the tasks and calls them directly rather than scheduling them, so that we can be sure the tasks are deployed before we start firing them off. A follow-up PR will actually schedule them.
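The batching described above can be sketched roughly as follows. This is a simplified illustration, not the actual Sentry code: `BATCH_SIZE`, `chunked`, `sync_repos`, and the `restore_repos_batch` stub are all hypothetical stand-ins for the real task functions.

```python
from itertools import islice

# Assumed batch limit from the PR description.
BATCH_SIZE = 100


def chunked(iterable, size):
    """Yield successive lists of at most `size` items."""
    it = iter(iterable)
    while batch := list(islice(it, size)):
        yield batch


def restore_repos_batch(external_ids):
    # Placeholder for the real per-batch task body. In this PR the task is
    # called directly; the follow-up PR would schedule it asynchronously.
    pass


def sync_repos(external_ids):
    # Split the full set of external IDs into bounded batches so no single
    # task has to process an arbitrarily large amount of work.
    batches = list(chunked(external_ids, BATCH_SIZE))
    for batch in batches:
        restore_repos_batch(batch)
    return len(batches)
```

Bounding each task to 100 repos keeps individual task runtimes short, which is the stated fix for the timeouts.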
    providers=[provider],
)
restore_set = set(external_ids)
for repo in all_repos:
Bug: The refactored repo sync tasks can create duplicate REPO_ENABLED and REPO_DISABLED audit logs on retry because they don't filter repositories by their current status before logging.
Severity: MEDIUM
Suggested Fix
In restore_repos_batch, filter the initial repository query to only fetch repos with status=ObjectStatus.DISABLED. In disable_repos_batch, ensure the audit logging logic only considers repos that were actually transitioned from an ACTIVE state in the current task execution, rather than re-fetching all repos without a status filter.
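The suggested fix amounts to making the restore path idempotent by filtering on current status. A minimal sketch of that idea, assuming simplified stand-ins for the real models (`ObjectStatus`, `Repo`, and this `restore_repos_batch` signature are illustrative, not the actual Sentry API):

```python
from dataclasses import dataclass
from enum import Enum


class ObjectStatus(Enum):
    ACTIVE = 0
    DISABLED = 1


@dataclass
class Repo:
    external_id: str
    status: ObjectStatus


def restore_repos_batch(all_repos, external_ids, audit_log):
    restore_set = set(external_ids)
    for repo in all_repos:
        # Status filter: only DISABLED repos are eligible, so a retried task
        # skips already-restored repos and emits no duplicate audit entries.
        if repo.status is not ObjectStatus.DISABLED:
            continue
        if repo.external_id in restore_set:
            repo.status = ObjectStatus.ACTIVE
            audit_log.append(("REPO_ENABLED", repo.external_id))
```

Running the same batch twice then produces exactly one `REPO_ENABLED` entry per restored repo, which is the property the reviewer is asking for.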
Prompt for AI Agent
Review the code at the location below. A potential bug has been identified by an AI
agent. Verify if this is a real issue. If it is, propose a fix; if not, explain why it's
not valid.
Location: src/sentry/integrations/source_code_management/sync_repos.py#L373
Potential issue: The refactored `restore_repos_batch` and `disable_repos_batch` tasks
are vulnerable to creating duplicate audit log entries upon retry. The original code
performed these operations atomically, but the new, separate tasks can be retried. In
`restore_repos_batch`, the function fetches all repos without filtering by status,
causing already-restored repos to be processed again, generating a duplicate
`REPO_ENABLED` log. Similarly, in `disable_repos_batch`, the audit logging logic fetches
repos without a status filter, leading to duplicate `REPO_DISABLED` logs for
already-disabled repos if a retry occurs. This results in an inaccurate and misleading
audit trail.
Cursor Bugbot has reviewed your changes and found 1 potential issue.
Reviewed by Cursor Bugbot for commit a6ce51e.
    organization_id=rpc_org.id,
    integration_id=integration.id,
    providers=[provider],
)
restore_repos_batch missing disabled status filter on query
Medium Severity
restore_repos_batch fetches all repos regardless of status, while the old code only iterated disabled_repos (filtered to ObjectStatus.DISABLED). The get_repositories call here doesn't pass the available status parameter. When these tasks are moved to async execution (the stated follow-up), a repo that transitioned from DISABLED to PENDING_DELETION between the parent task and this batch task would be incorrectly set back to ACTIVE, overriding the deletion. Even synchronously, this needlessly updates already-active repos and emits false REPO_ENABLED audit log entries if duplicate external_id values exist.
Reviewed by Cursor Bugbot for commit a6ce51e.
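The race described in this comment can be demonstrated with a small self-contained sketch. The names here (`ObjectStatus`, `Repo`, the two restore variants) are hypothetical simplifications of the Sentry code, contrasting the unfiltered query with one that passes a disabled-status filter:

```python
from dataclasses import dataclass
from enum import Enum


class ObjectStatus(Enum):
    ACTIVE = 0
    DISABLED = 1
    PENDING_DELETION = 2


@dataclass
class Repo:
    external_id: str
    status: ObjectStatus


def restore_unfiltered(repos, external_ids):
    # Buggy variant: no status filter, so a repo that moved to
    # PENDING_DELETION between tasks is flipped back to ACTIVE.
    restore_set = set(external_ids)
    for repo in repos:
        if repo.external_id in restore_set:
            repo.status = ObjectStatus.ACTIVE


def restore_filtered(repos, external_ids):
    # Fixed variant: only repos currently DISABLED are eligible,
    # so a pending deletion is left untouched.
    restore_set = set(external_ids)
    for repo in repos:
        if repo.status is ObjectStatus.DISABLED and repo.external_id in restore_set:
            repo.status = ObjectStatus.ACTIVE
```

The unfiltered version overrides the deletion; the filtered version preserves it, which matters once the batch tasks run asynchronously and can observe stale state.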

