Remove rate limit on reserved tests by shioyama · Pull Request #334 · Shopify/ci-queue

shioyama · 2025-05-23T04:34:18Z

Currently, ci-queue only allows one test to be reserved at a time. We are attempting to implement test parallelism within ci-queue jobs, which requires that the server process reserves more than this.

I've made some changes here to support that change:

Implement reserved_test as a set instead of a single test (renamed to reserved_tests)
Only raise on acknowledge and requeue, not on reserve since there is nothing to check when reserving a test.

I've implemented this for everything, i.e. without some configuration to switch it on and off. This simplifies things a lot, because I don't think ensuring that we're not reserving when a test has already been reserved is actually doing much atm. But maybe I'm missing something.

ChrisBr · 2025-05-23T18:51:02Z

ruby/lib/ci/queue/redis/worker.rb


        def initialize(redis, config)
-          @reserved_test = nil
+          @reserved_tests = Set.new


I guess it would be difficult to send the process ID back up and scope by that?

Hmm, not sure that would be possible 🤔

We have a master process that is reserving tests, which forked processes pull from. The master process fills the queue as soon as it has less than the max (currently 1).

When we are reserving a test, therefore, we don't know which process will end up running it, so we can't check for a "mismatch" at that point.

However, we might be able to improve the check in acknowledge such that when a worker process takes a test (pops it from the server via drb), we track its process ID and associate it with the reserved test. Then in theory when the result comes back, we could check that the reserved test exists, and that the pid we associated with it matches the worker that is trying to record the result.

I'm not sure that we can get the pid though from a request to the drb server. 🤔

Oh never mind we can just make the worker send its pid.

Ok so we can indeed check that the worker recording a result for a test matches the pid for the worker that asked for that same test. I don't think it would be easy to do that on the gem side of this, though. And I'm not really convinced that having that check would do much for us.

OTOH I don't see how we can do anything about reserve, given that when a server process reserves a test, it doesn't know which worker will end up taking it. So there's no check to be made at that moment.

shioyama · 2025-06-13T05:05:55Z

ruby/lib/ci/queue/redis/worker.rb


-        def initialize(redis, config)
-          @reserved_test = nil
+        def initialize(redis, config, max_reserved_tests: 1)


@ChrisBr I added a parameter we can use to ensure we are not reserving more tests than we allow in the in-memory queue. This is fairly easy and I think would catch the obvious issues? It would be a duplicate check since we already check this in the parallelization server but can't be too careful.

shioyama · 2025-06-16T07:11:17Z

Ok I extracted the stuff not related to reserving tests to #337, which should be uncontroversial. Then we can deal with the change to reserved tests on its own.

shioyama · 2025-06-16T07:17:45Z

Updated this PR to only include the part about reserving tests; the rest is in #337.

casperisfine · 2025-06-16T07:50:11Z

ruby/lib/ci/queue/redis/worker.rb

+          if reserved_tests.include?(test)
+            reserved_tests.delete(test)
          else
-            raise ReservationError, "Acknowledged #{test.inspect} but #{@reserved_test.inspect} was reserved"
+            raise ReservationError, "Acknowledged #{test.inspect} but only #{reserved_tests.map(&:inspect).join(", ")} reserved"


unless reserved_tests.delete?(test) raise ... end

shioyama · 2025-06-18T01:20:49Z

Rebased this on main.

shioyama changed the base branch from main to cbruckmayer/check-we-executed-all-tests May 23, 2025 04:35

shioyama force-pushed the shioyama/parallel_ci_queue branch 3 times, most recently from 8a1e3eb to b1203e0 Compare May 23, 2025 05:37

shioyama changed the title ~~Refactor~~ Move Minitest Minitest.__run patch internals to Minitest::Queue.run and remove rate limit on reserved tests May 23, 2025

shioyama requested a review from ChrisBr May 23, 2025 07:16

ChrisBr requested a review from casperisfine May 23, 2025 18:50

ChrisBr reviewed May 23, 2025

View reviewed changes

casperisfine removed their request for review May 23, 2025 19:49

shioyama force-pushed the shioyama/parallel_ci_queue branch 2 times, most recently from 2521f8c to 65bfaed Compare June 13, 2025 03:28

shioyama commented Jun 13, 2025

View reviewed changes

shioyama force-pushed the shioyama/parallel_ci_queue branch from 47ff8be to 65bfaed Compare June 13, 2025 05:18

shioyama mentioned this pull request Jun 16, 2025

Move Minitest.__run patch internals to Minitest::Queue.run #337

Merged

shioyama changed the title ~~Move Minitest Minitest.__run patch internals to Minitest::Queue.run and remove rate limit on reserved tests~~ Remove rate limit on reserved tests Jun 16, 2025

shioyama force-pushed the shioyama/parallel_ci_queue branch from 65bfaed to b8cb23b Compare June 16, 2025 07:13

shioyama requested a review from casperisfine June 16, 2025 07:17

shioyama marked this pull request as ready for review June 16, 2025 07:18

casperisfine approved these changes Jun 16, 2025

View reviewed changes

ChrisBr approved these changes Jun 16, 2025

View reviewed changes

shioyama added 2 commits June 18, 2025 10:20

Allow reserving multiple tests

a01c867

Use reserved_tests reader

3035c5f

shioyama force-pushed the shioyama/parallel_ci_queue branch from 2f0f332 to 3035c5f Compare June 18, 2025 01:20

shioyama changed the base branch from cbruckmayer/check-we-executed-all-tests to main June 18, 2025 01:20

Update test to use reserved_tests

aaf2f7e

shioyama merged commit d06466a into main Jun 18, 2025
13 of 23 checks passed

shopify-shipit bot temporarily deployed to rubygems June 18, 2025 02:06 Inactive

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Remove rate limit on reserved tests#334

Remove rate limit on reserved tests#334
shioyama merged 3 commits intomainfrom
shioyama/parallel_ci_queue

shioyama commented May 23, 2025 •

edited

Loading

Uh oh!

ChrisBr May 23, 2025

Uh oh!

shioyama Jun 13, 2025 •

edited

Loading

Uh oh!

shioyama Jun 13, 2025

Uh oh!

shioyama Jun 13, 2025

Uh oh!

shioyama Jun 13, 2025

Uh oh!

shioyama commented Jun 16, 2025

Uh oh!

shioyama commented Jun 16, 2025

Uh oh!

casperisfine Jun 16, 2025

Uh oh!

shioyama commented Jun 18, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

shioyama commented May 23, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ChrisBr May 23, 2025

Choose a reason for hiding this comment

Uh oh!

shioyama Jun 13, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

shioyama Jun 13, 2025

Choose a reason for hiding this comment

Uh oh!

shioyama Jun 13, 2025

Choose a reason for hiding this comment

Uh oh!

shioyama Jun 13, 2025

Choose a reason for hiding this comment

Uh oh!

shioyama commented Jun 16, 2025

Uh oh!

shioyama commented Jun 16, 2025

Uh oh!

casperisfine Jun 16, 2025

Choose a reason for hiding this comment

Uh oh!

shioyama commented Jun 18, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

shioyama commented May 23, 2025 •

edited

Loading

shioyama Jun 13, 2025 •

edited

Loading