
WIP test-bot: parallelize dependent test with sharding #21871

Draft
GunniBusch wants to merge 1 commit into Homebrew:main from GunniBusch:feat/add-parallel-dependent-testing

Conversation

Contributor

@GunniBusch GunniBusch commented Mar 29, 2026


  • Have you followed the guidelines in our Contributing document?
  • Have you checked to ensure there aren't other open Pull Requests for the same change?
  • Have you added an explanation of what your changes do and why you'd like us to include them?
  • Have you written new tests (excluding integration tests) for your changes? Here's an example.
  • Have you successfully run brew lgtm (style, typechecking and tests) with your changes locally?

  • AI was used to generate or assist with generating this PR. Please specify below how you used AI to help you, and what steps you have taken to manually verify the changes.

AI was used to help iterate on the implementation and tests. The resulting changes were manually reviewed and verified locally with ./bin/brew lgtm --online.


Info: This is still WIP and not the final version. I am opening this PR early so I can test the branch in CI and make progress visible.

This PR changes dependent test-bot execution to use sharding.

The goal is to reduce total dependent-test runtime by splitting the dependent workload across multiple shard instances while keeping the implementation small and reusing the existing runner-selection flow.

The current approach is:

  • determine how many runners to use for a dependent runner category from the dependent count and the configured spillover settings
  • export shard metadata in the runner matrix without changing the default non-sharded path
  • assign each dependent to exactly one shard by deterministic modulo partitioning
  • keep the shard workloads balanced to within one dependent of each other when divisibility does not allow a perfectly even split
  • avoid duplicate dependent execution across shards, including dependents reachable from multiple changed formulae
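A minimal Ruby sketch of the partitioning steps above, using illustrative names rather than the actual test-bot API: every shard derives the same sorted dependent universe locally, then keeps only the entries it owns by index modulo shard count.

```ruby
# Illustrative sketch (not the actual test-bot code): deterministic
# modulo partitioning over a locally recomputed dependent universe.
def owned_dependents(dependent_names, shard_index, shard_count)
  # Every shard computes the same universe: duplicates collapsed
  # (dependents reachable from multiple changed formulae), then sorted
  # so all jobs agree on the ordering without any coordination.
  universe = dependent_names.uniq.sort
  # Each name lands on exactly one shard; shard sizes differ by at
  # most one when the count is not evenly divisible.
  universe.select.with_index { |_name, i| i % shard_count == shard_index }
end

names = %w[wget curl jq wget] # "wget" reachable from two changed formulae
owned_dependents(names, 0, 2) # => ["curl", "wget"]
owned_dependents(names, 1, 2) # => ["jq"]
```

Because each job recomputes and sorts the same universe, the partition is deterministic across jobs, and the union of all shards covers every dependent exactly once.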

The intent is to make dependent testing faster without introducing a breaking change for the current homebrew-core CI pipeline.

Overall flow

```mermaid
flowchart TB
  cf[changed formulae] --> tr[determine-test-runners]
  tr --> rm[runner matrix]
  rm --> rc[runner category]
  rc --> n[n shard jobs]

  note[each shard job recomputes the same sorted dependent universe locally]

  subgraph j0[job 0]
    d0[collect names]
    u0[uniq plus sort]
    p0[index 0 total 2]
    a0[owned dependents]
    t0[install and test]
    d0 --> u0 --> p0 --> a0 --> t0
  end

  subgraph j1[job 1]
    d1[collect names]
    u1[uniq plus sort]
    p1[index 1 total 2]
    a1[owned dependents]
    t1[install and test]
    d1 --> u1 --> p1 --> a1 --> t1
  end

  n --> d0
  n --> d1
  note -.-> d0
  note -.-> d1
```

PS: Currently this is opt-in

Configuration details

Shard Count From Dependents + Config

```mermaid
flowchart TD
  d[dependent count] --> m{runner penalty == 0?}
  t[base threshold] --> l[count / threshold]
  d --> l

  d --> q[quadratic solve]
  t --> q
  p[runner penalty] --> q

  m -- yes --> l
  m -- no --> q

  l --> c[ceil]
  q --> c

  c --> k[clamp to max runners]
  x[max runners] --> k
  k --> o[selected shard count]
```

Shard count selection

This is the actual sizing rule used by the implementation: choose the smallest r where coverage(r) = r*t + p*r*(r-1) reaches the compatible dependent count, then clamp to max runners.

What the knobs do:

  • higher t moves all expansion cutoffs to the right, so runners expand later overall
  • higher p leaves the first split unchanged, but pushes later splits out, so extra runners become harder to unlock
  • higher max runners only raises the cap; it does not change the cutoff curve
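As a sketch, the sizing rule above can be written as a linear search for the smallest viable runner count. The names here are assumptions for illustration, not the actual implementation:

```ruby
# Illustrative sketch of the sizing rule (not the actual implementation).
# coverage(r) = r*t + p*r*(r-1): how many dependents r runners can absorb
# given base threshold t and runner penalty p.
def shard_count(dependents, threshold:, penalty:, max_runners:)
  coverage = ->(r) { r * threshold + penalty * r * (r - 1) }
  # Smallest r whose coverage reaches the dependent count; nil means
  # even max_runners falls short, so clamp to the cap.
  (1..max_runners).find { |r| coverage.call(r) >= dependents } || max_runners
end

# With penalty 0 this reduces to ceil(dependents / threshold):
shard_count(20, threshold: 6, penalty: 0, max_runners: 8) # => 4
# A higher penalty inflates each runner's coverage, so extra runners
# unlock only at higher dependent counts (fewer runners for the same load):
shard_count(20, threshold: 6, penalty: 4, max_runners: 8) # => 2
```

Note that coverage(1) = t regardless of p, which matches the observation above that the penalty leaves the first split unchanged.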

@GunniBusch GunniBusch force-pushed the feat/add-parallel-dependent-testing branch 2 times, most recently from 12de736 to 68d5e26 Compare March 29, 2026 17:17
@GunniBusch GunniBusch changed the title WIP test-bot: parallelize bottled dependents WIP test-bot: parallelize dependent test Mar 29, 2026
@GunniBusch GunniBusch force-pushed the feat/add-parallel-dependent-testing branch 15 times, most recently from ece3685 to 3025738 Compare March 29, 2026 20:22
Member

@MikeMcQuaid MikeMcQuaid left a comment


Hey @GunniBusch! I think this misunderstands the goal of "parallelising" here; we're not wanting to parallelise within a single GitHub Actions worker/job but to fan out to many different GitHub Actions workers/jobs.

Let me know if that doesn't make sense!

@GunniBusch
Contributor Author

> Hey @GunniBusch! I think this misunderstands the goal of "parallelising" here; we're not wanting to parallelise within a single GitHub Actions worker/job but to fan out to many different GitHub Actions workers/jobs.
>
> Let me know if that doesn't make sense!

Good to know, so like my previous one. I will update this PR.

@GunniBusch GunniBusch changed the title WIP test-bot: parallelize dependent test WIP test-bot: parallelize dependent test with sharding Mar 30, 2026
@GunniBusch GunniBusch force-pushed the feat/add-parallel-dependent-testing branch 2 times, most recently from 634fcb7 to 9542419 Compare March 30, 2026 20:58
@GunniBusch
Contributor Author

GunniBusch commented Mar 30, 2026

> Hey @GunniBusch! I think this misunderstands the goal of "parallelising" here; we're not wanting to parallelise within a single GitHub Actions worker/job but to fan out to many different GitHub Actions workers/jobs.
>
> Let me know if that doesn't make sense!

One question indeed.

Do you have any preferences for how this should be configured?
Right now I’m using environment variables for settings such as max_runners, including per-runner-type overrides.

If anything is unclear or you want a different configuration approach, let me know.
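Purely for illustration, such an environment-variable scheme might look like the following. These variable names are hypothetical and not necessarily the ones used by this branch:

```shell
# Hypothetical variable names, for illustration only.
export HOMEBREW_TEST_BOT_DEPENDENT_SHARDING=1  # opt in to sharding
export HOMEBREW_TEST_BOT_MAX_RUNNERS=8         # global cap on shard jobs
export HOMEBREW_TEST_BOT_MAX_RUNNERS_MACOS=4   # per-runner-type override
```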

@GunniBusch GunniBusch force-pushed the feat/add-parallel-dependent-testing branch 6 times, most recently from 744a9a0 to 2bd8a98 Compare March 30, 2026 23:26
@GunniBusch GunniBusch force-pushed the feat/add-parallel-dependent-testing branch 7 times, most recently from 866ed91 to 6716ba0 Compare March 31, 2026 01:14
@GunniBusch GunniBusch force-pushed the feat/add-parallel-dependent-testing branch from 6716ba0 to fe25bbf Compare March 31, 2026 01:26
@MikeMcQuaid
Member

@GunniBusch Look at dev-cmd/generate-cask-ci-matrix.rb, that's an example of an existing matrix fanout implementation.

Just a high-level comment: I'm happy for you to try this but managing the rollout of this and testing it on our production CI infrastructure is going to be very hard/time-consuming given you're not a maintainer.

@GunniBusch
Contributor Author

> @GunniBusch Look at dev-cmd/generate-cask-ci-matrix.rb, that's an example of an existing matrix fanout implementation.
>
> Just a high-level comment: I'm happy for you to try this but managing the rollout of this and testing it on our production CI infrastructure is going to be very hard/time consuming give you're not a maintainer.

I already assumed that. I did my testing in a test repo: https://github.com/GunniBusch/brew-dependent-shard-e2e (the latest test: https://github.com/GunniBusch/brew-dependent-shard-e2e/actions/runs/23776799656).

I hope this might be at least a bit useful for the future. But thanks for letting me know.

@MikeMcQuaid
Member

@GunniBusch The repository is a good testing ground but the main thing that's probably worth doing is figuring out a gradual rollout strategy.

@GunniBusch
Contributor Author

> @GunniBusch The repository is a good testing ground but the main thing that's probably worth doing is figuring out a gradual rollout strategy.

This is exactly what I was thinking. Imagine pushing this and breaking homebrew-core workflows.

I think the best solution would be to make this opt-in at first, so that by default there is no change in behaviour (this is currently the case).

What I had in mind for a rollout: it might be useful to have a label in homebrew-core that maintainers could add to selected PRs to enable this fan-out (and remove it if something breaks) while testing.

That change would not even have to happen in the brew repository, because I already added an ENV variable to enable or disable (opt in to) sharding.

@MikeMcQuaid
Member

> I think the best solution would be to make this opt in at first. This means, by default there is no change in behaviour. (this is currently the case).

Yeh, I just don't think we can merge it without enough tests to be convinced that this is gonna work. Look at the test-bot CI jobs for potential inspiration here.

> What I had in mind how one could do a roll out like this, it would might be useful to have a label in homebrew core that maintainers could add to some selected prs to enable this fan-out (and disable if something broke) and test.

Yes, labels to enable/disable are a good fit here.

Would be good to open that (draft) PR too.

Also, here or in the other PR, it'd be good to write out a proposal for how we do a stage-by-stage rollout. I can help provide feedback and input on that.

@GunniBusch
Contributor Author

>> I think the best solution would be to make this opt in at first. This means, by default there is no change in behaviour. (this is currently the case).
>
> Yeh, I just don't think we can merge it without enough tests to be convinced that this is gonna work. Look at the test-bot CI jobs for potential inspiration here.
>
>> What I had in mind how one could do a roll out like this, it would might be useful to have a label in homebrew core that maintainers could add to some selected prs to enable this fan-out (and disable if something broke) and test.
>
> Yes, labels to enable/disable are a good fit here.
>
> Would be good to open that (draft) PR too.
>
> Also, here or in the other PR, it'd be good to write out a proposal for how we do a stage-by-stage rollout. I can help provide feedback and input on that.

No problem, will do that, but will probably start next week as I have 2 exams there.
