Skip to content

Fix root cause of nil return in queryWithRetry and labelsWithRetry#7375

Merged
friedrichg merged 2 commits intomasterfrom
fix-retry-root-cause
Mar 28, 2026
Merged

Fix root cause of nil return in queryWithRetry and labelsWithRetry#7375
friedrichg merged 2 commits intomasterfrom
fix-retry-root-cause

Conversation

@friedrichg
Copy link
Copy Markdown
Member

Summary

  • Propagate ctx.Err() from queryWithRetry and labelsWithRetry when the backoff loop exits without executing (e.g. context cancelled before the first attempt), preventing (nil, nil) returns
  • Remove the caller-side nil guard in streamingSelect added by Fix nil when ingesterQueryMaxAttempts > 1 #7369, since the root cause is now fixed at the source
  • Align labelsWithRetry early-return check to use <= 1 consistently with queryWithRetry

Context

PR #7369 fixed the immediate panic in streamingSelect by guarding against nil results at the call site. However the root cause — both retry functions returning (nil, nil) — was still present, leaving labelsWithRetry callers (LabelNames, LabelValues) silently returning empty results with nil error on cancelled context.

Test plan

  • Updated TestDistributorQuerier_Select_CancelledContext to assert context.Canceled error (not just no-panic)
  • Added TestDistributorQuerier_Labels_CancelledContext covering LabelNames and LabelValues with cancelled context and ingesterQueryMaxAttempts > 1
  • All existing retry tests pass

When ingesterQueryMaxAttempts > 1 and the context is cancelled before
the backoff loop starts, retries.Ongoing() returns false immediately
and the loop never executes, causing both retry functions to return
(nil, nil). This propagates ctx.Err() from the retry functions
themselves so all callers are protected, and removes the caller-side
nil guard added in #7369. Also aligns the early-return check in
labelsWithRetry to use <= 1 consistently with queryWithRetry.

Signed-off-by: Friedrich Gonzalez <1517449+friedrichg@users.noreply.github.com>
@friedrichg friedrichg force-pushed the fix-retry-root-cause branch from be6821e to a1ed0ca Compare March 25, 2026 06:53
@dosubot dosubot Bot added the lgtm This PR has been approved by a maintainer label Mar 26, 2026
Signed-off-by: Friedrich Gonzalez <1517449+friedrichg@users.noreply.github.com>
@friedrichg friedrichg merged commit d2596a3 into master Mar 28, 2026
64 of 65 checks passed
@friedrichg friedrichg deleted the fix-retry-root-cause branch March 28, 2026 20:10
CharlieTLe pushed a commit to CharlieTLe/cortex that referenced this pull request Apr 6, 2026
friedrichg added a commit that referenced this pull request Apr 16, 2026
…7375)

Signed-off-by: Friedrich Gonzalez <1517449+friedrichg@users.noreply.github.com>
friedrichg added a commit that referenced this pull request Apr 17, 2026
* Memberlist cas error code false positive (#7408)

* use errors.As in getCasErrorCode to unwrap memberlist errors

Signed-off-by: SungJin1212 <tjdwls1201@gmail.com>

* fix test

Signed-off-by: SungJin1212 <tjdwls1201@gmail.com>

---------

Signed-off-by: SungJin1212 <tjdwls1201@gmail.com>
Signed-off-by: Friedrich Gonzalez <1517449+friedrichg@users.noreply.github.com>

* Fix nil when ingesterQueryMaxAttempts > 1 (#7369)

* Trigger nil with test

Signed-off-by: Friedrich Gonzalez <1517449+friedrichg@users.noreply.github.com>

* Fix nil results

Signed-off-by: Friedrich Gonzalez <1517449+friedrichg@users.noreply.github.com>

* fix changelog

Signed-off-by: Friedrich Gonzalez <1517449+friedrichg@users.noreply.github.com>

---------

Signed-off-by: Friedrich Gonzalez <1517449+friedrichg@users.noreply.github.com>

* fix: alertmanager user config disappearing when ring is unreachable  (#7372)

* Fix multitenant alertmanager user config disappearing when ring is unreachable

Signed-off-by: Kishore K G <kishorekg@google.com>

* Add change log

Signed-off-by: Kishore K G <kishorekg@google.com>

* format multitenant

Signed-off-by: Kishore K G <kishorekg@google.com>

* fix pr number

Signed-off-by: Kishore K G <kishorekg@google.com>

* use ErrNotFound for error validation in unit test

Signed-off-by: kishorekg1999 <kishorekg.github@gmail.com>

---------

Signed-off-by: Kishore K G <kishorekg@google.com>
Signed-off-by: kishorekg1999 <kishorekg@google.com>
Signed-off-by: kishorekg1999 <kishorekg.github@gmail.com>
Signed-off-by: Friedrich Gonzalez <1517449+friedrichg@users.noreply.github.com>

* Clean Symbol Tables (#7373)

Signed-off-by: SungJin1212 <tjdwls1201@gmail.com>
Signed-off-by: Friedrich Gonzalez <1517449+friedrichg@users.noreply.github.com>

* Fix root cause of nil return in queryWithRetry and labelsWithRetry (#7375)

Signed-off-by: Friedrich Gonzalez <1517449+friedrichg@users.noreply.github.com>

* fix regex resolver match 0 or 1 tenant bug (#7424)

* fix regex resolver match 0 or 1 tenant bug

Signed-off-by: SungJin1212 <tjdwls1201@gmail.com>

* fix test

Signed-off-by: SungJin1212 <tjdwls1201@gmail.com>

---------

Signed-off-by: SungJin1212 <tjdwls1201@gmail.com>
Signed-off-by: Friedrich Gonzalez <1517449+friedrichg@users.noreply.github.com>

* skip nil values in Memberlist WatchPrefix (#7429)

* skip nil values in Memberlist WatchPrefix

Signed-off-by: SungJin1212 <tjdwls1201@gmail.com>

* fix lint

Signed-off-by: SungJin1212 <tjdwls1201@gmail.com>

---------

Signed-off-by: SungJin1212 <tjdwls1201@gmail.com>
Signed-off-by: Friedrich Gonzalez <1517449+friedrichg@users.noreply.github.com>

* Remove duplicate CHANGELOG entry for #7373

Signed-off-by: Friedrich Gonzalez <1517449+friedrichg@users.noreply.github.com>

* Fix integration test flag name for release-1.21

The cherry-pick of #7424 brought the master flag name
-limits.query-ingesters-within, but release-1.21 still uses
-querier.query-ingesters-within (renamed in #7160, master-only).

Signed-off-by: Friedrich Gonzalez <1517449+friedrichg@users.noreply.github.com>

---------

Signed-off-by: SungJin1212 <tjdwls1201@gmail.com>
Signed-off-by: Friedrich Gonzalez <1517449+friedrichg@users.noreply.github.com>
Signed-off-by: Kishore K G <kishorekg@google.com>
Signed-off-by: kishorekg1999 <kishorekg@google.com>
Signed-off-by: kishorekg1999 <kishorekg.github@gmail.com>
Co-authored-by: SungJin1212 <tjdwls1201@gmail.com>
Co-authored-by: kishorekg1999 <kishorekg.github@gmail.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

lgtm This PR has been approved by a maintainer size/M type/bug type/tests

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants