fix(e2e): strict wireserver validation — fail fast on unexpected curl exits by r2k1 · Pull Request #8580 · Azure/AgentBaker

r2k1 · 2026-05-25T00:32:57Z

What this PR does / why we need it:

Tightens the e2e wireserver-block validator so it fails fast on any unexpected curl exit code from an unprivileged pod, instead of retrying for a minute and only failing if no acceptable exit code ever showed up.

Why

validateWireServerBlocked is a security check: pods must not be able to reach the wireserver IP (168.63.129.16). The previous implementation retried for 1 minute, passing the check if curl eventually returned exit 28 (timeout). Two problems with that:

It treats the result of a security check as something to wait out — but a successful curl from a pod is never a transient condition. It means FORWARD DROP/REJECT rules are missing right now, and the test should surface that immediately.
The retry loop bounded the budget by time, not by observations. If the first curl returned exit 0 (reachable) but the last one returned 28 (timeout), the test would pass while a real regression was present somewhere in the window.

What

Whitelist exit codes 28 (FORWARD DROP timeout) and 7 (FORWARD REJECT refused) as the only valid "wireserver blocked" signals.
Anything else fails loudly with full diagnostics: FORWARD chain, KUBE-FORWARD chain, iptables-save filter, and conntrack entries for the wireserver IP. This makes regressions trivial to triage from the test log.
Retry the exec call only on transient kube-apiserver exec failures, never on the curl result itself — one observation of an unexpected exit code is enough to fail.

This is strictly more defensive than the original (which only accepted exit 28) because it also accepts REJECT-based blocks, while failing on every other class of regression instead of swallowing them.

Scope

e2e/validation.go only. Test-only change, no product code touched. Extracted from #8480.

Which issue(s) this PR fixes:

N/A — test-only hardening, no linked issue.

… exits The previous validation retried for 1 minute, passing if curl eventually timed out (exit 28). This had two problems: 1. Silently accepted other "success-looking" exit codes (e.g. 0 = reachable) if they happened on the last poll iteration in earlier variants. 2. Retried through what is fundamentally a binary security check — any successful curl from a pod means the FORWARD DROP/REJECT rules are missing or wrong, which is a regression to surface immediately, not a transient condition to wait out. Changes: - Whitelist exit codes 28 (FORWARD DROP timeout) and 7 (FORWARD REJECT refused) as the only valid "wireserver blocked" signals. - Anything else fails loudly with full diagnostics: FORWARD chain, KUBE-FORWARD chain, iptables-save filter, and conntrack entries for the wireserver IP. - Retry the exec call only on transient kube-apiserver exec failures, never on the curl result itself — a single observation of an unexpected exit code is enough to fail the security check. This is strictly more defensive than the original (which only accepted exit 28) because it also accepts REJECT-based blocks, while failing on every other class of regression instead of swallowing them. Extracted from #8480. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

Copilot

Pull request overview

This PR tightens the e2e security validator that ensures unprivileged pods cannot reach the Azure WireServer IP (168.63.129.16), changing it from “retry until timeout exit code appears” to “fail fast on any unexpected curl result,” while still retrying transient Kubernetes exec failures.

Changes:

Accept curl exit codes 28 (timeout/DROP) and 7 (connect failed/REJECT) as the only valid signals that WireServer is blocked.
Stop retrying based on curl outcomes; instead, fail immediately on any other curl exit code.
Add richer failure diagnostics (FORWARD + KUBE-FORWARD chain, iptables-save filter excerpt, conntrack entries) when an unexpected exit code occurs.

timmy-wright · 2026-05-25T01:10:16Z

 		},
 	}

+	allowedExitCodes := map[string]bool{"28": true, "7": true}


This would be easier to read with a list of allowable exit codes rather than a map. The constant time lookup benefit we get below doesn't seem worth it given the length of time these tests take to run.

Converting to slice won't improve much. You will trade it for slightly more comlicated lookup.
Either way it isolated to function and doesn't seem important

Copilot AI review requested due to automatic review settings May 25, 2026 00:32

r2k1 requested review from AbelHu, Devinwong, SriHarsha001, awesomenix, calvin197, cameronmeissner, djsly, ganeshkumarashok, junjiezhang1997, lilypan26, mxj220, pdamianov-dev, phealy, sulixu, surajssd, timmy-wright and zachary-bailey as code owners May 25, 2026 00:32

r2k1 temporarily deployed to test May 25, 2026 00:33 — with GitHub Actions Inactive

Copilot started reviewing on behalf of r2k1 May 25, 2026 00:33 View session

r2k1 mentioned this pull request May 25, 2026

fix(e2e): reduce E2E test flakiness (sandbox events, duplicate CSE timing) #8480

Merged

Copilot AI reviewed May 25, 2026

View reviewed changes

timmy-wright reviewed May 25, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(e2e): strict wireserver validation — fail fast on unexpected curl exits#8580

fix(e2e): strict wireserver validation — fail fast on unexpected curl exits#8580
r2k1 wants to merge 1 commit into
mainfrom
e2e-wireserver-strict-validation

r2k1 commented May 25, 2026 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

timmy-wright May 25, 2026

Uh oh!

r2k1 May 26, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

r2k1 commented May 25, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Why

What

Scope

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

timmy-wright May 25, 2026

Choose a reason for hiding this comment

Uh oh!

r2k1 May 26, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

r2k1 commented May 25, 2026 •

edited

Loading