test: extract site explorer power shelf tests by poroh · Pull Request #2250 · NVIDIA/infra-controller

poroh · 2026-06-05T17:39:07Z

Description

Move power shelf integration coverage out of api-core site explorer tests into the site-explorer crate, and introduce a shared test harness foundation for cross-crate test setup.

Type of Change

Add - New feature or capability
Change - Changes in existing functionality
Fix - Bug fixes
Remove - Removed features or deprecated functionality
Internal - Internal changes (refactoring, tests, docs, etc.)

Related Issues (Optional)

#2001

Breaking Changes

This PR contains breaking changes

Testing

Unit tests added/updated
Integration tests added/updated
Manual testing performed
No testing required (docs, internal refactor, etc.)

Additional Notes

coderabbitai · 2026-06-05T17:39:15Z

Important

Review skipped

Auto reviews are disabled on this repository. Please check the settings in the CodeRabbit UI or the .coderabbit.yaml file in this repository. To trigger a single review, invoke the @coderabbitai review command.

⚙️ Run configuration

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Enterprise

Run ID: db06db40-58ce-4669-9c7d-76f53a7d72ee

Walkthrough

This PR consolidates test infrastructure by establishing a centralized test-harness crate and migrating power-shelf integration tests from api-core to site-explorer. The changes expose API accessors and unified logging initialization, create a comprehensive test harness builder with network segment controllers, and replace ad-hoc power-shelf tests with a production-grade integration test suite.

Changes

API Core Test Support Consolidation

| Layer / File(s) | Summary |
| --- | --- |
| Centralized logging and API field accessors: `crates/api-core/src/test_support/mod.rs`, `crates/api-core/src/tests/mod.rs` | Introduces `setup_test_logging()` for uniform tracing subscriber configuration with environment filtering and panic-on-failure semantics. Exposes three `Api` accessor methods (`work_lock_manager_handle()`, `common_pools()`, `credential_manager()`) and exports `endpoint_explorer` test-support module. Test initialization delegates to the new centralized function. |
| Test fixture import reorganization: `crates/api-core/src/tests/common/api_fixtures/mod.rs` | Consolidates re-exports by grouping `endpoint_explorer` and `network_segment` from `crate::test_support`, removes local module declaration, and cleans up import statements. |

New Test-Harness Crate Infrastructure

| Layer / File(s) | Summary |
| --- | --- |
| Test harness core and builder pattern: `crates/test-harness/Cargo.toml`, `crates/test-harness/src/lib.rs`, `crates/test-harness/src/builder.rs` | Defines `TestHarness` struct with builder pattern for test composition. `TestHarnessBuilder::build()` either uses a provided `ApiHandle` or constructs a default API via transaction-scoped resource pool initialization, work-lock-manager keepalive loop, and metrics emission. Includes processor id generation, cancellation token lifecycle, and crate-level tracing initialization hook. |
| Network segment controller and types: `crates/test-harness/src/network/controller.rs`, `crates/test-harness/src/network/mod.rs`, `crates/test-harness/src/network/segment.rs`, `crates/test-harness/src/dns.rs` | Implements `TestNetworkController` that manually iterates `StateController` for network segments. Provides async helpers `create_underlay_segment()` and `create_admin_segment()` using fixture gateway/prefix values, domain context, and two-iteration advancement. Defines `TestDomain` (id + static name) and `TestNetworkSegment` (id + relay address) data types. |
| Prelude module: `crates/test-harness/src/prelude.rs` | Exports commonly-used test infrastructure: `sqlx_test` macro, database pool, testing utilities, `TestHarness`, `Api`, and RPC `Forge` server type for ergonomic test imports. |

Power-Shelf Test Suite Migration

| Layer / File(s) | Summary |
| --- | --- |
| Dependencies updated and old tests removed: `crates/site-explorer/Cargo.toml`, `crates/api-core/src/tests/site_explorer.rs` | Adds test-harness, bmc-vendor, and RPC test-support dev-dependencies to site-explorer. Removes 13 power-shelf-focused test cases and `FakePowerShelf` helper from api-core, retaining new `test_get_machine_position_info` gRPC tests. |
| Comprehensive power-shelf integration tests: `crates/site-explorer/tests/power_shelf.rs` | Implements extensive integration test suite: `FakePowerShelf` helper (moved from api-core) and async `Env` builder for test scaffolding. Covers DHCP/static IP discovery, deduplication, expected configuration validation, per-run creation limits, disable flag enforcement, error handling, direct `create_power_shelf()` behavior, and single/multi-shelf state history persistence with round-trip JSON serialization verification. Tests validate explored endpoint/report persistence, metric assertions, and error record storage. |

Estimated code review effort

🎯 4 (Complex) | ⏱️ ~60 minutes

This PR exhibits substantial scope: a new crate with non-trivial test infrastructure (harness builder with background task management, state controller iteration, RPC integration), significant test suite migration (removal of 13 tests from api-core, addition of 12 comprehensive integration tests in site-explorer), and API surface expansion. The test-harness builder requires careful review of lifecycle management (cancellation tokens, JoinSet cleanup, drop guards), transaction handling, and metric wiring. The power-shelf test suite is dense with multiple independent test scenarios, database assertions, metric validation, and state machine behavior. Cross-module refactoring (endpoint-explorer re-exports, logging consolidation) adds moderate coupling complexity. Heterogeneous changes across configuration, types, infrastructure, and integration tests require separate reasoning per layer.
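The lifecycle concern named above (a builder that starts background work, a cancellation signal, and cleanup on drop) can be sketched with the standard library alone. This is a minimal illustration, not the PR's implementation: the review mentions cancellation tokens and `JoinSet`, while the sketch below swaps in an `AtomicBool` and a plain thread, and every name in it (`SketchHarness`, `SketchHarnessBuilder`, `tick_counter`) is hypothetical.

```rust
use std::sync::atomic::{AtomicBool, AtomicU64, Ordering};
use std::sync::Arc;
use std::thread::{self, JoinHandle};
use std::time::Duration;

/// Hypothetical stand-in for a test harness that owns one background keepalive task.
pub struct SketchHarness {
    cancel: Arc<AtomicBool>,
    ticks: Arc<AtomicU64>,
    worker: Option<JoinHandle<()>>,
}

/// Hypothetical builder; the only knob is the keepalive interval.
#[derive(Default)]
pub struct SketchHarnessBuilder {
    keepalive_interval: Option<Duration>,
}

impl SketchHarnessBuilder {
    pub fn keepalive_interval(mut self, interval: Duration) -> Self {
        self.keepalive_interval = Some(interval);
        self
    }

    /// Starts the keepalive loop and hands ownership of it to the harness.
    pub fn build(self) -> SketchHarness {
        let interval = self.keepalive_interval.unwrap_or(Duration::from_millis(5));
        let cancel = Arc::new(AtomicBool::new(false));
        let ticks = Arc::new(AtomicU64::new(0));
        let (cancel_for_worker, ticks_for_worker) = (Arc::clone(&cancel), Arc::clone(&ticks));
        let worker = thread::spawn(move || loop {
            // Tick first, then check for cancellation, so at least one tick is recorded.
            ticks_for_worker.fetch_add(1, Ordering::Relaxed);
            if cancel_for_worker.load(Ordering::Acquire) {
                break;
            }
            thread::sleep(interval);
        });
        SketchHarness { cancel, ticks, worker: Some(worker) }
    }
}

impl SketchHarness {
    /// Shared counter of keepalive ticks, usable after the harness is dropped.
    pub fn tick_counter(&self) -> Arc<AtomicU64> {
        Arc::clone(&self.ticks)
    }
}

impl Drop for SketchHarness {
    /// Drop guard: signal cancellation, then wait for the worker to exit.
    fn drop(&mut self) {
        self.cancel.store(true, Ordering::Release);
        if let Some(worker) = self.worker.take() {
            let _ = worker.join();
        }
    }
}
```

Because `drop` joins the worker, no tick can be recorded after the harness goes out of scope, which is the property a reviewer would want to confirm for the real background tasks.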

Possibly related PRs

NVIDIA/infra-controller#2230: Both PRs modify crates/api-core/src/tests/site_explorer.rs power-shelf test helpers and fixtures, with this PR moving tests to site-explorer while the related PR updates the same test infrastructure.
NVIDIA/infra-controller#2149: The new Api accessor methods (work-lock-manager handle, common pools, credential manager) and endpoint_explorer re-exports introduced here directly support that PR's TestApiBuilder-based refactor of test fixture initialization.

🚥 Pre-merge checks | ✅ 5

✅ Passed checks (5 passed)

| Check name | Status | Explanation |
| --- | --- | --- |
| Title check | ✅ Passed | The title clearly and concisely summarizes the primary change: extracting power shelf integration tests from api-core into the site-explorer crate, which is evident from the changeset structure. |
| Docstring Coverage | ✅ Passed | Docstring coverage is 100.00% which is sufficient. The required threshold is 80.00%. |
| Linked Issues check | ✅ Passed | Check skipped because no linked issues were found for this pull request. |
| Out of Scope Changes check | ✅ Passed | Check skipped because no linked issues were found for this pull request. |
| Description check | ✅ Passed | The pull request description accurately reflects the changeset: moving power shelf integration tests from api-core to site-explorer and introducing a shared test harness foundation. |

coderabbitai

Actionable comments posted: 6

🧹 Nitpick comments (10)

crates/site-explorer/tests/power_shelf.rs (8)

1305-1314: 💤 Low value

Remove commented-out assertion code.

This large block of commented-out state version verification logic should be deleted if it's no longer needed, or re-enabled if it's required for the test.

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@crates/site-explorer/tests/power_shelf.rs` around lines 1305 - 1314, Remove
the large commented-out assertion block that builds state_versions from
final_state (the commented lines referencing state_versions, final_state, and
the assert! about state_versions.len() > 1); either delete these comment lines
entirely if the verification is no longer needed, or re-enable them by
uncommenting and ensuring the test compiles and uses
state_versions/state_version properly inside the test function where final_state
is available.

1087-1094: 💤 Low value

Remove commented-out code and second early return.

Lines 1089-1091 contain commented-out debug `println!` statements that should be deleted. Lines 1092-1094 contain another `if 1 == 1 { return Ok(()); }` block that's redundant given the earlier return at lines 1067-1069. Clean up this section entirely.

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@crates/site-explorer/tests/power_shelf.rs` around lines 1087 - 1094, Remove
the commented-out debug loop and the redundant early-return by deleting the
lines containing the commented
println!/env.run_power_shelf_controller_iteration() block and the `if 1 == 1 {
return Ok(()); }` statement so the test no longer contains dead commented code
or a second early return; search for the
`println!`/`env.run_power_shelf_controller_iteration().await` snippet and the
`if 1 == 1` block in the test and remove them, leaving the function flow as
originally intended with only the earlier return.

254-366: 💤 Low value

Remove commented-out code.

Line 350 contains a commented-out call to `explorer.run_single_iteration().await.unwrap();`. Either remove this line entirely or document why it's disabled with a clear TODO/FIXME comment.

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@crates/site-explorer/tests/power_shelf.rs` around lines 254 - 366, In
test_site_explorer_power_shelf_discovery_with_static_ip remove the commented-out
call to explorer.run_single_iteration().await.unwrap(); (or replace it with a
one-line TODO/FIXME explaining why it's disabled) so there is no stray commented
code left in the test; locate the commented line referencing
explorer.run_single_iteration() and either delete it or add a concise TODO/FIXME
comment directly above the line documenting the reason for disabling the extra
iteration.

1481-1491: ⚡ Quick win

Avoid unused variable pattern.

Line 1481 uses let _history_by_ids = ... followed by commented-out assertions at lines 1489-1491. If this query result is genuinely unused, use let _ = ... (without a variable name) to make the intent explicit. Otherwise, re-enable the assertions or remove the query entirely.
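The two spellings are not interchangeable in general, which is why making the intent explicit matters. A minimal sketch of the difference, using a hypothetical `Noisy` type that logs when it is dropped: `let _name = ...` keeps the value alive until the end of the scope, while `let _ = ...` drops a temporary at the end of the statement. For a plain query result like the one in this test the difference is immaterial; it becomes significant for guards and handles.

```rust
use std::cell::RefCell;
use std::rc::Rc;

/// Hypothetical helper: records a message in a shared log when dropped.
struct Noisy {
    name: &'static str,
    log: Rc<RefCell<Vec<String>>>,
}

impl Drop for Noisy {
    fn drop(&mut self) {
        self.log.borrow_mut().push(format!("drop {}", self.name));
    }
}

/// Returns the order of events produced by the two binding styles.
pub fn drop_order() -> Vec<String> {
    let log = Rc::new(RefCell::new(Vec::new()));
    {
        // `_named` is a real binding: the value lives until the end of this scope.
        let _named = Noisy { name: "named", log: Rc::clone(&log) };
        // `_` is a wildcard pattern: the temporary is dropped when this statement ends.
        let _ = Noisy { name: "wildcard", log: Rc::clone(&log) };
        log.borrow_mut().push("end of scope".to_string());
    }
    // Bind before returning so the `Ref` borrow ends before `log` is dropped.
    let events = log.borrow().clone();
    events
}
```

Calling `drop_order()` yields `"drop wildcard"`, then `"end of scope"`, then `"drop named"`.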

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@crates/site-explorer/tests/power_shelf.rs` around lines 1481 - 1491, The
local binding `_history_by_ids` is unused after calling
`db::state_history::find_by_object_ids` in the test; either replace `let
_history_by_ids = ...` with `let _ =
db::state_history::find_by_object_ids(...).await?;` to explicitly discard the
result, or re-enable the commented assertions that reference `history_by_ids`
(and rename `_history_by_ids` to `history_by_ids`) so the query result is
actually asserted against; update the call site around
`db::state_history::find_by_object_ids`, the `_history_by_ids` binding, and the
commented lines that assert
`contains_key(&power_shelf1_id)`/`contains_key(&power_shelf2_id)` accordingly.

1700-1720: ⚖️ Poor tradeoff

Fragile JSON comparison using string manipulation.

Lines 1701-1702 and 1717-1719 compare JSON by removing spaces with `.replace(" ", "")`. This approach is brittle and fails if the serialization format changes in other ways (key ordering, escaping, etc.). Consider one of:

- Deserializing both strings back to the state enum and comparing structurally
- Using a proper JSON equality library
- Comparing the typed `PowerShelfControllerState` values directly without round-tripping through JSON
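To make the brittleness concrete, here is a small standard-library sketch of the space-stripping comparison. The function names and the JSON payloads used with it are hypothetical and are not taken from the test file. It reproduces only the failure modes; a structural comparison would still need a JSON parser such as the `serde_json::Value` approach the review proposes.

```rust
/// Mirrors the flagged normalization: remove every ASCII space.
pub fn strip_spaces(json: &str) -> String {
    json.replace(' ', "")
}

/// String equality after space stripping, as in the flagged assertions.
pub fn naive_json_eq(a: &str, b: &str) -> bool {
    strip_spaces(a) == strip_spaces(b)
}

/// Case the approach handles: only inter-token spaces differ.
pub fn spacing_only_matches() -> bool {
    naive_json_eq(r#"{"state": "ready"}"#, r#"{"state":"ready"}"#)
}

/// False negative: the same object with keys in a different order.
pub fn reordered_keys_match() -> bool {
    naive_json_eq(r#"{"a":1,"b":2}"#, r#"{"b":2,"a":1}"#)
}

/// False negative: pretty-printing with newlines instead of spaces.
pub fn newline_formatting_matches() -> bool {
    naive_json_eq("{\n\"a\":1\n}", r#"{"a":1}"#)
}

/// False positive: spaces inside string values are stripped as well,
/// so two different values compare equal.
pub fn different_values_match() -> bool {
    naive_json_eq(r#"{"note":"power on"}"#, r#"{"note":"poweron"}"#)
}
```

The first function returns `true`, the next two return `false` even though the documents are equivalent, and the last returns `true` even though the documents differ.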

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@crates/site-explorer/tests/power_shelf.rs` around lines 1700 - 1720, The test
currently compares JSON strings by stripping spaces
(history_entry.state.replace(" ", "") vs serde_json::to_string(&state)?), which
is brittle; instead parse the stored JSON and compare structurally — either
deserialize history_entry.state into the concrete type (e.g.,
PowerShelfControllerState) and assert equality with state, or deserialize both
into serde_json::Value via serde_json::from_str and compare the Value to
serde_json::to_value(&state). Update the assertions around history_entry.state
and found_entry.unwrap().state to perform these structural
deserializations/comparisons rather than string replacement.

1279-1286: 💤 Low value

Remove commented-out code and redundant early return.

Lines 1281-1283 contain commented-out iteration code. Lines 1284-1286 contain another if 1 == 1 { return Ok(()); } block that's redundant. Remove this entire section.

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@crates/site-explorer/tests/power_shelf.rs` around lines 1279 - 1286, Remove
the commented-out loop and the redundant early-return block: delete the three
commented lines calling env.run_power_shelf_controller_iteration() and the
subsequent if 1 == 1 { return Ok(()); } so the test continues normally; look for
the env.run_power_shelf_controller_iteration() commented loop and the literal if
1 == 1 return Ok(()) and remove those lines together.

127-1755: 🏗️ Heavy lift

Consider extracting common test setup patterns.

The test suite contains significant code duplication:

- `EndpointExplorationReport` construction (appears ~11 times with similar fields)
- DHCP discovery + power shelf creation flow (repeated in multiple tests)
- Expected power shelf database seeding

Extract these into helper methods on Env or standalone functions to improve maintainability. For example:

```rust
impl Env {
    fn create_basic_exploration_report(&self, serial: &str, model: &str) -> EndpointExplorationReport {
        EndpointExplorationReport {
            endpoint_type: EndpointType::Bmc,
            vendor: Some(bmc_vendor::BMCVendor::Nvidia),
            systems: vec![ComputerSystem {
                serial_number: Some(serial.to_string()),
                ..Default::default()
            }],
            chassis: vec![Chassis {
                model: Some(model.to_string()),
                ..Default::default()
            }],
            model: Some(model.to_string()),
            ..Default::default()
        }
    }

    async fn setup_power_shelf_with_dhcp(&self, power_shelf: &mut FakePowerShelf) -> Result<(), Box<dyn std::error::Error>> {
        let response = self.api()
            .discover_dhcp(
                DhcpDiscovery::builder(
                    power_shelf.bmc_mac_address.to_string(),
                    power_shelf.relay_address.to_string(),
                )
                .tonic_request(),
            )
            .await?
            .into_inner();
        power_shelf.ip = response.address;
        Ok(())
    }
}
```

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@crates/site-explorer/tests/power_shelf.rs` around lines 127 - 1755, Several
tests duplicate constructing EndpointExplorationReport, running DHCP discovery
to set FakePowerShelf.ip, and seeding expected_power_shelf rows; extract helpers
to reduce duplication: add Env::create_basic_exploration_report(&self, serial:
&str, model: &str) returning EndpointExplorationReport and
Env::setup_power_shelf_with_dhcp(&self, power_shelf: &mut FakePowerShelf) ->
Result<(), Box<dyn std::error::Error>> to perform the DhcpDiscovery call and set
power_shelf.ip, plus a helper Env::seed_expected_power_shelf(&self, power_shelf:
&FakePowerShelf) that wraps db::expected_power_shelf::create; update tests like
test_site_explorer_power_shelf_discovery,
test_site_explorer_power_shelf_with_static_ip,
test_site_explorer_power_shelf_creation_limit, etc. to call these helpers and
replace the repeated EndpointExplorationReport literals with
create_basic_exploration_report calls.

1513-1520: 💤 Low value

Remove commented-out iteration code.

Lines 1515-1517 contain commented-out controller iteration calls that should be deleted.

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@crates/site-explorer/tests/power_shelf.rs` around lines 1513 - 1520, Delete
the commented-out controller iteration block (the three lines calling
env.run_power_shelf_controller_iteration() that are prefixed with //) so the
test no longer contains dead commented code; keep the surrounding TODO comment
if desired and leave the remaining early return logic unchanged—look for the
env.run_power_shelf_controller_iteration() calls to locate the commented lines
to remove.

crates/test-harness/src/network/controller.rs (1)

68-110: ⚡ Quick win

Document the state iteration count requirement.

Both create_underlay_segment() and create_admin_segment() run exactly 2 state controller iterations (lines 101-102, 145-146) without explaining why this specific count is needed. This "magic number" should be documented to clarify the provisioning lifecycle requirements.

📋 Suggested documentation

```diff
     let segment = self
         .api
         .create_network_segment(tonic::Request::new(request))
         .await
         .expect("Unable to create network segment")
         .into_inner();

+    // Run two iterations to fully provision the segment:
+    // 1st iteration: allocate resources (VLAN, VNI)
+    // 2nd iteration: transition to active state
     self.run_single_iteration().await;
     self.run_single_iteration().await;

     TestNetworkSegment {
```
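One way to see why a fixed iteration count reads as a magic number is to model the controller as a state machine in which each iteration performs at most one transition. The sketch below is purely illustrative: the review does not show the real segment states, so `SegmentState` and its three variants are assumptions, as is the `run_until_ready` helper.

```rust
/// Hypothetical segment lifecycle; the real controller's states are not shown in the review.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub enum SegmentState {
    Provisioning,
    Allocated,
    Ready,
}

impl SegmentState {
    /// One controller iteration performs at most one transition.
    pub fn step(self) -> SegmentState {
        match self {
            SegmentState::Provisioning => SegmentState::Allocated,
            SegmentState::Allocated | SegmentState::Ready => SegmentState::Ready,
        }
    }
}

/// Iterates until `Ready` or until `max_iterations` is exhausted.
/// Returns the final state and the number of iterations actually run.
pub fn run_until_ready(start: SegmentState, max_iterations: usize) -> (SegmentState, usize) {
    let mut state = start;
    for used in 0..max_iterations {
        if state == SegmentState::Ready {
            return (state, used);
        }
        state = state.step();
    }
    (state, max_iterations)
}
```

Under these assumptions a fresh segment needs exactly two iterations, and a bounded "iterate until ready" helper would state that requirement in code rather than in a comment. Whether that fits the real `run_single_iteration()` is for the author to judge.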

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@crates/test-harness/src/network/controller.rs` around lines 68 - 110, Add a
brief explanatory comment above the two run_single_iteration().await calls in
both create_underlay_segment() and create_admin_segment() that documents why
exactly two state controller iterations are required for provisioning (e.g.,
first iteration to enqueue/apply the network creation and a second to reconcile
and persist resulting state/allocate IPs), mention any assumptions (such as
async job scheduling or reconciliation semantics), and reference
run_single_iteration() so future readers understand this “magic number” is
intentional and tied to the controller’s two-step provisioning lifecycle.

crates/test-harness/src/lib.rs (1)

57-72: 💤 Low value

Consider consolidating error handling in test setup.

The test_domain() method chains multiple .unwrap() calls, including a particularly awkward .map().unwrap().unwrap() pattern at lines 68-70. While panicking on test setup failure is acceptable, the readability could be improved.

♻️ Suggested refactor for clarity

```diff
 pub async fn test_domain(&self) -> TestDomain {
     let name = "testharness.example.com";
-    let id = self
+    let response = self
         .api
         .create_domain(Request::new(rpc::protos::dns::CreateDomainRequest {
             name: name.to_string(),
         }))
         .await
-        .unwrap()
-        .into_inner()
-        .id
-        .map(::carbide_uuid::domain::DomainId::try_from)
-        .unwrap()
-        .unwrap();
+        .expect("create_domain RPC failed");
+
+    let proto_id = response
+        .into_inner()
+        .id
+        .expect("create_domain returned no id");
+
+    let id = ::carbide_uuid::domain::DomainId::try_from(proto_id)
+        .expect("invalid DomainId from create_domain");
+
     TestDomain { id, name }
 }
```
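The `.map(...).unwrap().unwrap()` shape arises because the value is an `Option` whose contents still need a fallible conversion. As a standalone illustration, the same two failure points can be flattened into a single `Result` with `ok_or` and `and_then`. The types below (`SmallId`, `IdError`) are hypothetical stand-ins and are unrelated to `DomainId`; in test setup, panicking via `expect` as suggested above remains a reasonable choice.

```rust
use std::convert::TryFrom;

/// Hypothetical error type naming each failure point explicitly.
#[derive(Debug, PartialEq, Eq)]
pub enum IdError {
    Missing,
    Invalid(i64),
}

/// Hypothetical id type standing in for a validated identifier.
#[derive(Debug, PartialEq, Eq)]
pub struct SmallId(pub u8);

impl TryFrom<i64> for SmallId {
    type Error = IdError;

    fn try_from(raw: i64) -> Result<Self, Self::Error> {
        // Out-of-range values are reported together with the offending input.
        u8::try_from(raw).map(SmallId).map_err(|_| IdError::Invalid(raw))
    }
}

/// Flattens "maybe absent, maybe invalid" into one `Result`.
pub fn parse_id(raw: Option<i64>) -> Result<SmallId, IdError> {
    raw.ok_or(IdError::Missing).and_then(SmallId::try_from)
}
```

A caller that still wants to panic can write `parse_id(value).expect("...")` once, and the error variant identifies which step failed.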

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@crates/test-harness/src/lib.rs` around lines 57 - 72, The test_domain
function uses multiple chained .unwrap() calls (notably the
.map(::carbide_uuid::domain::DomainId::try_from).unwrap().unwrap()) which hurts
readability; replace these with explicit, contextual failures by using expect()
with clear messages on the RPC/result unwraps and convert the map/unwrap/unwrap
into .map(|s| ::carbide_uuid::domain::DomainId::try_from(s).expect("failed to
parse DomainId from response")) (or alternatively pattern-match the
Option/Result to produce a clear expect message) so callers still panic on setup
failure but the failure points (create_domain RPC, missing id, parse error) are
explicit; update test_domain, create_domain call handling and the DomainId
conversion accordingly while preserving the returned TestDomain { id, name }.

🤖 Prompt for all review comments with AI agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Inline comments:
In `@crates/api-core/src/tests/site_explorer.rs`:
- Line 3239: Remove the redundant local import of the Forge trait by deleting
the line that says "use rpc::forge::forge_server::Forge;" in this test file; the
Forge trait is already imported at the module level, so remove the duplicate to
avoid unused/duplicate imports while leaving the module-level import intact.
- Around line 3292-3303: The test function
test_get_machine_position_info_no_endpoint is misnamed and miscommented because
create_managed_host actually creates an explored endpoint; rename the test
(e.g., test_get_machine_position_info_endpoint_without_position) and update the
leading comment to state that an explored endpoint exists but contains no
position data, and remove the redundant use rpc::forge::forge_server::Forge
import (duplicate of the previous test) so imports are not repeated; keep
existing variables like dpu_machine_id and the rest of the test logic unchanged.

In `@crates/site-explorer/tests/power_shelf.rs`:
- Around line 1319-1520: The test_power_shelf_state_history_multiple test is
prematurely short-circuited by the unconditional early return (if 1 == 1 {
return Ok(()); }), so the later state-history assertions never run; remove the
early-return block (the "if 1 == 1 { return Ok(()); }" stub) and either
implement the intended controller iterations and assertions (call
env.run_power_shelf_controller_iteration() the required number of times and then
run the assertions that follow) or, if not ready, mark the test with #[ignore]
and keep the rest of the test intact so the current assertions are either
executed or intentionally skipped.
- Around line 929-1069: The test_site_explorer_creates_power_shelf test is
short-circuited by the early return (if 1 == 1 { return Ok(()); }) which
prevents the final controller state assertions from running; either remove that
conditional return and call/run the controller iteration
(env.run_power_shelf_controller_iteration().await) and then re-enable the
subsequent assertions that verify state transitions, or mark the test with
#[ignore] and add a TODO explaining which state-machine wiring is required to
enable the controller assertions; update references in the test to
test_site_explorer_creates_power_shelf,
env.run_power_shelf_controller_iteration, and the post-return assertions
accordingly.
- Around line 1117-1247: The test_power_shelf_state_history test is being
short-circuited by the unconditional early return (if 1 == 1 { return Ok(());
}), so remove that guard and either (A) invoke the real controller iteration
(call env.run_power_shelf_controller_iteration().await; or the proper async
method that advances the power shelf state machine) and then run the remaining
state history assertions against db::state_history::for_object, or (B) if the
state machine isn't ready, mark the test with #[ignore] and add a brief comment
explaining why; ensure references to created_power_shelf.id / power_shelf_id and
the post-iteration assertions remain reachable.

In `@crates/test-harness/Cargo.toml`:
- Around line 18-23: The Cargo.toml for the crate named "carbide-test-harness"
is missing the required package description; open the [package] block in that
Cargo.toml and add a descriptive description field (e.g., description = "Short
one-line summary of the crate's purpose") so the package metadata includes a
clear description for the carbide-test-harness crate.
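
The difference between the `if 1 == 1` early return and `#[ignore]`, as raised in the three power-shelf comments above, can be sketched as follows. The helper `controller_reached_ready` and both test names are hypothetical. A plausible reason for the `1 == 1` form is that a bare `return` would trigger the compiler's unreachable-code warning, but that is an assumption about the author's intent.

```rust
/// Hypothetical stand-in for a controller-driven check that is not wired up yet.
pub fn controller_reached_ready(iterations: u32) -> bool {
    iterations >= 2
}

#[cfg(test)]
mod sketch_tests {
    use super::*;

    // Flagged pattern: the test is reported as passing, but the assertion never runs.
    #[test]
    fn short_circuited() -> Result<(), Box<dyn std::error::Error>> {
        if 1 == 1 {
            return Ok(());
        }
        assert!(controller_reached_ready(0));
        Ok(())
    }

    // Alternative: the test is reported as ignored, with the reason shown in the output.
    #[test]
    #[ignore = "TODO: requires power shelf state controller wiring"]
    fn state_history_after_controller_iterations() {
        assert!(controller_reached_ready(2));
    }
}
```

With `#[ignore]`, `cargo test` lists the test as ignored instead of counting it as a pass, and `cargo test -- --ignored` runs it once the missing wiring exists.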

ℹ️ Review info

⚙️ Run configuration

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Enterprise

Run ID: 4790ace5-95d1-4ea4-a758-15de3d1d2bfe

📥 Commits

Reviewing files that changed from the base of the PR and between d7aa6dc and b5a448b.

⛔ Files ignored due to path filters (1)

Cargo.lock is excluded by !**/*.lock

📒 Files selected for processing (15)

crates/api-core/src/test_support/endpoint_explorer.rs
crates/api-core/src/test_support/mod.rs
crates/api-core/src/tests/common/api_fixtures/mod.rs
crates/api-core/src/tests/mod.rs
crates/api-core/src/tests/site_explorer.rs
crates/site-explorer/Cargo.toml
crates/site-explorer/tests/power_shelf.rs
crates/test-harness/Cargo.toml
crates/test-harness/src/builder.rs
crates/test-harness/src/dns.rs
crates/test-harness/src/lib.rs
crates/test-harness/src/network/controller.rs
crates/test-harness/src/network/mod.rs
crates/test-harness/src/network/segment.rs
crates/test-harness/src/prelude.rs

github-actions · 2026-06-05T20:56:44Z

🌿 Preview your docs: https://nvidia-preview-pull-request-2250.docs.buildwithfern.com/infra-controller

Move power shelf integration coverage out of api-core site explorer tests into the site-explorer crate, and introduce a shared test harness foundation for cross-crate test setup. Signed-off-by: Dmitry Porokh <dporokh@nvidia.com>

poroh requested a review from a team as a code owner June 5, 2026 17:39

coderabbitai Bot reviewed Jun 5, 2026

poroh force-pushed the tests-refactoring-p3 branch from b5a448b to fa2e40f Compare June 5, 2026 18:01

wminckler approved these changes Jun 5, 2026

poroh enabled auto-merge (squash) June 5, 2026 20:23

poroh force-pushed the tests-refactoring-p3 branch from fa2e40f to c0cbea2 Compare June 5, 2026 20:55

wminckler approved these changes Jun 5, 2026

test: extract site explorer power shelf tests

2dedd22

Move power shelf integration coverage out of api-core site explorer tests into the site-explorer crate, and introduce a shared test harness foundation for cross-crate test setup. Signed-off-by: Dmitry Porokh <dporokh@nvidia.com>

poroh force-pushed the tests-refactoring-p3 branch from c0cbea2 to 2dedd22 Compare June 5, 2026 22:05

chet approved these changes Jun 5, 2026

poroh wants to merge 1 commit into NVIDIA:main from poroh:tests-refactoring-p3
