Update gold_image_request.py #205

vizsatiz · 2026-01-10T15:06:11Z

Summary by CodeRabbit

Enhancements
- Image metadata fields now accept structured/hierarchical data (not just plain text), enabling richer metadata storage and processing.
- mTLS authentication accepts SPIFFE principals in additional formats, trusts an extra principal namespace, and improves validation flow, error logging, and standardized 403 responses on auth failure.
Chores
- Container now runs as a non-root user for improved runtime security.

_{✏️ Tip: You can customize this high-level summary in your review settings.}

coderabbitai · 2026-01-10T15:06:22Z

📝 Walkthrough

Walkthrough

ImageMetadata fields metadata_1–metadata_5 changed from str to dict. Authorization middleware now accepts SPIFFE principal from header or URI, adds explicit mTLS validation via X-Forwarded-Client-Cert with 403 on failure, and reorganizes dispatch for HMAC/token paths. Dockerfile creates and switches to non-root floware user.

Changes

Cohort / File(s)	Summary
ImageMetadata field type updates `wavefront/server/modules/gold_module/gold_module/models/gold_image_request.py`	Changed `metadata_1`–`metadata_5` types from `str` → `dict` (defaults remain `None`). Affects serialization/deserialization and consumers expecting string metadata.
mTLS & auth dispatch adjustments `wavefront/server/modules/user_management_module/user_management_module/authorization/require_auth.py`	`validate_mtls_auth` accepts SPIFFE principal from `URI=...` or header starting with `spiffe://`, allows principals under `cluster.local/ns/client-applications` or `cluster.local/ns/gpu-processing`, logs the principal/header, introduces `mtls_header` handling, gates HMAC validation behind an `authorization` presence check, and returns explicit 403 on mTLS validation failure; token-based flow preserved.
Dockerfile non-root user `wavefront/server/docker/floware.Dockerfile`	Adds creation of non-root user `floware` (UID 1000), chowns `/app` to that user, switches to `USER floware`, and resets workdir to `/app/apps/floware/floware` before CMD.

Sequence Diagram(s)

sequenceDiagram
  actor Client
  participant Proxy
  participant API_Server
  participant Auth_Middleware
  participant Token_Service

  Client->>Proxy: send request (may include X-Forwarded-Client-Cert / Authorization)
  Proxy->>API_Server: forward request + headers
  API_Server->>Auth_Middleware: perform auth checks
  alt X-Forwarded-Client-Cert present
    Auth_Middleware->>Auth_Middleware: extract principal (URI or spiffe://)
    Auth_Middleware->>Auth_Middleware: validate_mtls_auth(principal)
    alt valid
      Auth_Middleware->>API_Server: allow request
    else invalid
      Auth_Middleware->>API_Server: return 403
    end
  else no mTLS header
    Auth_Middleware->>Token_Service: validate token / fallback HMAC
    Token_Service->>Auth_Middleware: validation result
    Auth_Middleware->>API_Server: allow or deny based on token/HMAC
  end

Estimated code review effort

🎯 3 (Moderate) | ⏱️ ~25 minutes

Poem

🐰 I nibbled through fields — five strings turned to maps,
Certificates hopped in, checking spiffe-trod paths.
A floware hat I sewed, non-root to take the stage,
I logged, I hopped, and watched the guards gauge.
Hooray for careful, springy code!

🚥 Pre-merge checks | ✅ 1 | ❌ 2

❌ Failed checks (2 warnings)

Check name	Status	Explanation	Resolution
Title check	⚠️ Warning	The title only references one file (gold_image_request.py) but the PR changes three files with meaningful updates across authorization, Docker configuration, and metadata structure.	Revise the title to reflect the actual scope of changes, such as 'Update metadata field types and authorization validation' or provide a more comprehensive description of the main change.
Docstring Coverage	⚠️ Warning	Docstring coverage is 50.00% which is insufficient. The required threshold is 80.00%.	Write docstrings for the functions missing them to satisfy the coverage threshold.

✅ Passed checks (1 passed)

Check name	Status	Explanation
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

✨ Finishing touches

📝 Generate docstrings

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

coderabbitai

Actionable comments posted: 0

Caution

Some comments are outside the diff and can’t be posted inline due to platform limitations.

⚠️ Outside diff range comments (2)

wavefront/server/modules/gold_module/gold_module/models/gold_image_request.py (2)
86-90: Use Field(default_factory=ImageMetadata) instead of ImageMetadata() to prevent shared mutable state.

The current code reuses a single ImageMetadata instance across all ImageAnalysisRequest instances. Since ImageMetadata contains mutable dict fields (metadata_1 through metadata_5), modifications to these fields in one request would affect all other instances.
Proposed diff
-from pydantic import BaseModel, ConfigDict
+from pydantic import BaseModel, ConfigDict, Field
@@
 class ImageAnalysisRequest(BaseModel):
     image: str  # data URL (base64 with MIME) or direct URL
-    metadata: ImageMetadata = (
-        ImageMetadata()
-    )  # Ensure metadata is always an ImageMetadata instance
+    metadata: ImageMetadata = Field(default_factory=ImageMetadata)
56-84: to_string_dict() incorrectly stringifies Item models (and doesn't stringify dict keys) — breaks expected output format.

When processing items: List[Item], each Item object falls through to str(val) (line 80), producing "Item(item_id='...' ...)" instead of a dict with stringified leaf values. Additionally, dict keys remain non-string despite the docstring claiming "all nested values are strings."

Also replace __pydantic_extra__ (internal API) with model_extra for Pydantic v2 compatibility.
Proposed fix
     def get_extra_fields(self) -> dict:
         """Return a dict of extra fields not defined in the model."""
-        return (
-            dict(self.__pydantic_extra__)
-            if hasattr(self, '__pydantic_extra__') and self.__pydantic_extra__
-            else {}
-        )
+        return dict(self.model_extra or {})

     def get_defined_fields(self) -> dict:
         """Return a dict of only the defined fields (excluding extras)."""
-        return self.model_dump(exclude=self.get_extra_fields().keys())
+        return self.model_dump(exclude=set(self.get_extra_fields()))

     def to_string_dict(self) -> dict:
         """Return a dict with all fields (excluding extras) as strings. None remains None. All nested values are strings."""

         def to_str_recursive(val):
             if val is None:
                 return None
+            if isinstance(val, BaseModel):
+                return to_str_recursive(val.model_dump())
             if isinstance(val, list):
                 return [to_str_recursive(v) for v in val]
             if isinstance(val, dict):
-                return {k: to_str_recursive(v) for k, v in val.items()}
+                return {str(k): to_str_recursive(v) for k, v in val.items()}
             return str(val)

🧹 Nitpick comments (1)

wavefront/server/modules/gold_module/gold_module/models/gold_image_request.py (1)
44-48: Tighten metadata_1..metadata_5 typing (avoid bare dict).
Bare dict weakens validation/schema and makes downstream expectations unclear; prefer dict[str, Any] | None (or Mapping[str, Any] | None).
Proposed diff
-from typing import List
+from typing import Any, List

 class ImageMetadata(BaseModel):
@@
-    metadata_1: dict = None
-    metadata_2: dict = None
-    metadata_3: dict = None
-    metadata_4: dict = None
-    metadata_5: dict = None
+    metadata_1: dict[str, Any] | None = None
+    metadata_2: dict[str, Any] | None = None
+    metadata_3: dict[str, Any] | None = None
+    metadata_4: dict[str, Any] | None = None
+    metadata_5: dict[str, Any] | None = None

📜 Review details

Configuration used: defaults

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 42c9543 and 18ca236.

📒 Files selected for processing (1)

wavefront/server/modules/gold_module/gold_module/models/gold_image_request.py

⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (1)

GitHub Check: build-push-artifact

coderabbitai

Actionable comments posted: 1

🤖 Fix all issues with AI agents

In
@wavefront/server/modules/user_management_module/user_management_module/authorization/require_auth.py:
- Around line 421-435: The code currently logs the full X-Forwarded-Client-Cert
header (mtls_header) in the mTLS branch; remove or replace that sensitive value
with a non-sensitive presence message and rely on validate_mtls_auth to log the
extracted principal. Update the logger.info call in the block that checks
request.headers.get('X-Forwarded-Client-Cert') to something like "mTLS client
certificate presented" (or otherwise redact mtls_header) and do not log
mtls_header itself; keep the rest of the flow (await validate_mtls_auth,
call_next, JSONResponse using response_formatter.buildErrorResponse) unchanged
so authentication behavior is preserved.

🧹 Nitpick comments (1)

wavefront/server/modules/user_management_module/user_management_module/authorization/require_auth.py (1)

272-274: Consider refactoring for improved readability.

The double-negative logic (and not ... and not ...) is correct but could be clearer. Consider using a positive check or restructuring the condition.

♻️ Proposed refactor for clarity

-            if not principal.startswith(
-                'spiffe://cluster.local/ns/client-applications'
-            ) and not principal.startswith('spiffe://cluster.local/ns/gpu-processing'):
+            allowed_prefixes = (
+                'spiffe://cluster.local/ns/client-applications',
+                'spiffe://cluster.local/ns/gpu-processing',
+            )
+            if not principal.startswith(allowed_prefixes):
                 logger.error(f'Invalid mTLS authentication. Principal: {principal}')
                 return False

📜 Review details

Configuration used: defaults

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 18ca236 and 648497c.

📒 Files selected for processing (1)

wavefront/server/modules/user_management_module/user_management_module/authorization/require_auth.py

⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (1)

GitHub Check: build-push-artifact

coderabbitai

Actionable comments posted: 1

🤖 Fix all issues with AI agents

In
@wavefront/server/modules/user_management_module/user_management_module/authorization/require_auth.py:
- Around line 425-439: The code logs the full X-Forwarded-Client-Cert header
(mtls_header) in the middleware; remove or redact that sensitive logging. Locate
the block handling mtls_header in require_auth.py (look for the mtls_header =
request.headers.get('X-Forwarded-Client-Cert') line and the subsequent
logger.info call) and either delete the logger.info(f'mTLS authentication by
{mtls_header}') statement or replace it with a non-sensitive message like
logger.info('mTLS authentication attempt') while keeping the existing call to
validate_mtls_auth and the existing success/failure logging inside
validate_mtls_auth.

📜 Review details

Configuration used: defaults

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 648497c and 9404732.

📒 Files selected for processing (1)

wavefront/server/modules/user_management_module/user_management_module/authorization/require_auth.py

🧰 Additional context used

🧬 Code graph analysis (1)

wavefront/server/modules/user_management_module/user_management_module/authorization/require_auth.py (1)

wavefront/server/modules/api_services_module/api_services_module/models/service.py (1)

error (131-143)

⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (1)

GitHub Check: build-push-artifact

🔇 Additional comments (3)

wavefront/server/modules/user_management_module/user_management_module/authorization/require_auth.py (3)

267-274: LGTM! Good handling of multiple XFCC formats.

The dual extraction strategy correctly handles both standard URI-encoded SPIFFE IDs and direct SPIFFE ID formats. The fallback logic is well-structured.

275-280: LGTM! Correct namespace authorization logic.

The addition of the gpu-processing namespace correctly implements an OR condition - either namespace is allowed for mTLS authentication.

297-297: LGTM! More accurate logging.

The updated log message correctly references the principal variable, improving log clarity and consistency.

coderabbitai · 2026-01-10T19:55:36Z

...t/server/modules/user_management_module/user_management_module/authorization/require_auth.py

+                mtls_header = request.headers.get('X-Forwarded-Client-Cert')
+                if mtls_header and not token:
+                    logger.info(f'mTLS authentication by {mtls_header}')
                    if await validate_mtls_auth(request):
                        return await call_next(request)
+                    else:
+                        logger.error(
+                            f'Invalid mTLS authentication for {request.url.path}'
+                        )
+                        return JSONResponse(
+                            status_code=status.HTTP_403_FORBIDDEN,
+                            content=response_formatter.buildErrorResponse(
+                                error='Invalid mTLS authentication'
+                            ),
+                        )


⚠️ Potential issue | 🟠 Major

Security concern: Avoid logging sensitive certificate data.

Line 427 logs the entire X-Forwarded-Client-Cert header, which may contain sensitive certificate information such as serial numbers, subject distinguished names, and other metadata. This data should not be logged in full, as logs may be accessed by multiple systems and personnel.

The validate_mtls_auth function already logs authentication success/failure with the principal on lines 290-292 and 296-298, making this additional logging redundant.

🔒 Proposed fix to remove or redact sensitive header logging

Option 1 (Recommended): Remove the redundant log statement

Since validate_mtls_auth already provides detailed logging, simply remove this line:

mtls_header = request.headers.get('X-Forwarded-Client-Cert') if mtls_header and not token: - logger.info(f'mTLS authentication by {mtls_header}') if await validate_mtls_auth(request):

Option 2: Log only that mTLS is being attempted

If you need logging at the dispatch level, log without exposing the header value:

mtls_header = request.headers.get('X-Forwarded-Client-Cert') if mtls_header and not token: - logger.info(f'mTLS authentication by {mtls_header}') + logger.info('Attempting mTLS authentication') if await validate_mtls_auth(request):

🤖 Prompt for AI Agents

In @wavefront/server/modules/user_management_module/user_management_module/authorization/require_auth.py around lines 425 - 439, The code logs the full X-Forwarded-Client-Cert header (mtls_header) in the middleware; remove or redact that sensitive logging. Locate the block handling mtls_header in require_auth.py (look for the mtls_header = request.headers.get('X-Forwarded-Client-Cert') line and the subsequent logger.info call) and either delete the logger.info(f'mTLS authentication by {mtls_header}') statement or replace it with a non-sensitive message like logger.info('mTLS authentication attempt') while keeping the existing call to validate_mtls_auth and the existing success/failure logging inside validate_mtls_auth.

coderabbitai

Actionable comments posted: 1

🤖 Fix all issues with AI agents

In
`@wavefront/server/modules/user_management_module/user_management_module/authorization/require_auth.py`:
- Around line 295-298: The code logs the raw mTLS header variable xfcc in
logger.warning; remove or redact that sensitive value: locate the logger.warning
call near where request_id is computed (get_current_request_id()) and change the
message to omit xfcc (e.g., log "mTLS header present but no valid principal
found" with the request_id) or replace xfcc with a non-sensitive
token/hashing/redacted literal before logging; keep request_id in the log for
traceability and do not persist or print the raw certificate header anywhere.

♻️ Duplicate comments (1)

wavefront/server/modules/user_management_module/user_management_module/authorization/require_auth.py (1)

425-439: Duplicate: Sensitive certificate logging already flagged.

The logging of full mtls_header on line 427 was addressed in a previous review comment. The new explicit 403 response for invalid mTLS authentication (lines 430-439) is a good addition that properly rejects failed mTLS attempts.

📜 Review details

Configuration used: defaults

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 864ad14 and 7d141b2.

📒 Files selected for processing (1)

wavefront/server/modules/user_management_module/user_management_module/authorization/require_auth.py

🔇 Additional comments (3)

wavefront/server/modules/user_management_module/user_management_module/authorization/require_auth.py (3)

267-280: LGTM on SPIFFE principal extraction enhancement.

The logic correctly handles both URI-embedded SPIFFE IDs and direct SPIFFE headers. The disjunction allowing client-applications or gpu-processing namespaces is clear.

338-340: LGTM on HMAC gating with Authorization header check.

The condition not authorization ensures HMAC validation only applies when no Authorization header is present, preventing auth method ambiguity.

371-372: LGTM on dispatch flow restructuring.

The explicit else block cleanly separates auth paths (HMAC, service auth, normal token flow), improving readability and maintainability.

_{✏️ Tip: You can disable this entire section by setting review_details to false in your review settings.}

coderabbitai · 2026-01-15T07:10:59Z

...t/server/modules/user_management_module/user_management_module/authorization/require_auth.py

        request_id = getattr(request.state, 'request_id', get_current_request_id())
        logger.warning(
-            f'mTLS header present but no valid URI found: {xfcc} [Request ID: {request_id}]'
+            f'mTLS header present but no valid principal found: {xfcc} [Request ID: {request_id}]'
        )


⚠️ Potential issue | 🟠 Major

Security concern: Avoid logging raw certificate header.

Line 297 logs the full xfcc header content, which may contain sensitive certificate metadata (serial numbers, subject DNs, issuer info). Consider logging only that no valid principal was found, without exposing the raw header value.

🔒 Proposed fix to redact sensitive header data

request_id = getattr(request.state, 'request_id', get_current_request_id()) logger.warning( - f'mTLS header present but no valid principal found: {xfcc} [Request ID: {request_id}]' + f'mTLS header present but no valid SPIFFE principal found [Request ID: {request_id}]' )

📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change

request_id = getattr(request.state, 'request_id', get_current_request_id())

logger.warning(

f'mTLS header present but no valid URI found: {xfcc} [Request ID: {request_id}]'

f'mTLS header present but no valid principal found: {xfcc} [Request ID: {request_id}]'

)

request_id = getattr(request.state, 'request_id', get_current_request_id())

logger.warning(

f'mTLS header present but no valid SPIFFE principal found [Request ID: {request_id}]'

)

🤖 Prompt for AI Agents

In `@wavefront/server/modules/user_management_module/user_management_module/authorization/require_auth.py` around lines 295 - 298, The code logs the raw mTLS header variable xfcc in logger.warning; remove or redact that sensitive value: locate the logger.warning call near where request_id is computed (get_current_request_id()) and change the message to omit xfcc (e.g., log "mTLS header present but no valid principal found" with the request_id) or replace xfcc with a non-sensitive token/hashing/redacted literal before logging; keep request_id in the log for traceability and do not persist or print the raw certificate header anywhere.

Update gold_image_request.py

18ca236

coderabbitai bot reviewed Jan 10, 2026

View reviewed changes

Adding more logs tp debug auth

648497c

coderabbitai bot reviewed Jan 10, 2026

View reviewed changes

vizsatiz added 2 commits January 10, 2026 23:19

Adding spiffe case

14fefbf

Ignoring mtls if token is present

9404732

coderabbitai bot reviewed Jan 10, 2026

View reviewed changes

vizsatiz added 2 commits January 13, 2026 20:10

fix for container isolation

864ad14

fix to consider authorization over hmac

7d141b2

coderabbitai bot reviewed Jan 15, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Update gold_image_request.py #205

Update gold_image_request.py #205

Uh oh!

vizsatiz commented Jan 10, 2026 •

edited by coderabbitai bot

Loading

Uh oh!

coderabbitai bot commented Jan 10, 2026 •

edited

Loading

Walkthrough

Changes

Sequence Diagram(s)

Estimated code review effort

Poem

Uh oh!

coderabbitai bot left a comment

Uh oh!

coderabbitai bot left a comment

Uh oh!

coderabbitai bot left a comment

Uh oh!

coderabbitai bot Jan 10, 2026

Uh oh!

coderabbitai bot left a comment

Uh oh!

coderabbitai bot Jan 15, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Update gold_image_request.py #205

Are you sure you want to change the base?

Update gold_image_request.py #205

Uh oh!

Conversation

vizsatiz commented Jan 10, 2026 • edited by coderabbitai bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary by CodeRabbit

Uh oh!

coderabbitai bot commented Jan 10, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Walkthrough

Changes

Sequence Diagram(s)

Estimated code review effort

Poem

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot Jan 10, 2026

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot Jan 15, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

vizsatiz commented Jan 10, 2026 •

edited by coderabbitai bot

Loading

coderabbitai bot commented Jan 10, 2026 •

edited

Loading