chore: Dev to Main Merge by Vamshi-Microsoft · Pull Request #623 · microsoft/content-processing-solution-accelerator

Vamshi-Microsoft · 2026-06-16T10:31:15Z

Purpose

This pull request refactors how extraction quality scores are calculated, surfaced, and described in the Content Processor pipeline. The new scoring logic distinguishes probabilistic confidence from structural completeness, so that completed runs without logprobs receive a meaningful score instead of 0.0. It also tightens Azure authentication error handling and updates dependency versions.

Extraction Score Calculation and API Improvements:

Refactored the extraction quality scoring logic in SaveHandler to use a new _derive_aggregate_scores method, which selects between probabilistic confidence and a structural completeness fallback, ensuring completed runs without logprobs get a meaningful score rather than 0.0. Also added a helper _is_filled_value to robustly determine if a field is filled. [1] [2] [3]
Enhanced the documentation for entity_score and schema_score in the API model to clarify the new scoring semantics for consumers.
Added comprehensive unit tests for the new scoring logic, covering probabilistic, structural, and zero-score paths.
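
The selection between the three scoring paths can be pictured with a minimal sketch. This is not the code in SaveHandler: the function name, its parameters, and the use of a plain mean over per-field confidences are all assumptions made for illustration.

```python
from typing import Optional, Sequence


def derive_aggregate_score(
    confidences: Sequence[Optional[float]],
    filled_fields: int,
    total_fields: int,
) -> float:
    """Pick the best available signal for one aggregate score (illustrative only)."""
    # Path 1: probabilistic confidence, used when at least one field was evaluated.
    # Averaging is an assumption; the real aggregation may differ.
    evaluated = [c for c in confidences if c is not None]
    if evaluated:
        return sum(evaluated) / len(evaluated)
    # Path 2: structural completeness, used when fields exist but carry no confidence.
    if total_fields > 0:
        return filled_fields / total_fields
    # Path 3: no extraction data at all.
    return 0.0
```

Under these assumptions a genuine zero confidence still yields 0.0 through the first path, while a run with no confidence signal and three of four fields filled would score 0.75.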

Dependency and Azure Authentication Handling:

Updated the idna dependency to version 3.15 in both requirements files for consistency and security. [1] [2]
Removed fallback to DefaultAzureCredential in Azure credential utilities; now, if CLI and managed identity authentication fail, a clear error is raised, prompting explicit user action. [1] [2] [3] [4]
Updated application initialization to use the new credential utility instead of DefaultAzureCredential directly. [1] [2]
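
A rough sketch of the "raise instead of falling back" behavior follows. It deliberately avoids importing azure-identity: `select_credential` and the candidate factories are hypothetical stand-ins, and the order in which options are tried is an assumption rather than something this PR description states.

```python
from typing import Any, Callable, Sequence, Tuple


def select_credential(candidates: Sequence[Tuple[str, Callable[[], Any]]]) -> Any:
    """Return the first credential whose factory succeeds; never fall back silently."""
    errors = []
    for name, factory in candidates:
        try:
            return factory()
        except Exception as exc:
            # Record why this option was unavailable so the final error is actionable.
            errors.append(f"{name}: {exc}")
    raise RuntimeError(
        "No Azure credential available; sign in with the Azure CLI or configure "
        "a managed identity. Tried: " + "; ".join(errors)
    )
```

In a real utility each factory would presumably construct and probe a CLI or managed identity credential. The point of the sketch is only that exhausting the list ends in a RuntimeError rather than a broader default credential chain.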

Frontend Robustness:

Updated the process queue grid UI to handle null or undefined scores gracefully, always displaying "0" instead of crashing or showing blank. [1] [2]
Updated the frontend type documentation to match the new score semantics.

Does this introduce a breaking change?

Yes
No

Golden Path Validation

I have tested the primary workflows (the "golden path") to ensure they function correctly without errors.

Deployment Validation

I have validated the deployment process successfully and all services are running as expected with this change.

What to Check

Verify that the following are valid

...

Other Information

…rkflow Applies the changes from Dependabot PR #589 onto dev so they reach the dev branch ahead of the upstream PR (which targets main). Refs: ADO #44960 Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

Applies the changes from Dependabot PR #595 onto dev so they reach the dev branch ahead of the upstream PR (which targets main). Refs: ADO #44960 Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

Applies the changes from Dependabot PR #596 onto dev so they reach the dev branch ahead of the upstream PR (which targets main). Refs: ADO #44960 Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

…uv.lock Applies the changes from Dependabot PR #597 onto dev so they reach the dev branch ahead of the upstream PR (which targets main). Refs: ADO #44960 Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

Regenerated uv.lock after merging dev to incorporate both dependabot upgrades and agent-framework 1.3.0 changes. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

chore(deps): bring dependabot upgrades (#589, #595, #596, #597) onto dev (AB#44960)

…ial in python application.

…ores

Root cause: When the evaluate step couldn't compute any per-field confidence (e.g. logprobs unavailable on reasoning models like gpt-5/o1/o3, or image-only flow with no Content Understanding signal), save_handler emitted entity_score=0.0, schema_score=0.0. These `0.0`s flowed through Cosmos -> API -> UI and rendered as `0%` (red), indistinguishable from a genuine zero confidence.

Fix: Treat `total_evaluated_fields_count == 0` (or no comparison items) as *unavailable* and propagate `None` through the ContentProcessor, ContentProcessorAPI and ContentProcessorWorkflow models. The frontend percentage cell renderer now shows `N/A` for null/undefined and `0%` only for a genuine numeric zero.

Files changed:
- ContentProcessor: save_handler.py (extracted _derive_aggregate_scores helper)
- ContentProcessor: content_process.py default scores -> None
- ContentProcessorAPI: ContentProcess + Content_Process default scores -> None
- ContentProcessorWorkflow: ContentProcessRecord + Content_Process default scores -> None
- ContentProcessorWorkflow: document_process_executor preserves None instead of coercing to 0.0
- ContentProcessorWeb: ProcessQueueGridTypes types scores nullable; ProcessQueueGrid passes undefined for null/undefined; CustomCellRender renders `N/A` when valueText is null/undefined and only `...` while still processing

Tests:
- New: ContentProcessor/tests/unit/pipeline/test_save_handler_scores.py (5 cases: valid scores, missing per-field signal, no comparison items, genuine zero, all-fields-above-threshold)
- Updated existing default-value tests in Workflow + src/tests to assert None
- Added tests for explicit zero preservation and Failed status -> None

Per feedback: Completed runs must always show a meaningful number; Failed runs and genuine zeros stay at 0%.
- save_handler._derive_aggregate_scores picks the best available signal: (1) probabilistic confidence when logprobs available; (2) structural completeness (filled fields / total) when no logprobs (reasoning models, image-only flow); (3) 0.0 when no extraction data at all.
- _is_filled_value heuristic: None/empty/whitespace count as not filled; descends into nested dicts/lists.
- Reverted models from Optional[float]=None back to default 0.0.
- Reverted frontend: no N/A path; renders 0% for null/missing scores.
- 15 new tests covering all 3 paths + _is_filled_value heuristic.
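
The fill heuristic and the structural completeness ratio described in this commit can be sketched as below. The function names are illustrative. Treating a container as filled when any nested value is filled, and counting only top-level fields in the ratio, are assumptions that the commit message does not spell out.

```python
from typing import Any


def is_filled_value(value: Any) -> bool:
    """Return True when a field holds real content (illustrative heuristic)."""
    if value is None:
        return False
    if isinstance(value, str):
        # Empty and whitespace-only strings count as not filled.
        return value.strip() != ""
    if isinstance(value, dict):
        # Descend into nested dicts; an empty dict is not filled.
        return any(is_filled_value(v) for v in value.values())
    if isinstance(value, (list, tuple)):
        # Descend into nested sequences; an empty list is not filled.
        return any(is_filled_value(v) for v in value)
    # Numbers and booleans, including 0 and False, count as filled here.
    return True


def structural_completeness(fields: dict) -> float:
    """Filled fields divided by total fields; 0.0 when there are no fields."""
    if not fields:
        return 0.0
    filled = sum(1 for v in fields.values() if is_filled_value(v))
    return filled / len(fields)
```

For example, a record with a name, an empty date string, an address whose only nested value is None, and a numeric total of 0 has two of four fields filled under this sketch.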

…ation
- F401: drop unused sync DefaultAzureCredential import in 3 credential util files (sync flow now raises RuntimeError; AsyncDefaultAzureCredential is still used).
- W293/E122: fix blank-line whitespace and continuation-line indentation in ContentProcessorWorkflow/src/utils/credential_util.py.

fix: Psl entity score

github-actions · 2026-06-16T10:31:59Z

Coverage Report

File	Stmts	Miss	Cover
libs/utils
azure_credential_utils.py	97	0	100%
TOTAL	1217	161	86%

Tests	Skipped	Failures	Errors	Time
244	0 💤	0 ❌	0 🔥	3.199s ⏱️

Copilot

Pull request overview

This pull request refactors extraction quality scoring in the Content Processor pipeline to avoid misleading 0.0 scores for Completed runs without probabilistic confidence, while also tightening Azure credential selection behavior, updating dependencies, and hardening the UI display of scores.

Changes:

Refactors SaveHandler aggregate score derivation to select probabilistic confidence when available, otherwise fall back to structural completeness.
Removes sync fallback to DefaultAzureCredential in credential utilities (raising a clear error instead) and updates app initialization to use the utility.
Updates score semantics documentation/models, bumps dependencies (idna, authlib), and adds/updates unit tests + UI handling for missing scores.

Reviewed changes

Copilot reviewed 23 out of 24 changed files in this pull request and generated 8 comments.

File	Description
src/tests/ContentProcessorWorkflow/utils/test_credential_util_extended.py	Updates tests to expect a RuntimeError when no credential options succeed.
src/tests/ContentProcessorWorkflow/services/test_content_process_models.py	Clarifies score defaults in model default tests.
src/tests/ContentProcessorWorkflow/repositories/test_claim_process_model.py	Clarifies score defaults in repository model tests.
src/tests/ContentProcessor/utils/test_azure_credential_utils.py	Updates tests to expect RuntimeError when no credentials are available.
src/tests/ContentProcessor/utils/test_azure_credential_utils_extended.py	Updates extended credential tests for the new “raise on failure” behavior.
src/ContentProcessorWorkflow/uv.lock	Bumps `authlib` lock entry to `1.6.12`.
src/ContentProcessorWorkflow/tests/unit/services/test_content_process_models.py	Adds assertions/tests around preserving explicit `0.0` scores.
src/ContentProcessorWorkflow/tests/unit/repositories/test_claim_process_model.py	Adds tests ensuring explicit `0.0` and failure-default `0.0` behavior.
src/ContentProcessorWorkflow/src/utils/credential_util.py	Changes sync credential selection to raise when no auth is available.
src/ContentProcessorWorkflow/src/steps/document_process/executor/document_process_executor.py	Centralizes safe coercion of score values from poll payloads.
src/ContentProcessorWorkflow/src/repositories/model/claim_process.py	Updates field descriptions to document new score semantics.
src/ContentProcessorWorkflow/src/libs/base/application_base.py	Switches initialization to use `get_azure_credential()` instead of `DefaultAzureCredential`.
src/ContentProcessorWorkflow/src/libs/azure/app_configuration.py	Requires an explicit credential instead of implicitly defaulting.
src/ContentProcessorWorkflow/pyproject.toml	Bumps `authlib` to `1.6.12`.
src/ContentProcessorWeb/src/Pages/DefaultPage/Components/ProcessQueueGrid/ProcessQueueGridTypes.ts	Updates score field documentation (and should align types with nullish handling).
src/ContentProcessorWeb/src/Pages/DefaultPage/Components/ProcessQueueGrid/ProcessQueueGrid.tsx	Handles null/undefined scores in the grid rendering path.
src/ContentProcessorAPI/requirements.txt	Bumps `idna` to `3.15`.
src/ContentProcessorAPI/app/routers/models/contentprocessor/claim_process.py	Updates API model field descriptions to explain new score semantics.
src/ContentProcessorAPI/app/libs/base/application_base.py	Switches initialization to use `get_azure_credential()` utility.
src/ContentProcessor/tests/unit/pipeline/test_save_handler_scores.py	Adds comprehensive unit tests for new aggregate scoring logic.
src/ContentProcessor/src/libs/utils/credential_util.py	Changes sync credential selection to raise when no auth is available.
src/ContentProcessor/src/libs/utils/azure_credential_utils.py	Changes sync credential selection to raise when no auth is available.
src/ContentProcessor/src/libs/pipeline/handlers/save_handler.py	Implements `_derive_aggregate_scores` + `_is_filled_value` structural fallback scoring.
src/ContentProcessor/requirements.txt	Bumps `idna` to `3.15`.
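
The "safe coercion of score values from poll payloads" noted for document_process_executor.py might look roughly like the sketch below. The helper name, the 0.0 default, and the handling of booleans and non-finite values are assumptions, not taken from the diff.

```python
import math
from typing import Any


def coerce_score(value: Any, default: float = 0.0) -> float:
    """Convert a payload value to a finite float, or return the default."""
    # Missing values and booleans are treated as "no score" (assumption).
    if value is None or isinstance(value, bool):
        return default
    try:
        score = float(value)
    except (TypeError, ValueError):
        # Non-numeric strings, dicts, lists and similar fall back to the default.
        return default
    # Reject NaN and infinities so they never reach the UI.
    return score if math.isfinite(score) else default
```

Keeping this in one helper means every score read from a payload goes through the same rules, and an explicit 0.0 is preserved rather than being confused with a missing value.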

github-actions · 2026-06-16T12:44:25Z

🎉 This PR is included in version 2.1.2 🎉

The release is available on GitHub release

Your semantic-release bot 📦🚀

Shreyas-Microsoft and others added 17 commits June 1, 2026 16:28

build(deps): bump idna from 3.11 to 3.15 in ContentProcessorAPI

68f54be

Applies the changes from Dependabot PR #595 onto dev so they reach the dev branch ahead of the upstream PR (which targets main). Refs: ADO #44960 Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

build(deps): bump idna from 3.11 to 3.15 in ContentProcessor

03bcf44

Applies the changes from Dependabot PR #596 onto dev so they reach the dev branch ahead of the upstream PR (which targets main). Refs: ADO #44960 Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

fix: resolve merge conflict in ContentProcessorWorkflow/uv.lock

7f176e3

Regenerated uv.lock after merging dev to incorporate both dependabot upgrades and agent-framework 1.3.0 changes. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

Merge pull request #606 from microsoft/psl-sw/44960-dependabot-upgrades

cdf62bd

chore(deps): bring dependabot upgrades (#589, #595, #596, #597) onto dev (AB#44960)

Resolve CodeQL issues

ec8d7a0

Resolve test cases error

8fc57b8

Resolve test cases error 1

e11892e

Resolve test cases error 2

cab11be

Resolve test cases error 3

cf8dfc7

fix: Resolve CodeQL issues to avoid unsafe use of DefaultAzureCredential in python application.

3f193ba

fix(lint): remove trailing blank line at EOF (W391)

8d7b592

Merge pull request #619 from microsoft/psl-entity-score

1fee0b1

fix: Psl entity score

Vamshi-Microsoft requested a review from Avijit-Microsoft as a code owner June 16, 2026 10:31

Copilot AI review requested due to automatic review settings June 16, 2026 10:31

Vamshi-Microsoft requested review from Roopan-Microsoft, aniaroramsft, dgp10801, nchandhi and toherman-msft as code owners June 16, 2026 10:31

Vamshi-Microsoft temporarily deployed to production June 16, 2026 10:31 — with GitHub Actions Inactive

Copilot started reviewing on behalf of Vamshi-Microsoft June 16, 2026 10:31

Avijit-Microsoft approved these changes Jun 16, 2026

Copilot AI reviewed Jun 16, 2026

Roopan-Microsoft approved these changes Jun 16, 2026

Roopan-Microsoft merged commit b47cec4 into main Jun 16, 2026
33 of 35 checks passed

github-actions Bot added the released label Jun 16, 2026

Roopan-Microsoft merged 17 commits into main from dev

7 participants
