perf(evaluate): optimize reference product evaluation with concurrency by OgeonX-Ai · Pull Request #4 · Coding-Autopilot-System/cas-evals

OgeonX-Ai · 2026-07-03T11:03:26Z

This PR fixes an unoptimized path in reference_product.py where cases were being evaluated sequentially. It introduces concurrent.futures.ThreadPoolExecutor to evaluate cases concurrently.

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 58099d3a7d

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

chatgpt-codex-connector · 2026-07-03T11:07:45Z

        )

+    with concurrent.futures.ThreadPoolExecutor(max_workers=10) as executor:
+        evaluated = list(executor.map(process_case, suite["cases"]))


Stop scheduling cases after the first adapter failure

In a multi-case reference-product suite where the endpoint times out or returns an invalid contract, this executor.map call submits the whole suite before any result is inspected, and list(...) exits the with only after those queued HTTP calls finish. That regresses the fail-closed path from stopping on the first bad response to spending up to one timeout per batch, and it can send later prompts even after an earlier case has already made the run invalid.

Useful? React with 👍 / 👎.

Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>

perf(evaluate): optimize reference product evaluation with concurrency

58099d3

chatgpt-codex-connector Bot reviewed Jul 3, 2026

View reviewed changes

chore(ci): ignore Playwright artifacts; track UI test lockfile

98a5429

Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>

OgeonX-Ai enabled auto-merge (squash) July 3, 2026 14:58

OgeonX-Ai merged commit f4c4912 into main Jul 3, 2026
11 checks passed

OgeonX-Ai deleted the fix/audit-sweep branch July 3, 2026 15:20

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

perf(evaluate): optimize reference product evaluation with concurrency#4

perf(evaluate): optimize reference product evaluation with concurrency#4
OgeonX-Ai merged 2 commits into
mainfrom
fix/audit-sweep

OgeonX-Ai commented Jul 3, 2026

Uh oh!

chatgpt-codex-connector Bot left a comment

Uh oh!

chatgpt-codex-connector Bot Jul 3, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

OgeonX-Ai commented Jul 3, 2026

Uh oh!

chatgpt-codex-connector Bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

chatgpt-codex-connector Bot Jul 3, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants