Add support for benchmarking TLS endpoints by rgerganov · Pull Request #319 · mlcommons/endpoints

rgerganov · 2026-05-20T12:16:57Z

Enable the benchmark and probe commands to target HTTPS endpoints, including those serving self-signed certificates via a new --insecure flag that skips TLS certificate verification on the SSL context used by worker connections.

Also fix HttpResponseProtocol.eof_received() to return False so asyncio closes the transport itself; returning True is a no-op on SSL transports (which don't support TCP half-close) and was leaving pending body futures unresolved on connection teardown.

What does this PR do?

Type of change

Bug fix
New feature
Documentation update
Refactor/cleanup

Related issues

Testing

Tests added/updated
All tests pass locally
Manual testing completed

Checklist

Code follows project style
Pre-commit hooks pass
Documentation updated (if needed)

Enable the benchmark and probe commands to target HTTPS endpoints, including those serving self-signed certificates via a new --insecure flag that skips TLS certificate verification on the SSL context used by worker connections. Also fix HttpResponseProtocol.eof_received() to return False so asyncio closes the transport itself; returning True is a no-op on SSL transports (which don't support TCP half-close) and was leaving pending body futures unresolved on connection teardown.

github-actions · 2026-05-20T12:17:09Z

MLCommons CLA bot:
Thank you very much for your submission; we really appreciate it. Before we can accept your contribution,
we ask that you sign the MLCommons CLA (Apache 2). Please submit your GitHub ID to our onboarding form to initiate
authorization. If you are from a MLCommons member organization, we will request that you be added to the CLA.
If you are not from a member organization, we will email you a CLA to sign. For any questions, please contact
support@mlcommons.org.
0 out of 1 committers have signed the MLCommons CLA.
❌ @rgerganov
_{You can retrigger this bot by commenting recheck in this Pull Request}

Copilot

Pull request overview

This PR enables HTTPS/TLS targeting for the endpoint client used by the benchmark and probe commands, including an opt-in --insecure flag for connecting to endpoints with self-signed/untrusted certificates. It also fixes HttpResponseProtocol.eof_received() to return False so asyncio reliably closes the transport (including SSL transports) and resolves pending body futures on teardown.

Changes:

Add insecure (--insecure) to HTTPClientConfig and propagate it into worker TLS ssl.SSLContext configuration (skip hostname/cert verification when enabled).
Ensure HTTPS connections are actually established by passing an SSL context into the worker connection pool when the endpoint scheme is https.
Fix HttpResponseProtocol.eof_received() to return False and update the unit test expectation accordingly; update benchmark config templates and probe CLI to expose --insecure.

Reviewed changes

Copilot reviewed 8 out of 8 changed files in this pull request and generated no comments.

Show a summary per file

File	Description
tests/unit/endpoint_client/test_http.py	Updates unit test to assert `eof_received()` returns `False` for correct asyncio transport closure behavior.
src/inference_endpoint/endpoint_client/worker.py	Creates and configures an SSL context for `https://` endpoints; applies `insecure` by disabling verification.
src/inference_endpoint/endpoint_client/http.py	Changes `eof_received()` to return `False` and documents why this is required (especially for SSL transports).
src/inference_endpoint/endpoint_client/config.py	Adds `insecure` config/CLI flag to control TLS certificate verification.
src/inference_endpoint/config/templates/online_template_full.yaml	Adds `settings.client.insecure` to the full online benchmark template.
src/inference_endpoint/config/templates/offline_template_full.yaml	Adds `settings.client.insecure` to the full offline benchmark template.
src/inference_endpoint/config/templates/concurrency_template_full.yaml	Adds `settings.client.insecure` to the full concurrency benchmark template.
src/inference_endpoint/commands/probe.py	Adds `--insecure` to probe CLI and forwards it into `HTTPClientConfig`.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

gemini-code-assist

Code Review

This pull request introduces an "insecure" configuration option across the inference endpoint client, CLI, and configuration templates to allow skipping TLS certificate verification. Additionally, it updates the eof_received method in the HTTP protocol implementation to return False, ensuring proper transport closure for SSL connections that do not support TCP half-close. I have no feedback to provide as there were no review comments to evaluate.

rgerganov requested review from a team and Copilot May 20, 2026 12:16

Copilot started reviewing on behalf of rgerganov May 20, 2026 12:17 View session

Copilot AI reviewed May 20, 2026

View reviewed changes

gemini-code-assist Bot reviewed May 20, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add support for benchmarking TLS endpoints#319

Add support for benchmarking TLS endpoints#319
rgerganov wants to merge 1 commit into
mlcommons:mainfrom
rgerganov:ssl-support

rgerganov commented May 20, 2026

Uh oh!

github-actions Bot commented May 20, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

gemini-code-assist Bot left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

rgerganov commented May 20, 2026

What does this PR do?

Type of change

Related issues

Testing

Checklist

Uh oh!

github-actions Bot commented May 20, 2026

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants