Add OpenAI Responses-compatible endpoint (#4582)
Open
CUHKSZzxy wants to merge 8 commits into
Conversation
Pull request overview
This PR adds a text-first, OpenAI Responses API–compatible endpoint (POST /v1/responses) to LMDeploy’s OpenAI server, including request normalization (string/messages/instructions/developer role), function tool mapping/tool-choice validation, and an SSE streaming event surface. It also updates middleware route protection, integrates the new router into api_server, and adds tests + documentation (including Codex integration docs).
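As a rough illustration of the request normalization described above, the string/messages/instructions/developer handling might look like the following (hypothetical helper names, not the PR's actual code):

```python
# Hypothetical sketch of Responses-request normalization; function and field
# names are assumptions, not LMDeploy's actual implementation.

def normalize_input(input, instructions=None):
    """Normalize a Responses-style `input` into chat-style messages.

    `input` may be a plain string or a list of message dicts; an optional
    `instructions` string is prepended as a system message, and the
    Responses-specific `developer` role is folded into `system`.
    """
    if isinstance(input, str):
        messages = [{'role': 'user', 'content': input}]
    else:
        messages = [dict(m) for m in input]
    # Map the `developer` role onto `system` for the underlying chat engine
    for m in messages:
        if m.get('role') == 'developer':
            m['role'] = 'system'
    if instructions:
        messages.insert(0, {'role': 'system', 'content': instructions})
    return messages
```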
Changes:
- Add `lmdeploy/serve/openai/responses.py` implementing `POST /v1/responses` (non-stream + SSE streaming) and related request/response models.
- Wire the new endpoint into the OpenAI API server and protect it under the engine-sleep middleware.
- Add focused unit tests plus English/Chinese documentation and integration guides (Codex / Claude Code).
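The tool mapping and tool-choice validation in the list above could be sketched roughly as follows (a minimal illustration, assuming the Responses-style flat function-tool shape; not the PR's actual code, and it handles only string-valued `tool_choice`):

```python
# Hedged sketch: convert Responses-style flat function tools into the nested
# chat-completions shape, rejecting unsupported tool types and tool_choice
# values. Names and accepted values are assumptions for illustration.

def convert_tools(tools, tool_choice='auto'):
    """Convert flat function tools to chat-completions `function` tools."""
    if tool_choice not in ('auto', 'none', 'required'):
        raise ValueError(f'unsupported tool_choice: {tool_choice!r}')
    converted = []
    for t in tools or []:
        if t.get('type') != 'function':
            raise ValueError(f"unsupported tool type: {t.get('type')!r}")
        # The Responses API places name/description/parameters at the top
        # level; chat completions nests them under a `function` key.
        converted.append({
            'type': 'function',
            'function': {
                'name': t['name'],
                'description': t.get('description', ''),
                'parameters': t.get('parameters', {}),
            },
        })
    return converted
```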
Reviewed changes
Copilot reviewed 11 out of 12 changed files in this pull request and generated 3 comments.
| File | Description |
|---|---|
| tests/test_lmdeploy/serve/openai/test_responses.py | Adds unit coverage for input normalization, tools/tool_choice validation, response shapes, and SSE event shapes. |
| lmdeploy/serve/utils/server_utils.py | Adds /v1/responses to sleeping-engine protected inference routes. |
| lmdeploy/serve/openai/responses.py | Implements the Responses-compatible router, request parsing, tool conversion, non-stream response construction, and SSE streaming events. |
| lmdeploy/serve/openai/api_server.py | Registers the new Responses router on the FastAPI app. |
| docs/zh_cn/llm/api_server.md | Links to the new Responses endpoint documentation. |
| docs/zh_cn/llm/api_server_responses.md | Documents the /v1/responses endpoint (Text V1 subset), tools, SSE events, and Codex setup notes. |
| docs/zh_cn/index.rst | Adds the Responses doc page to the Chinese toctree. |
| docs/en/llm/api_server.md | Links to the new Responses endpoint documentation. |
| docs/en/llm/api_server_responses.md | Documents the /v1/responses endpoint and points to Codex integration docs. |
| docs/en/integration/codex.md | Adds a Codex → LMDeploy /v1/responses integration guide. |
| docs/en/integration/claude_code.md | Adds a Claude Code → LMDeploy /v1/messages integration guide. |
| docs/en/index.rst | Adds the Responses doc page and a new Integrations toctree (Codex/Claude Code). |
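The SSE streaming event surface referenced in the table could be shaped along these lines (an illustrative sketch: the event names follow the public OpenAI Responses streaming convention, and the helper names are assumptions rather than LMDeploy's verified wire format):

```python
import json

# Sketch of SSE event formatting for a streamed text response. Event names
# (response.created / response.output_text.delta / response.completed) follow
# the OpenAI Responses streaming convention; payload fields are simplified.

def sse_event(event_type, data):
    """Format one server-sent event as an `event:`/`data:` block."""
    return f'event: {event_type}\ndata: {json.dumps(data)}\n\n'

def stream_text(chunks):
    """Yield SSE events bracketing the streamed text deltas."""
    yield sse_event('response.created', {'type': 'response.created'})
    for delta in chunks:
        yield sse_event('response.output_text.delta', {'delta': delta})
    yield sse_event('response.completed', {'type': 'response.completed'})
```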
Summary
- `POST /v1/responses` endpoint.

Validation
- `pytest tests/test_lmdeploy/serve/openai/test_responses.py -q` (18 passed)
- `git diff --check upstream/main...HEAD`

Codex Demo