Remove hardcoded DeepSeek-OCR model default value #31

Copilot · 2025-12-15T09:35:16Z

DeepSeek-OCR model name was hardcoded as default in DeepSeekOCRConfig, limiting model flexibility despite DS_OCR_MODEL env var support.

Changes

Make model field required (no default) in DeepSeekOCRConfig, consistent with api_key
Users must explicitly set DS_OCR_MODEL in .env (already present in env.example)

# Before
model: str = Field(
    default="deepseek-ai/DeepSeek-OCR",
    description="DeepSeek-OCR Model Name"
)

# After
model: str = Field(..., description="DeepSeek-OCR Model Name")

Original prompt

This section details on the original issue you should resolve

<issue_title>DeepSeek-OCR model name hardcoded - limits model flexibility</issue_title>
<issue_description>## 问题描述
DeepSeek-OCR 模型名称硬编码，限制模型灵活性。

受影响的文件

src/config.py 行 102

硬编码值

default="deepseek-ai/DeepSeek-OCR"

问题

虽然有 DS_OCR_MODEL 环境变量，但代码默认值也硬编码。

解决方案

应该保持一致，建议移除硬编码的默认值。</issue_description>

Comments on the Issue (you are @copilot in this section)

Fixes DeepSeek-OCR model name hardcoded - limits model flexibility #17

💬 We'd love your input! Share your thoughts on Copilot coding agent in our 2 minute survey.

将 model 字段从有默认值 "deepseek-ai/DeepSeek-OCR" 改为必填字段，用户需要在 .env 文件中显式设置 DS_OCR_MODEL 环境变量。修复 issue: DeepSeek-OCR model name hardcoded - limits model flexibility Co-authored-by: BukeLy <19304666+BukeLy@users.noreply.github.com>

Copilot

Pull request overview

This PR removes the hardcoded default value for the DeepSeek-OCR model name in the configuration, making it a required field that must be explicitly set via the DS_OCR_MODEL environment variable. The change aims to improve model flexibility by eliminating the hardcoded default value while relying on users to configure this in their .env file (as shown in env.example).

Key Changes:

Changed model field in DeepSeekOCRConfig from having a default value ("deepseek-ai/DeepSeek-OCR") to being required (Field(...))
Users must now explicitly set DS_OCR_MODEL in their .env file for the application to start

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Copilot · 2025-12-15T09:49:08Z

src/config.py

-        default="deepseek-ai/DeepSeek-OCR",
-        description="DeepSeek-OCR Model Name"
-    )
+    model: str = Field(..., description="DeepSeek-OCR Model Name")


Making the model field required creates an inconsistency with other configuration classes in this codebase. LLMConfig (line 24), EmbeddingConfig (line 46-49), and RerankConfig (line 74-77) all provide default values for their model fields.

This inconsistency means that:

Users must explicitly set DS_OCR_MODEL or the application will fail to start

Other services can start with sensible defaults even if their model env vars are missing

The configuration pattern is not uniform across the codebase

Consider either:

Keeping the default value to maintain consistency with other configs

Or updating all other config classes to also require explicit model specification (if that's the desired pattern)

Suggested change

model: str = Field(..., description="DeepSeek-OCR Model Name")

model: str = Field(default="deepseek-ocr-base", description="DeepSeek-OCR Model Name")

Initial plan

b8c1455

Copilot AI assigned Copilot and BukeLy Dec 15, 2025

Copilot started work on behalf of BukeLy December 15, 2025 09:35 View session

Copilot AI changed the title ~~[WIP] Remove hardcoded default model name in DeepSeek-OCR~~ Remove hardcoded DeepSeek-OCR model default value Dec 15, 2025

Copilot finished work on behalf of BukeLy December 15, 2025 09:41

Copilot AI requested a review from BukeLy December 15, 2025 09:41

BukeLy approved these changes Dec 15, 2025

View reviewed changes

BukeLy marked this pull request as ready for review December 15, 2025 09:42

Copilot AI review requested due to automatic review settings December 15, 2025 09:42

Copilot started reviewing on behalf of BukeLy December 15, 2025 09:43 View session

Copilot AI reviewed Dec 15, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Remove hardcoded DeepSeek-OCR model default value #31

Remove hardcoded DeepSeek-OCR model default value #31

Uh oh!

Copilot AI commented Dec 15, 2025 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Copilot AI Dec 15, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

	model: str = Field(..., description="DeepSeek-OCR Model Name")
	model: str = Field(default="deepseek-ocr-base", description="DeepSeek-OCR Model Name")

Remove hardcoded DeepSeek-OCR model default value #31

Are you sure you want to change the base?

Remove hardcoded DeepSeek-OCR model default value #31

Uh oh!

Conversation

Copilot AI commented Dec 15, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Changes

受影响的文件

硬编码值

问题

解决方案

Comments on the Issue (you are @copilot in this section)

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Copilot AI Dec 15, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Copilot AI commented Dec 15, 2025 •

edited

Loading