improve multimodal image preprocessing with max_image_pixels auto-resize by hiworldwzj · Pull Request #1309 · ModelTC/LightLLM

hiworldwzj · 2026-05-14T09:31:29Z

No description provided.

gemini-code-assist

Code Review

This pull request introduces a --max_image_pixels parameter to limit the pixel count of input images, automatically resizing them if they exceed the threshold (defaulting to ~4K resolution). The implementation includes updates to the CLI, documentation, and multimodal processing logic. Feedback highlights several critical areas for improvement: a potential infinite loop if the pixel limit is set to zero, a possible ZeroDivisionError when handling zero-pixel inputs, and performance inefficiencies caused by redundant image decoding and multiple thread pool executions. Additionally, the hardcoded JPEG conversion may lead to data loss for transparent images, and the high-quality optimization settings could introduce unnecessary CPU overhead.

gemini-code-assist · 2026-05-14T09:36:55Z

+    while new_w * new_h > max_image_pixels:
+        if new_w >= new_h:
+            new_w = max(1, new_w - 1)
+        else:
+            new_h = max(1, new_h - 1)


If max_image_pixels is set to 0, this loop will become infinite because new_w * new_h (which will be 1 * 1) will always be greater than 0, and max(1, new_w - 1) will keep the value at 1. Ensure max_image_pixels is at least 1 before entering this logic.

gemini-code-assist · 2026-05-14T09:36:55Z

+            src_w, src_h = await loop.run_in_executor(_IMAGE_VERIFY_POOL, _verify_image_bytes, img_data)
+            # 2) Resize (or no-op) after verification.
+            img_data, resized_w, resized_h = await loop.run_in_executor(
+                _IMAGE_VERIFY_POOL,
+                _resize_image_bytes_if_needed,
+                img_data,
+                src_w,
+                src_h,
+                max_image_pixels,
+            )


The current implementation performs image verification and resizing in two separate run_in_executor calls. This is inefficient because _verify_image_bytes already decodes the image (via image.load()), and _resize_image_bytes_if_needed decodes it again (via Image.open()). Additionally, large image bytes are passed between the event loop and the thread pool twice.

Consider combining these operations into a single helper function to avoid redundant decoding and overhead.

gemini-code-assist · 2026-05-14T09:36:55Z

+    if old_pixels <= max_image_pixels:
+        return src_w, src_h
+
+    scale = (max_image_pixels / old_pixels) ** 0.5


Potential ZeroDivisionError if old_pixels is 0. While _verify_image_bytes should catch invalid images, src_w and src_h can be 0 if provided via the image_size type (lines 148-149).

Suggested change

scale = (max_image_pixels / old_pixels) ** 0.5

if old_pixels <= max_image_pixels or old_pixels == 0:

return src_w, src_h

gemini-code-assist · 2026-05-14T09:36:55Z

+        resized_image = image.resize((new_w, new_h), resampling).convert("RGB")
+
+        buffer = BytesIO()
+        resized_image.save(buffer, format="JPEG", quality=96, optimize=True)


Hardcoding format="JPEG" and quality=96 with optimize=True might be suboptimal.

If the input was a PNG with transparency, convert("RGB") will result in a black background, and the alpha channel will be lost.

optimize=True can be CPU-intensive for a real-time server.

quality=96 is very high; 90 is usually sufficient for VLM tasks and results in smaller payloads.

improve multimodal image preprocessing with max_image_pixels auto-resize

d021718

gemini-code-assist Bot reviewed May 14, 2026

View reviewed changes

fix

0980f3a

shihaobai merged commit 45e8cca into main May 14, 2026
1 check passed

shihaobai deleted the wzj_dev branch May 14, 2026 12:01

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

improve multimodal image preprocessing with max_image_pixels auto-resize#1309

improve multimodal image preprocessing with max_image_pixels auto-resize#1309
shihaobai merged 2 commits into
mainfrom
wzj_dev

hiworldwzj commented May 14, 2026

Uh oh!

gemini-code-assist Bot left a comment

Uh oh!

gemini-code-assist Bot May 14, 2026

Uh oh!

gemini-code-assist Bot May 14, 2026

Uh oh!

gemini-code-assist Bot May 14, 2026

Uh oh!

gemini-code-assist Bot May 14, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

	scale = (max_image_pixels / old_pixels) ** 0.5
	if old_pixels <= max_image_pixels or old_pixels == 0:
	return src_w, src_h

Conversation

hiworldwzj commented May 14, 2026

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist Bot May 14, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist Bot May 14, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist Bot May 14, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist Bot May 14, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants