Skip to content

chore(deps): make vLLM an optional extra so base install works on macOS#4

Open
Khurdhula-Harshavardhan wants to merge 3 commits intomainfrom
chore/optional-vllm
Open

chore(deps): make vLLM an optional extra so base install works on macOS#4
Khurdhula-Harshavardhan wants to merge 3 commits intomainfrom
chore/optional-vllm

Conversation

@Khurdhula-Harshavardhan
Copy link
Copy Markdown
Contributor

Summary

  • vLLM is currently a hard dep, which pulls CUDA-only transitive deps (nvidia-cudnn-frontend, etc.) with no macOS / arm64 wheels — uv sync fails immediately on a Mac.
  • Move vllm==0.17.0 into [project.optional-dependencies] so the base resolve is cross-platform. CUDA users opt in with uv sync --extra vllm.
  • README updated to document the new install path.

Test plan

  • uv sync succeeds on darwin-arm64 (was failing on nvidia-cudnn-frontend).
  • uv run python -c "import datasets, anthropic, openai, google.genai, sentence_transformers" works on Mac.
  • uv pip compile pyproject.toml --extra vllm --python-platform x86_64-unknown-linux-gnu still resolves vllm==0.17.0 and its CUDA deps for Linux users.
  • Verify on a CUDA host: uv sync --extra vllm followed by a vLLM-provider inference run.

🤖 Generated with Claude Code

Khurdhula-Harshavardhan and others added 3 commits April 30, 2026 16:10
…rature

Opus 4.7 returns a 400 when temperature/top_p/top_k are set; the safest
migration per Anthropic's docs is to omit them entirely. Guard on the
model id so existing 4.6 / Sonnet 4.6 runs continue to send
temperature=0.0 unchanged.

Ref: https://platform.claude.com/docs/en/about-claude/models/whats-new-claude-4-7

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
vllm pulls CUDA-only transitive deps (nvidia-cudnn-frontend etc.) that
have no macOS / arm64 wheels, breaking `uv sync` for anyone running
hosted-API providers on a Mac. Move vllm into [project.optional-dependencies]
so the base resolve is cross-platform; CUDA users opt in with
`uv sync --extra vllm`.

Verified base `uv sync` succeeds on darwin-arm64 and that
`uv pip compile --extra vllm --python-platform x86_64-unknown-linux-gnu`
still resolves vllm==0.17.0 with its CUDA deps for Linux users.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant