Feat/claude opus 4 7 #5

Open
Khurdhula-Harshavardhan wants to merge 4 commits into main from feat/claude-opus-4-7

@Khurdhula-Harshavardhan
Contributor

  • add opus 4.7

Khurdhula-Harshavardhan and others added 4 commits April 30, 2026 16:10
…rature

Opus 4.7 returns a 400 when temperature/top_p/top_k are set; the safest
migration per Anthropic's docs is to omit them entirely. Guard on the
model id so existing 4.6 / Sonnet 4.6 runs continue to send
temperature=0.0 unchanged.
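
A minimal sketch of the kind of guard described above, not the code in this PR. The helper name `build_request_kwargs`, the `claude-opus-4-7` id prefix, and the `max_tokens` default are all assumptions made for illustration:

```python
from typing import Any

# Assumed id prefix for models that reject sampling parameters; the real
# check in the PR may match on a different string.
NO_SAMPLING_PARAMS_PREFIXES = ("claude-opus-4-7",)


def build_request_kwargs(model_id: str, max_tokens: int = 1024) -> dict[str, Any]:
    """Build request kwargs, omitting temperature for models that reject it."""
    kwargs: dict[str, Any] = {"model": model_id, "max_tokens": max_tokens}
    if not model_id.startswith(NO_SAMPLING_PARAMS_PREFIXES):
        # Older models keep the existing deterministic setting.
        kwargs["temperature"] = 0.0
    return kwargs
```

Omitting the key, rather than sending `None` or a default value, matches the commit's stated approach of leaving the parameters out entirely.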

Ref: https://platform.claude.com/docs/en/about-claude/models/whats-new-claude-4-7

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

vllm pulls CUDA-only transitive deps (nvidia-cudnn-frontend etc.) that
have no macOS / arm64 wheels, breaking `uv sync` for anyone running
hosted-API providers on a Mac. Move vllm into [project.optional-dependencies]
so the base resolve is cross-platform; CUDA users opt in with
`uv sync --extra vllm`.
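
Once vllm is an optional extra, any code path that imports it has to cope with it being absent. The sketch below shows one way to do that; the helper names `extra_available` and `require_vllm` are hypothetical and do not come from this PR:

```python
import importlib
import importlib.util


def extra_available(module_name: str) -> bool:
    """Return True if a top-level module can be imported in this environment."""
    return importlib.util.find_spec(module_name) is not None


def require_vllm():
    """Import vllm lazily, pointing users at the opt-in extra if it is missing."""
    if not extra_available("vllm"):
        raise RuntimeError(
            "vllm is not installed; install it with `uv sync --extra vllm`"
        )
    return importlib.import_module("vllm")
```

Hosted-API providers never call `require_vllm()`, so a base install on macOS would not hit the import at all.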

Verified base `uv sync` succeeds on darwin-arm64 and that
`uv pip compile --extra vllm --python-platform x86_64-unknown-linux-gnu`
still resolves vllm==0.17.0 with its CUDA deps for Linux users.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
@github-actions

🏆 Leaderboard preview

Built 25 models, top 10 by Overall:

| Rank | Model | Overall | Val. Acc. | JSON Pass | Perfect |
|---|---|---|---|---|---|
| 1 | GPT-5.4 | 0.870 | 0.798 | 0.993 | 0.469 |
| 2 | gemini-3.1-pro-preview | 0.869 | 0.820 | 0.966 | 0.542 |
| 3 | Claude-Opus-4.7 | 0.864 | 0.785 | 0.994 | 0.426 |
| 4 | GLM-4.7 | 0.861 | 0.804 | 0.965 | 0.508 |
| 5 | Qwen3.5-35B | 0.861 | 0.801 | 0.969 | 0.500 |
| 6 | gpt-5.5 | 0.860 | 0.795 | 0.978 | 0.464 |
| 7 | Gemini-2.5-Flash | 0.860 | 0.796 | 0.972 | 0.498 |
| 8 | Qwen3-235B | 0.857 | 0.786 | 0.978 | 0.463 |
| 9 | Interfaze-Beta | 0.855 | 0.795 | 0.967 | 0.480 |
| 10 | Claude-Sonnet-4.6 | 0.854 | 0.779 | 0.979 | 0.442 |

Generated at 2026-04-30T23:41:32+00:00 • full JSON in workflow artifacts
