Skip to content
#

diffusiongemma

Here are 5 public repositories matching this topic...

FastMCP fleet MCP server for diffusion LMs (dLLM). DiffusionGemma on Goliath RTX 4090 — batch inference, HLE-shaped reasoning, ~200–400 tok/s. Doc phase; llama-diffusion-cli sidecar next. Complements local-llm-mcp.

  • Updated Jun 17, 2026

Improve this page

Add a description, image, and links to the diffusiongemma topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the diffusiongemma topic, visit your repo's landing page and select "manage topics."

Learn more