diffusiongemma
Here are 5 public repositories matching this topic...
Native WinUI 3 control panel for running local llama.cpp and DiffusionGemma backends with model management, Hugging Face downloads, runtime tuning, logs, and resource monitoring.
-
Updated
Jun 11, 2026 - C#
Matrix-style logit conditioning for DiffusionGemma's llama.cpp denoiser
-
Updated
Jun 15, 2026 - Python
Docker-Compose template to self-host Google DiffusionGemma 26B on an NVIDIA GPU host via llama.cpp
-
Updated
Jun 12, 2026 - Dockerfile
FastMCP fleet MCP server for diffusion LMs (dLLM). DiffusionGemma on Goliath RTX 4090 — batch inference, HLE-shaped reasoning, ~200–400 tok/s. Doc phase; llama-diffusion-cli sidecar next. Complements local-llm-mcp.
-
Updated
Jun 17, 2026
Improve this page
Add a description, image, and links to the diffusiongemma topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the diffusiongemma topic, visit your repo's landing page and select "manage topics."