Learning has no ending
Building LLMs, diffusion models, and VLMs from scratch in PyTorch.
MoE · FP8/Triton · FSDP · DDPM/DDIM · GAN/VAE. No black boxes.
Pinned Loading
-
StableDiffusion
StableDiffusion PublicA Stable Diffusion 1.x-class latent diffusion model trained from scratch on 2× RTX 5090 (Blackwell) GPUs. Full UNet (~860M params), DDPM/DDIM, LAION pipeline, DDP+BF16.
Python
-
FusionLLM
FusionLLM PublicHybrid LLM pre-training framework fusing Multi-Head Latent Attention, Gated Delta Net, DeepSeek MoE, and Multi-Token Prediction — 415M active params, 8.31B tokens, single A100 80GB.
Python
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.

