Nodes to run Hunyuan Image 3 locally in ComfyUI, with BF16 and NF4 quantization options
Updated Feb 15, 2026 - Python
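The NF4 option mentioned above refers to 4-bit normal-float quantization as popularized by QLoRA: weights are split into blocks, scaled by each block's absolute maximum, and snapped to a fixed 16-level codebook. A minimal pure-Python sketch (not the actual node implementation; in practice bitsandbytes handles this, and the codebook values below are rounded from the QLoRA reference):

```python
# 16 normal-float levels in [-1, 1] (rounded from the QLoRA/bitsandbytes codebook)
NF4_CODES = [-1.0, -0.6962, -0.5251, -0.3949, -0.2844, -0.1848, -0.0911, 0.0,
             0.0796, 0.1609, 0.2461, 0.3379, 0.4407, 0.5626, 0.7230, 1.0]

def quantize_block(block):
    """Blockwise absmax quantization: store one 4-bit index per weight plus one scale."""
    absmax = max(abs(x) for x in block) or 1.0  # avoid division by zero on all-zero blocks
    idxs = [min(range(16), key=lambda i: abs(NF4_CODES[i] - x / absmax)) for x in block]
    return idxs, absmax

def dequantize_block(idxs, absmax):
    """Reconstruct approximate weights from 4-bit indices and the block scale."""
    return [NF4_CODES[i] * absmax for i in idxs]
```

Block extremes and zeros round-trip exactly; everything else lands on the nearest codebook level, which is why NF4 roughly quarters weight VRAM at a small accuracy cost relative to BF16.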
Predictive VRAM Virtualization Engine
LoRA Lens v1.6: compress LoRAs by up to 94% to fit more specialized adapters into VRAM at once. Introduces the .loradb database format, designed specifically for LoRAs, for further size reduction.
INT8 Sparse Tensor Core GEMM for PyTorch — built for Windows
Know before you train — VRAM estimation for LLM fine-tuning.
LEMA (Layer-wise Efficient Memory Abstraction): A hardware-aware framework for fine-tuning LLMs in VRAM-constrained environments using asynchronous binary pre-fetching and triple-tier memory orchestration.
A Proof of Concept for the LEMA (Layer-wise Efficient Memory Abstraction) framework. Enables stable fine-tuning of Llama-2-7B on consumer-grade hardware (16GB VRAM) through layer-wise weight streaming and triple-buffer memory virtualization.
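The core trick LEMA describes, overlapping the load of the next layer with computation on the current one while bounding how many layers are resident at once, can be sketched with a bounded queue and a prefetch thread. This is a simplification of the repo's "triple-tier" design, and the `load_layer`/`run_layer` callables are hypothetical placeholders for disk-to-VRAM transfer and a layer forward pass:

```python
import threading
import queue

def stream_forward(layer_ids, load_layer, run_layer, x, depth=2):
    """Run layers in order while a background thread prefetches weights.

    `depth` caps how many loaded layers are resident at once (the "VRAM budget");
    put() blocks when the buffer is full, so loading never runs ahead of compute
    by more than `depth` layers.
    """
    buf = queue.Queue(maxsize=depth)

    def prefetch():
        for lid in layer_ids:
            buf.put(load_layer(lid))  # blocks until a buffer slot frees up

    t = threading.Thread(target=prefetch, daemon=True)
    t.start()
    for _ in layer_ids:
        layer = buf.get()        # next resident layer's weights
        x = run_layer(layer, x)  # compute overlaps with the next load
    t.join()
    return x
```

With real models the loads would be async host-to-device copies on a separate CUDA stream rather than a Python thread, but the residency-bounded pipeline is the same idea.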
🔄 Stretch past your GPU's VRAM limits with SynapSwap, a predictive virtualization engine that runs large AI models on consumer hardware.