
Add p quantization to our triton fa kernel #1757

Open
sychen52 wants to merge 2 commits into NVIDIA:main from sychen52:p_quant

make p_qdq_amax default to 1.0 instead of scale

afb1a4f
DCO / DCO succeeded Jun 23, 2026 in 1s

DCO

All commits are signed off!