Popular repositories Loading
-
sparse-ternary-fma
sparse-ternary-fma PublicHigh-performance ternary arithmetic kernel with 2-bit encoding and AVX-512 SIMD acceleration for FHE and AI applications
C 1
-
llm-2-bit-inference-kernel
llm-2-bit-inference-kernel PublicThis benchmark suite demonstrates the dramatic performance and memory efficiency advantages of 2-bit ternary weight quantization for BitNet-style 1.58-bit LLM inference, now with SparseTernaryFMA C…
Python
-
2bit-ternary-bandwidth
2bit-ternary-bandwidth PublicSurgical proof that 2-bit packed ternary encoding solves the memory bandwidth bottleneck in neural network inference.
C
-
-
fused-vs-unpacked-bench
fused-vs-unpacked-bench PublicFused computation on packed ternary data is fundamentally more efficient than decode-then-compute approaches. This is not about a specific FHE implementation or BitNet optimization. This is a compu…
C
If the problem persists, check the GitHub status page or contact support.