HyperFold Technologies UK (HyperFoldUK)

Popular repositories

sparse-ternary-fma Public

High-performance ternary arithmetic kernel with 2-bit encoding and AVX-512 SIMD acceleration for FHE and AI applications

C 1
llm-2-bit-inference-kernel Public

This benchmark suite demonstrates the dramatic performance and memory efficiency advantages of 2-bit ternary weight quantization for BitNet-style 1.58-bit LLM inference, now with SparseTernaryFMA C…

Python
2bit-ternary-bandwidth Public

Surgical proof that 2-bit packed ternary encoding solves the memory bandwidth bottleneck in neural network inference.

C
BitNet Public

Forked from microsoft/BitNet

Official inference framework for 1-bit LLMs

C++
fused-vs-unpacked-bench Public

Fused computation on packed ternary data is fundamentally more efficient than decode-then-compute approaches. This is not about a specific FHE implementation or BitNet optimization. This is a compu…

C
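
Several of the descriptions above refer to 2-bit packed ternary weights and to fused computation on packed data versus a decode-then-compute path. The following is a minimal scalar sketch of those two ideas, not code taken from any of the repositories listed here. The 2-bit code assignment (00 for 0, 01 for +1, 10 for -1) and all function names are assumptions made for illustration, and the sketch uses no SIMD.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Assumed 2-bit codes (hypothetical, chosen for this sketch only):
   00 -> 0, 01 -> +1, 10 -> -1, 11 -> unused (treated as 0). */
static uint8_t trit_encode(int8_t t) {
    assert(t >= -1 && t <= 1);
    return (uint8_t)(t == 1 ? 0x1 : (t == -1 ? 0x2 : 0x0));
}

static int8_t trit_decode(uint8_t code) {
    /* Bit 0 contributes +1, bit 1 contributes -1. */
    return (int8_t)((code & 1) - ((code >> 1) & 1));
}

/* Pack n ternary weights into (n + 3) / 4 bytes, four weights per byte. */
static void pack_trits(const int8_t *w, size_t n, uint8_t *packed) {
    for (size_t i = 0; i < (n + 3) / 4; i++) {
        packed[i] = 0;
    }
    for (size_t i = 0; i < n; i++) {
        packed[i / 4] |= (uint8_t)(trit_encode(w[i]) << (2 * (i % 4)));
    }
}

/* Decode-then-compute: expand every weight into an int8 scratch
   buffer first, then run an ordinary dot product over that buffer. */
static int32_t dot_unpacked(const uint8_t *packed, const int8_t *x,
                            size_t n, int8_t *scratch) {
    for (size_t i = 0; i < n; i++) {
        scratch[i] = trit_decode((packed[i / 4] >> (2 * (i % 4))) & 0x3);
    }
    int32_t acc = 0;
    for (size_t i = 0; i < n; i++) {
        acc += scratch[i] * x[i];
    }
    return acc;
}

/* Fused: read the packed codes directly and add or subtract the
   activation, with no intermediate buffer and no multiplication. */
static int32_t dot_fused(const uint8_t *packed, const int8_t *x, size_t n) {
    int32_t acc = 0;
    for (size_t i = 0; i < n; i++) {
        uint8_t code = (packed[i / 4] >> (2 * (i % 4))) & 0x3;
        if (code == 0x1) {
            acc += x[i];
        } else if (code == 0x2) {
            acc -= x[i];
        }
    }
    return acc;
}
```

Under these assumptions, four weights occupy one byte, so the packed array is a quarter the size of an int8 weight array. Both `dot_unpacked` and `dot_fused` return the same value for the same inputs; the fused version simply skips the scratch buffer and the second pass over the data.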