Skip to content
View HyperFoldUK's full-sized avatar
  • HyperFold Technologies UK Ltd
  • United Kingdom
  • Joined Dec 18, 2025

Block or report HyperFoldUK

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Popular repositories Loading

  1. sparse-ternary-fma sparse-ternary-fma Public

    High-performance ternary arithmetic kernel with 2-bit encoding and AVX-512 SIMD acceleration for FHE and AI applications

    C 1

  2. llm-2-bit-inference-kernel llm-2-bit-inference-kernel Public

    This benchmark suite demonstrates the dramatic performance and memory efficiency advantages of 2-bit ternary weight quantization for BitNet-style 1.58-bit LLM inference, now with SparseTernaryFMA C…

    Python

  3. 2bit-ternary-bandwidth 2bit-ternary-bandwidth Public

    Surgical proof that 2-bit packed ternary encoding solves the memory bandwidth bottleneck in neural network inference.

    C

  4. BitNet BitNet Public

    Forked from microsoft/BitNet

    Official inference framework for 1-bit LLMs

    C++

  5. fused-vs-unpacked-bench fused-vs-unpacked-bench Public

    Fused computation on packed ternary data is fundamentally more efficient than decode-then-compute approaches. This is not about a specific FHE implementation or BitNet optimization. This is a compu…

    C