Skip to content
@neuralmagic

Neural Magic

Neural Magic (Acquired by Red Hat) empowers developers to optimize & deploy LLMs at scale. Our model compression & acceleration enable top performance with vLLM

Pinned Loading

  1. deepsparse deepsparse Public archive

    Sparsity-aware deep learning inference runtime for CPUs

    Python 3.2k 190

Repositories

Showing 10 of 89 repositories
  • vllm Public Forked from vllm-project/vllm

    A high-throughput and memory-efficient inference and serving engine for LLMs

    neuralmagic/vllm’s past year of commit activity
    Python 16 Apache-2.0 14,912 0 31 Updated Mar 25, 2026
  • research Public

    Repository to enable research flows

    neuralmagic/research’s past year of commit activity
    Python 3 0 0 3 Updated Mar 24, 2026
  • GuardBench Public Forked from eldarkurtic/GuardBench

    A Python library for guardrail models evaluation with vLLM support.

    neuralmagic/GuardBench’s past year of commit activity
    Python 0 EUPL-1.2 9 0 6 Updated Mar 23, 2026
  • SWE-bench Public Forked from SWE-bench/SWE-bench

    SWE-bench: Can Language Models Resolve Real-world Github Issues?

    neuralmagic/SWE-bench’s past year of commit activity
    Python 0 MIT 812 0 0 Updated Mar 23, 2026
  • lighteval Public Forked from huggingface/lighteval

    Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends

    neuralmagic/lighteval’s past year of commit activity
    Python 0 MIT 446 0 1 Updated Mar 23, 2026
  • nyann_poker Public
    neuralmagic/nyann_poker’s past year of commit activity
    Go 0 Apache-2.0 0 0 2 Updated Mar 22, 2026
  • axolotl Public Forked from axolotl-ai-cloud/axolotl

    Go ahead and axolotl questions

    neuralmagic/axolotl’s past year of commit activity
    Python 0 Apache-2.0 1,298 0 5 Updated Mar 15, 2026
  • lm-evaluation-harness Public Forked from EleutherAI/lm-evaluation-harness

    A framework for few-shot evaluation of language models.

    neuralmagic/lm-evaluation-harness’s past year of commit activity
    Python 5 MIT 3,147 0 1 Updated Mar 14, 2026
  • lmms-eval Public Forked from EvolvingLMMs-Lab/lmms-eval

    Accelerating the development of large multimodal models (LMMs) with one-click evaluation module - lmms-eval.

    neuralmagic/lmms-eval’s past year of commit activity
    Python 0 546 0 12 Updated Mar 12, 2026
  • DeepEP Public Forked from deepseek-ai/DeepEP

    DeepEP: an efficient expert-parallel communication library

    neuralmagic/DeepEP’s past year of commit activity
    Cuda 1 MIT 1,122 0 0 Updated Mar 11, 2026