Skip to content

d-Matrix.ai

Transforming AI from unsustainable to attainable. d-Matrix powers next generation compute for generative AI inference.

Pinned Loading

  1. dmx-compressor dmx-compressor Public

    d-Matrix DMX Compressor: A Pytorch toolkit for nn.Module transformations supporting advanced quantization, sparsity, and elementwise function approximations.

    Python 22 2

  2. keyformer-llm keyformer-llm Public

    Keyformer proposes KV Cache reduction through key tokens identification and without the need for fine-tuning

    Python 59 5

  3. comet comet Public

    COMET is a framework for modeling and optimizing dataflow for compound operations on machine learning accelerators

    C++ 2

  4. rlquant rlquant Public

    Reinforcement Learning algorithms (GRPO, drGRPO, etc) under quantization (QAT, PTQ)

    Python 1 1

  5. pt2-bfp pt2-bfp Public

    Implementation of Block Floating Point supporting the Pytorch 2.0 export quantization route.

    Python

  6. bigcode-evaluation-harness bigcode-evaluation-harness Public

    Forked from bigcode-project/bigcode-evaluation-harness

    A framework for the evaluation of autoregressive code generation language models.

    Python

Repositories

Showing 10 of 10 repositories

Top languages

Loading…

Most used topics

Loading…