Transforming AI from unsustainable to attainable. d-Matrix powers next generation compute for generative AI inference.
d-Matrix
Repositories
- .github Public
- dmx-compressor Public
  d-Matrix DMX Compressor: A PyTorch toolkit for nn.Module transformations supporting advanced quantization, sparsity, and elementwise function approximations.
- lm-evaluation-harness-nikita Public (forked from EleutherAI/lm-evaluation-harness)
  A framework for few-shot evaluation of language models.
- pt2-bfp Public
  Implementation of Block Floating Point supporting the PyTorch 2.0 export quantization route.
- bigcode-evaluation-harness Public (forked from bigcode-project/bigcode-evaluation-harness)
  A framework for the evaluation of autoregressive code generation language models.
- keyformer-llm Public
  Keyformer proposes KV cache reduction through key-token identification, without the need for fine-tuning.
- gptq Public (forked from IST-DASLab/gptq)
  Code for the paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers".