This project aims to explore binary quantization techniques for OpenAI models.
Binary quantization is a technique used to reduce the memory footprint and computational requirements of deep learning models. This project focuses on applying binary quantization to OpenAI models to improve their efficiency.
To install the necessary dependencies, follow these steps:
- Clone the repository:
git clone https://github.com/qdrant/examples.git - Navigate to the project directory:
cd binary-quantization-openai - Install Poetry:
pip install poetry - Install the required packages:
poetry install --no-root
To use the binary quantization techniques on an OpenAI embedding, follow these steps:
- Load the created embedding into Qdrant
- Apply binary quantization to the collection
- Evaluate the performance of the quantized model
Contributions are welcome! If you have any ideas or suggestions, please open an issue or submit a pull request.
This project is licensed under the MIT License.