Transformers-Labs is a research-oriented repository for experimenting with, benchmarking, and fine-tuning transformer-based models. It provides a unified platform for training, evaluation, quantization, and deployment of state-of-the-art models, with robust support for multimodal and distributed workflows, and is designed for reproducible research, scalable experimentation, and rapid prototyping in both academic and industrial settings.
This repository exists to advance research in transformer architectures, quantization techniques, and large-scale model deployment. Key goals include:
- Benchmarking transformer models across hardware and cloud platforms
- Fine-tuning and evaluating models for NLP and multimodal tasks
- Experimenting with quantization (GPTQ, 4/8-bit) and efficient inference
- Integrating with cloud infrastructure (Azure, AWS SageMaker)
- Supporting multimodal research (video, text)
Capabilities include:
- Model training and evaluation pipelines
- Inference benchmarking (latency, throughput, accuracy)
- Quantization and deployment workflows
- Infrastructure-as-code for reproducible cloud setups
- Multimodal experimentation (video-llava)
- `model-train/` – Jupyter notebooks and scripts for model training and fine-tuning
- `model-eval/` – Evaluation pipelines, metrics, and analysis notebooks
- `inference-benchmark/` – Scripts for benchmarking inference performance
- `optimum-benchmark/` – Advanced benchmarking using HuggingFace Optimum
- `sagemaker-benchmark/`, `sagemaker-labs/` – AWS SageMaker integration for distributed training and benchmarking
- `terraform/azure-workstation/` – Terraform scripts for provisioning Azure GPU workstations
- `video-llava/` – Multimodal (video+text) model experimentation
- `AutoGPTQ/` – GPTQ quantization, CUDA builds, and extension modules
- `mistral-common/` – Utilities and shared code for Mistral models
- `requirements.txt`, `pyproject.toml` – Python dependencies and environment configuration
- `benchmarks/`, `model-info/`, `model/` – Model artifacts, configs, and benchmark results
- Transformer fine-tuning (BERT, T5, LLaMA, Mistral, etc.)
- GPTQ quantization (4/8-bit) via AutoGPTQ
- HuggingFace TRL integration for RLHF and advanced training
- SageMaker benchmarking and distributed training
- Azure infrastructure provisioning with Terraform
- CUDA-enabled PyTorch builds for efficient GPU utilization
- Multimodal research (video-llava)
- Inference benchmarking and reporting
- Environment management with Conda and .env files
- Code formatting and linting with Ruff
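As a concrete illustration of what the 4/8-bit quantization listed above does to model weights, here is a minimal pure-Python sketch of round-to-nearest affine quantization — the baseline scheme that GPTQ refines with error-compensating updates. This is illustrative only, not the AutoGPTQ API:

```python
def quantize_dequantize(weights, bits=4):
    """Round-to-nearest affine quantization of a list of floats to `bits`-bit
    unsigned integers, then back to floats. Returns (ints, reconstructed)."""
    qmax = (1 << bits) - 1           # 15 for 4-bit, 255 for 8-bit
    lo, hi = min(weights), max(weights)
    scale = (hi - lo) / qmax or 1.0  # guard against all-equal weights
    ints = [round((w - lo) / scale) for w in weights]
    recon = [q * scale + lo for q in ints]
    return ints, recon

ints, recon = quantize_dequantize([-0.5, 0.0, 0.25, 0.5], bits=4)
# each reconstructed weight is within half a quantization step of the original
```

Halving `bits` from 8 to 4 halves storage but doubles the quantization step, which is why GPTQ's per-column error compensation matters at 4-bit.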
- CUDA Toolkit (>=11.x recommended)
- NVIDIA drivers (latest)
- GCC (>=9.x)
- pkg-config, libmysqlclient-dev (for some quantization/builds)
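A quick way to confirm the CUDA Toolkit prerequisite from Python, using only the standard library. The parsing assumes nvcc's usual `release X.Y` banner line and returns `None` when `nvcc` is not on `PATH`:

```python
import shutil
import subprocess

def cuda_toolkit_version():
    """Return the installed CUDA toolkit version (e.g. '12.1'),
    or None if nvcc is not on PATH or its output is unrecognized."""
    nvcc = shutil.which("nvcc")
    if nvcc is None:
        return None
    out = subprocess.run([nvcc, "--version"], capture_output=True, text=True).stdout
    # nvcc typically prints: "Cuda compilation tools, release 12.1, V12.1.105"
    for line in out.splitlines():
        if "release" in line:
            return line.split("release")[1].split(",")[0].strip()
    return None
```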
```bash
conda env create -f environment.yml
conda activate transformers-labs
```

Key packages:
- torch
- transformers
- trl
- optimum
- auto-gptq
- langchain
- evaluate
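To verify the environment is complete, the installed versions of the key packages can be checked with the standard library alone. The names below follow the list above and are PyPI distribution names (`auto-gptq` is assumed to be the distribution name of AutoGPTQ):

```python
from importlib.metadata import version, PackageNotFoundError

REQUIRED = ["torch", "transformers", "trl", "optimum", "auto-gptq", "langchain", "evaluate"]

def check_packages(names):
    """Map each distribution name to its installed version, or None if missing."""
    report = {}
    for name in names:
        try:
            report[name] = version(name)
        except PackageNotFoundError:
            report[name] = None
    return report
```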
- Store the Hugging Face token and other secrets in `.env`
- Example:

```
HF_TOKEN=your_huggingface_token
AWS_ACCESS_KEY_ID=...
AZURE_SUBSCRIPTION_ID=...
```
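If you prefer not to depend on python-dotenv, a minimal loader for a file like the one above can be sketched in a few lines. It handles only simple `KEY=VALUE` lines; quoting and variable interpolation are out of scope:

```python
import os

def load_dotenv(path=".env"):
    """Minimal .env loader: export KEY=VALUE lines into os.environ.
    Existing environment variables are left untouched."""
    with open(path) as fh:
        for line in fh:
            line = line.strip()
            if not line or line.startswith("#") or "=" not in line:
                continue
            key, _, value = line.partition("=")
            os.environ.setdefault(key.strip(), value.strip())
```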
- See the `model-train/` and `model-eval/` notebooks for training and evaluation workflows
- Example:

```bash
# Train
jupyter nbconvert --to notebook --execute model-train/train-gpt2.ipynb
# Evaluate
jupyter nbconvert --to notebook --execute model-eval/eval.ipynb
```
- Use the scripts in `inference-benchmark/` and `optimum-benchmark/`
- Example:

```bash
python inference-benchmark/benchmark.py
```
- See `sagemaker-benchmark/` and `sagemaker-labs/` for distributed training and benchmarking
- Example:

```bash
python sagemaker-benchmark/run_benchmark.py
```
- Use the Terraform scripts in `terraform/azure-workstation/`
- Example:

```bash
cd terraform/azure-workstation
terraform init
terraform apply -auto-approve
```
- See `video-llava/` for video+text model workflows
- Run benchmarking pipelines in `inference-benchmark/`, `optimum-benchmark/`, and `benchmarks/`
- Results are stored in CSV/JSON format for reproducibility
- Example:

```bash
python inference-benchmark/benchmark.py --model gpt2 --output results/gpt2_benchmark.csv
```
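The measurement loop behind a latency benchmark like the command above can be sketched as follows. `benchmark` and `write_results` are illustrative names, not the repo's actual functions, but the warmup-then-percentile pattern is standard practice for latency reporting:

```python
import csv
import statistics
import time

def benchmark(fn, warmup=3, iters=20):
    """Time `fn` over `iters` runs after discarding `warmup` runs;
    return mean and p95 latency in milliseconds."""
    for _ in range(warmup):
        fn()  # warmup runs absorb caching and JIT effects
    samples = []
    for _ in range(iters):
        start = time.perf_counter()
        fn()
        samples.append((time.perf_counter() - start) * 1e3)
    return {
        "mean_ms": statistics.fmean(samples),
        "p95_ms": statistics.quantiles(samples, n=20)[-1],  # 19 cut points; last is p95
    }

def write_results(path, model, stats):
    """Append one benchmark row to a CSV file, mirroring the repo's CSV output."""
    with open(path, "a", newline="") as fh:
        csv.writer(fh).writerow([model, f"{stats['mean_ms']:.3f}", f"{stats['p95_ms']:.3f}"])
```

Reporting a percentile alongside the mean matters because inference latency distributions are typically long-tailed.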
- Interpret results using the provided analysis notebooks in `model-eval/`
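For quick inspection outside the notebooks, a per-model summary of a results CSV can be computed with the standard library alone. The `model` and `latency_ms` column names are assumptions about the result schema; adapt them to the repo's actual output:

```python
import csv
import statistics

def summarize(path, metric="latency_ms"):
    """Group a benchmark CSV by model and report mean/stdev of one metric."""
    by_model = {}
    with open(path, newline="") as fh:
        for row in csv.DictReader(fh):
            by_model.setdefault(row["model"], []).append(float(row[metric]))
    return {
        model: {
            "mean": statistics.fmean(vals),
            "stdev": statistics.stdev(vals) if len(vals) > 1 else 0.0,
        }
        for model, vals in by_model.items()
    }
```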
- Code formatting and linting: use Ruff (`ruff format ...` to format, `ruff check ...` to lint)
- Jupyter/interactive workflow: use `%load_ext autoreload` and `%autoreload 2` for live code reload
- Debugging: common issues include CUDA setup, missing NVIDIA drivers, and unset environment variables
- Use `.env` for secrets and tokens
- Extend to new transformer architectures (e.g., Mixtral, Phi-3)
- Larger scale distributed experiments (multi-node, multi-GPU)
- Advanced quantization and pruning strategies
- Multimodal fusion and cross-modal benchmarks
- Integration with additional cloud providers (GCP, OCI)
- Automated hyperparameter tuning and experiment tracking
- Fork the repository and submit pull requests
- Add new models, benchmarks, or infrastructure scripts
- Cite this work in academic publications
- See `CONTRIBUTING.md` for guidelines
- Hugging Face Transformers
- PyTorch
- CUDA Toolkit
- AutoGPTQ
- TRL
- Optimum
- SageMaker
- Azure Machine Learning
This repository is licensed under the MIT License. See LICENSE for details.