LLM-PBE is a toolkit for assessing the data privacy risks of LLMs. The code underlies the LLM-PBE benchmark, which received a 🏆 Best Research Paper Nomination at VLDB 2024.

To set up the environment:
```bash
conda create -n llm-pbe python=3.10 -y
conda activate llm-pbe
# If you hit a 'kernel image' error when running torch on the GPU, install a torch build that matches your CUDA version, e.g.:
pip install torch==1.12.1+cu116 torchvision==0.13.1+cu116 torchaudio==0.12.1 --extra-index-url https://download.pytorch.org/whl/cu116
pip install git+https://github.com/microsoft/analysing_pii_leakage.git
pip install wandb accelerate
pip install -r requirements.txt
```

You can find the attack demo below, which is also presented in AttackDemo.py.
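Before running the demo, if you hit the 'kernel image' error mentioned in the setup comments, you can quickly confirm that the installed torch build actually sees your GPU. This is a minimal sanity check, not part of the original setup steps:

```python
import torch

# True means the CUDA build of torch can talk to the GPU driver;
# False usually indicates a CPU-only or mismatched CUDA build.
print(torch.cuda.is_available())
if torch.cuda.is_available():
    print(torch.cuda.get_device_name(0))
```

The attack demo itself: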
```python
from data import JailbreakQueries
from models import TogetherAIModels
from attacks import Jailbreak
from metrics import JailbreakRate

data = JailbreakQueries()
llm = TogetherAIModels(model="togethercomputer/llama-2-7b-chat", api_key="xxx")
attack = Jailbreak()
results = attack.execute_attack(data, llm)
rate = JailbreakRate(results).compute_metric()
print("rate:", rate)
```
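As a usage note on the demo above: rather than hard-coding `api_key="xxx"`, the key can be read from an environment variable, and the same data, attack, and metric objects can be reused to compare several models. The sketch below assumes a `TOGETHER_API_KEY` environment variable and an illustrative second model identifier; neither is prescribed by LLM-PBE:

```python
import os

from attacks import Jailbreak
from data import JailbreakQueries
from metrics import JailbreakRate
from models import TogetherAIModels

data = JailbreakQueries()
attack = Jailbreak()

# Model identifiers to compare; the 13B entry is only an illustrative example.
model_names = [
    "togethercomputer/llama-2-7b-chat",
    "togethercomputer/llama-2-13b-chat",
]

for name in model_names:
    # Read the Together AI key from the environment instead of hard-coding it.
    llm = TogetherAIModels(model=name, api_key=os.environ["TOGETHER_API_KEY"])
    results = attack.execute_attack(data, llm)
    print(name, JailbreakRate(results).compute_metric())
```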
The evaluation step of a separate example, which compares attack metrics on a regular model and on a differentially private (DP) model, looks like the following; `evaluation`, `attack_dp_metrics`, and `dataset` are produced by earlier steps of that example (not shown here):

```python
dp_evaluation = metrics.Evaluate(attack_dp_metrics, ground_truths=dataset.labels)

# Output results
print(f"Attack metrics on regular model: {evaluation}")
print(f"Attack metrics on DP model: {dp_evaluation}")
```

Finetuning code is hosted separately with a different environment setup. Please refer to Private Finetuning for LLMs (LLM-PFT).