Code for the NAACL 2022 paper "Automatic Multi-Label Prompting: Simple and Interpretable Few-Shot Classification".
You can install all required packages by running the following command:
pip install -r requirements.txt

Note: Different versions of packages (like PyTorch, Transformers, etc.) may lead to results that differ from those reported in the paper. However, the overall trend should hold regardless of which package versions you use.
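If you want to record the exact package versions used for a run (useful when comparing numbers against the paper), one simple, optional way is:

pip freeze > environment.txt
python -c "import torch, transformers; print(torch.__version__, transformers.__version__)"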
We follow the setup in LM-BFF for few-shot text classification. Therefore, you can follow the same steps to prepare the data.
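For reference, LM-BFF-style data preparation looks roughly like the commands below. The script names and flags are taken from the LM-BFF repository and may change across versions, so treat this as a sketch and consult the LM-BFF README for the authoritative steps:

cd LM-BFF/data
bash download_dataset.sh
cd ..
python tools/generate_k_shot_data.py --k 16 --data_dir data/original --output_dir data

This produces per-task, per-seed directories such as data/k-shot/SST-2/16-42, which is the layout expected by --data_dir in the example below.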
Taking the 16-shot SST-2 dataset with seed 42 as an example, you can run our code as follows:
python run_prompt.py \
--model_name_or_path roberta-large \
--task_name sst2 \
--data_dir data/k-shot/SST-2/16-42 \
--output_dir outputs \
--shot_num 16 \
--seed 42 \
--max_train_steps 1000 \
--num_warmup_steps 0 \
--eval_steps 100 \
--learning_rate 1e-5 \
--per_device_train_batch_size 8 \
--per_device_eval_batch_size 8 \
--top_k 16 \
--max_seq_len 128 \
--template *cls**sent_0*_It_was*mask*.*sep+* \
--dedup

Most arguments are inherited from transformers and are easy to understand. We further explain the AMuLaP-specific arguments below (some are not shown in the example):
- shot_num: the number of shots for each class in the training set.
- top_k: the number of label tokens to use for each class.
- template: the template for the prompt.
- dedup: whether to remove duplicate label tokens that appear in more than one class.
- no_finetune: whether to disable fine-tuning.
- random_k_token: whether to use k random tokens for each class.
- label_token_mode: the method used to obtain the label tokens. The options are:
  - AMuLaP: the method proposed in our paper (a minimal sketch of its label-token selection follows this list).
  - AutoL: the automatic label searching method used in LM-BFF. We use the sorted results of the automatic label searching, which you can find in LM-BFF/auto_label_mapping. You should use --mapping_path to read the label mapping file. For example, to use the corresponding label mapping file for the 16-shot SST-2 dataset with seed 42, you can use the following command:

python run_prompt.py \
--model_name_or_path roberta-large \
--task_name sst2 \
--data_dir data/k-shot/SST-2/16-42 \
--output_dir outputs \
--shot_num 16 \
--seed 42 \
--max_train_steps 1000 \
--num_warmup_steps 0 \
--eval_steps 100 \
--learning_rate 1e-5 \
--per_device_train_batch_size 8 \
--per_device_eval_batch_size 8 \
--top_k 16 \
--max_seq_len 128 \
--template *cls**sent_0*_It_was*mask*.*sep+* \
--label_token_mode AutoL \
--mapping_path auto_label_mapping/SST-2/16-42.sort.txt

  - PETAL: the method used in PETAL. You can use petal.py in PET to generate the label mapping file for each training set, and then change --label_token_mode to PETAL and --mapping_path to the corresponding generated label mapping file in the above command.
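As a rough illustration of what AMuLaP's label-token selection does, here is a minimal sketch. It assumes the mask-position probabilities have already been averaged over each class's few-shot training examples; the function and variable names are hypothetical and not the repository's actual API (see run_prompt.py for the real implementation):

import torch

def select_label_tokens(class_probs, top_k=16, dedup=False):
    # class_probs: dict mapping a class id to the MLM's probability
    # distribution at the *mask* position, averaged over that class's
    # few-shot training examples (a 1-D tensor of size vocab_size).
    label_map = {c: torch.topk(p, top_k).indices.tolist()
                 for c, p in class_probs.items()}
    if dedup:
        # --dedup: drop label tokens that were selected for more than one class
        counts = {}
        for tokens in label_map.values():
            for t in tokens:
                counts[t] = counts.get(t, 0) + 1
        label_map = {c: [t for t in tokens if counts[t] == 1]
                     for c, tokens in label_map.items()}
    return label_map

At inference time, a class is then scored by aggregating the model's mask-position probabilities over that class's selected label tokens.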
To easily run our experiments, you can also use run_experiment.sh:
TASK=SST-2 BS=2 LR=1e-5 SEED=42 MODEL=roberta-large K=16 bash run_experiment.sh

To carry out experiments with multiple data splits and grid-search the hyperparameters, you can use the following command:
for seed in 13 21 42 87 100; do
for lr in 1e-5 2e-5 5e-5; do
for bs in 2 4 8; do
for k in 1 2 4 8 16; do
TASK=SST-2 \
BS=$bs \
LR=$lr \
SEED=$seed \
MODEL=roberta-large \
K=$k \
bash run_experiment.sh
done
done
done
done

We use the manual templates from LM-BFF. Additionally, you can design your own templates following this guide.
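As an illustration of the template format, a sentence-pair template could look like the line below. This particular template is only an example of ours (not one used in the paper), and *sent_1* is assumed to refer to the second sentence of an input pair, following the LM-BFF template syntax:

--template *cls**sent_0*?*mask*,_*sent_1**sep+*

Here the underscore encodes a space, *mask* marks the position whose predicted tokens are mapped to class labels, and *sep+* appends the separator token.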
Portions of the source code are based on the transformers and LM-BFF projects. We sincerely thank their authors for their contributions!
@inproceedings{wang2022automatic,
title={Automatic Multi-Label Prompting: Simple and Interpretable Few-Shot Classification},
author={Wang, Han and Xu, Canwen and McAuley, Julian},
booktitle={{NAACL} 2022},
year={2022}
}