valentinhofmann/geoadaptation

Geoadaptation of Language Models 🌍

This repository contains the code for the TACL 2024 paper Geographic Adaptation of Pretrained Language Models.

Usage

The main code for conducting geoadaptation with language models can be found in src/main_geoadaptation.py.

We also provide scripts to replicate all experiments from the paper:

  • To replicate the geoadaptation experiments, use the scripts src/start_geoadaptation.sh and src/start_geoadaptation_mtl.sh.
  • To replicate the experiments on fine-tuned geolocation prediction, use the scripts src/start_geolocation.sh and src/start_geolocation_mtl.sh.
  • To replicate the experiments on fine-tuned language identification, use the scripts src/start_id.sh and src/start_id_mtl.sh.
  • To replicate the experiments on zero-shot geolocation prediction, use the script src/start_zeroshot_geolocation.sh.
  • To replicate the experiments on zero-shot dialect feature prediction, use the script src/start_zeroshot_dialect.sh.
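The mapping from experiments to scripts listed above can be sketched as a small Python helper. This is only an illustrative sketch, not part of the repository: the experiment keys, the function names `build_commands` and `run_experiment`, and the `dry_run` flag are hypothetical, and the sketch assumes the scripts are invoked with `bash` from the repository root without extra arguments, which the README does not confirm.

```python
import subprocess

# Script paths exactly as listed in this README. The dictionary keys are
# hypothetical labels introduced only for this sketch.
EXPERIMENT_SCRIPTS = {
    "geoadaptation": [
        "src/start_geoadaptation.sh",
        "src/start_geoadaptation_mtl.sh",
    ],
    "geolocation": [
        "src/start_geolocation.sh",
        "src/start_geolocation_mtl.sh",
    ],
    "language_id": [
        "src/start_id.sh",
        "src/start_id_mtl.sh",
    ],
    "zeroshot_geolocation": ["src/start_zeroshot_geolocation.sh"],
    "zeroshot_dialect": ["src/start_zeroshot_dialect.sh"],
}


def build_commands(experiment):
    """Return one `bash <script>` command per script of the given experiment."""
    return [["bash", script] for script in EXPERIMENT_SCRIPTS[experiment]]


def run_experiment(experiment, repo_root=".", dry_run=True):
    """Print the commands for an experiment and optionally execute them.

    Assumption: the scripts need no arguments and are run from the
    repository root; check each script before disabling dry_run.
    """
    commands = build_commands(experiment)
    for command in commands:
        print(" ".join(command))
        if not dry_run:
            # check=True raises CalledProcessError if a script fails.
            subprocess.run(command, cwd=repo_root, check=True)
    return commands
```

Calling `run_experiment("geoadaptation")` only prints the two commands. Passing `dry_run=False` would execute them in order, stopping at the first script that exits with an error.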

Citation

If you use the code in this repository, please cite the following paper:

@article{hofmann2024geoadaptation,
    title = {Geographic Adaptation of Pretrained Language Models},
    author = {Hofmann, Valentin and Glavaš, Goran and Ljubešić, Nikola and Pierrehumbert, Janet and Schütze, Hinrich},
    journal = {Transactions of the Association for Computational Linguistics},
    volume = {12},
    pages = {411--431},
    year = {2024}
}
