Published May 27, 2023 | Version 1.0.0
Dataset Open

Webis Trigger Warning Corpus 2023

  • 1. Matti
  • 2. Magdalena
  • 3. Christopher
  • 4. Ole
  • 5. Benno
  • 6. Martin

Description

Abstract   A trigger warning or content warning is intended to enable individuals to make an informed decision about whether to expose themselves to potentially distressing content. We introduce trigger warning assignment as a multilabel classification task and create the Webis Trigger Warning Corpus (WTWC), the first dataset of 1 million fanfiction works from the Archive of Our Own with up to 36 different warnings per document. To provide a reliable catalog of trigger warnings, we carefully mapped institutionally recommended trigger warnings against the millions of free-form tags assigned by fanfiction authors and organized them into the first comprehensive taxonomy of trigger warnings.
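
As a rough illustration of what "multilabel" means here, each document can carry several warnings at once, so a common way to represent its labels is a fixed-length 0/1 vector with one position per warning. The sketch below is not taken from the corpus or its code: the four label names are hypothetical placeholders (the corpus defines its own set of 36 warnings), and `to_multi_hot` and `from_multi_hot` are illustrative helper names.

```python
# Hypothetical placeholder labels; the corpus defines its own 36 warnings.
LABELS = ["violence", "death", "abuse", "self-harm"]
LABEL_INDEX = {name: position for position, name in enumerate(LABELS)}


def to_multi_hot(warnings, label_index=LABEL_INDEX):
    """Encode a collection of warning names as a 0/1 vector.

    Names that are not in the label set are ignored.
    """
    vector = [0] * len(label_index)
    for warning in warnings:
        position = label_index.get(warning)
        if position is not None:
            vector[position] = 1
    return vector


def from_multi_hot(vector, labels=LABELS):
    """Decode a 0/1 vector back into the list of warning names it marks."""
    return [label for label, flag in zip(labels, vector) if flag]


if __name__ == "__main__":
    encoded = to_multi_hot(["death", "violence"])
    print(encoded)
    print(from_multi_hot(encoded))
```

A document without any warning maps to the all-zero vector, which is what distinguishes this setup from single-label classification, where exactly one class is always assigned.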

 

Code for dehydration   https://github.com/webis-de/ACL-23

 

Cite   

@InProceedings{wiegmann:2023a,
  address =               {Toronto, Canada},
  author =                {Matti Wiegmann and Magdalena Wolska and Christopher Schr{\"{o}}der and Ole Borchardt and Benno Stein and Martin Potthast},
  booktitle =             {Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)},
  month =                 jul,
  publisher =             {Association for Computational Linguistics},
  title =                 {{Trigger Warning Assignment as a Multi-Label Document Classification Problem}},
  year =                  2023
}

 

Files

Files (1.1 GB)

Checksum   md5:1df6bc20dde05d9f0b845dbab0ffcbc7
Size   1.1 GB
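
The MD5 checksum listed for the file can be used to confirm that a download completed without corruption. The following is a minimal sketch using only the Python standard library; the archive file name is a hypothetical placeholder, since the record does not state it here, and `md5_of_file` and `verify_download` are illustrative helper names.

```python
import hashlib
from pathlib import Path

# Checksum as listed on this record.
EXPECTED_MD5 = "1df6bc20dde05d9f0b845dbab0ffcbc7"


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in 1 MiB chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_download(path, expected=EXPECTED_MD5):
    """Return True if the file's MD5 digest equals the expected checksum."""
    return md5_of_file(path) == expected.lower()


if __name__ == "__main__":
    # Hypothetical file name; replace with the name of the downloaded archive.
    archive = Path("webis-trigger-warning-corpus.zip")
    if archive.exists():
        print("checksum OK" if verify_download(archive) else "checksum MISMATCH")
    else:
        print(f"{archive} not found; download it first")
```

Reading in chunks keeps memory use flat for a file of this size (1.1 GB). MD5 is suitable here as an integrity check against transfer errors, not as protection against deliberate tampering.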