Leveraging Large Language Models for Building Interpretable Rule-Based Data-to-Text Systems

This repository contains the code and resources for the paper titled "Leveraging Large Language Models for Building Interpretable Rule-Based Data-to-Text Systems" by Jędrzej Warczyński, Mateusz Lango, and Ondřej Dušek.

Paper Abstract

In this work, we introduce a straightforward approach that employs a large language model to automatically implement a fully interpretable rule-based data-to-text system in pure Python. Our experimental evaluation on the WebNLG dataset demonstrates that this system produces text of superior quality compared to zero-shot LLMs, as measured by the BLEU and BLEURT metrics. Moreover, it generates fewer hallucinations than the fine-tuned BART baseline. Additionally, the proposed method generates text significantly faster than neural approaches, requiring only a single CPU for processing.

Repository Contents

This repository contains:

Python implementation of the rule-based data-to-text system.
Scripts for experimental evaluation, including BLEU and BLEURT metrics.
Scripts for dataset augmenatation with graph clustering algorithm.

Link to the Paper

For a detailed explanation of our approach and findings, please refer to our paper: Leveraging Large Language Models for Building Interpretable Rule-Based Data-to-Text Systems.

Acknowledgements

This work was co-funded by the European Union (ERC, NG-NLG, 101039303).

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
src		src
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Leveraging Large Language Models for Building Interpretable Rule-Based Data-to-Text Systems

Paper Abstract

Repository Contents

Link to the Paper

Acknowledgements

About

Uh oh!

Releases

Packages

Languages

License

jwarczynski/RuLLeM

Folders and files

Latest commit

History

Repository files navigation

Leveraging Large Language Models for Building Interpretable Rule-Based Data-to-Text Systems

Paper Abstract

Repository Contents

Link to the Paper

Acknowledgements

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages