GitHub - UCSB-NLP-Chang/BTProp

Hallucination Detection with Belief Tree Propagation

This is the official implementation for the NAACL-2025 (main) paper, "A Probabilistic Framework for LLM Hallucination Detection via Belief Tree Propagation".

Requirements

The dependency packages can be found in requirements.txt file. One can use pip install -r requirements.txt to configure the environment. We use python 3.10 to run the experiments.

Running the experiments

The overall pipeline is: build the belief tree via prompting the LLM $\rightarrow$ prompt the LLM for its confidence score $\rightarrow$ compute the posterior probability $\rightarrow$ evaluate the performance. Before running experiments, you need to configure your OpenAI API key by setting the OPENAI_API_KEY environment variable.

Belief tree generation

python generate_belief_tree.py --dataset=wikibio --backbone=chatgpt

Use python generate_belief_tree.py --helpfull to see the choices for dataset and backbone.

By default, the generated belief trees will be stored at logs/belief_trees/{dataset}_{backbone}.json

Prompt the LLM for its confidence score Similarly, you can specify the dataset name and the backbone LLM used for the experiment in the command line:

python confidence_estimation.py --dataset=wikibio --backbone=chatgpt

By default, the generated belief trees will be stored at logs/conf_estimation/{dataset}_{backbone}.json

Use the NLI model to label the edge type (the relationship between a parent node and a child node)

python tools/label_edges.py --dataset=wikibio --backbone=chatgpt

Compute the posterior probabilities

python hmm_forward.py --dataset=wikibio --backbone=chatgpt

Performance evaluation

python tools/compute_metrics.py --dataset=wikibio --backbone=chatgpt

Citation

@article{hou2024probabilistic,
  title={A Probabilistic Framework for LLM Hallucination Detection via Belief Tree Propagation},
  author={Hou, Bairu and Zhang, Yang and Andreas, Jacob and Chang, Shiyu},
  journal={arXiv preprint arXiv:2406.06950},
  year={2024}
}```

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
configs		configs
logs		logs
src		src
tools		tools
.gitignore		.gitignore
README.md		README.md
confidence_estimation.py		confidence_estimation.py
generate_belief_tree.py		generate_belief_tree.py
hmm_forward.py		hmm_forward.py
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Hallucination Detection with Belief Tree Propagation

Requirements

Running the experiments

Citation

About

Uh oh!

Releases

Packages

Languages

UCSB-NLP-Chang/BTProp

Folders and files

Latest commit

History

Repository files navigation

Hallucination Detection with Belief Tree Propagation

Requirements

Running the experiments

Citation

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages