Language Models with Conformal Factuality Guarantees

This is the official repository for the paper Language Models with Conformal Factuality Guarantees by Christopher Mohri and Tatsunori Hashimoto.

Installation

Install the packages in requirements.txt and also run the pre-commit script pre-commit install.

Files

main.py: shows an example of running inference on one example
sayless.py: core functions necessary for inference (sub-claim splitting + merging)
factscore.py: code for FActScore dataset
nq.py: code for Natural Questions dataset
math_data.py: code for MATH dataset
calibrate_thresh.py: code for analyzing each dataset (producing sub-claims for annotation, computing thresholds, producing plots, generating new outputs)

Simple FActScore example

The script factscore.py will first dump a jsonl with sub-claims into out/factscore_subclaims.jsonl. Afterwards, the user will need to copy the file to data/factscore_annotations.jsonl and annotate the correctness entries in the json. Then, the user can compute conformal thresholds, produce any of the plots appearing in the paper, and generate conformally factual outputs by setting the appropriate fields in analyze_dataset. nq.py and math_data.py have the same functionality but for the Natural Questions and MATH datasets.

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
data		data
out		out
src		src
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Language Models with Conformal Factuality Guarantees

Installation

Files

Simple FActScore example

About

Releases

Packages

Languages

tatsu-lab/conformal-factual-lm

Folders and files

Latest commit

History

Repository files navigation

Language Models with Conformal Factuality Guarantees

Installation

Files

Simple FActScore example

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages