This repository provides an image captioning metric from our EMNLP-Findings 2021 paper QACE: Asking Questions to Evaluate an Image Caption .
Refer to https://github.com/hwanheelee1993/BUTD-UNITER-NLVR2
python 3.6.6
pip install -r requirements.txt
https://vqamodel.s3.us-east-2.amazonaws.com/t5vqa/ckpt.zip
unzip the file to "ckpt"
Refer to demo.ipynb
Refer to qace_demo.ipynb
Answer similarity computation code will be updated soon. (e.g. using BERTScore)
@misc{lee2021qace,
title={QACE: Asking Questions to Evaluate an Image Caption},
author={Hwanhee Lee and Thomas Scialom and Seunghyun Yoon and Franck Dernoncourt and Kyomin Jung},
year={2021},
eprint={2108.12560},
archivePrefix={arXiv},
primaryClass={cs.CL}
}