Implementation of the paper: Improving Image Captioning Evaluation by Considering Inter References Variance (ACL 2020)
This repo currently provides two metrics: 'with BERT' and 'simple'.
- with BERT: python3 run_metric.py
- simple: python3 run_metric_simple.py
The input data lives in example/example.json (you can modify this file for your own datasets).
Field descriptions (a loading sketch follows this list):
- "refs": reference captions (each sample 5 references)
- "cand": candidate caption (each sample 1 candidate)
- "refs_hid": contextual embeddings of reference captions
- "cand_hid": contextual embeddings of cand captions
- "mismatch": mismatches marks computed from all of reference captions
- "metric_result": scores on our metric
NOTE:
We also provide the Flickr 8K Expert Annotation file in our format: 'example/flickr.json'.
You can reproduce our result by following run_metric.py, lines 223-235.
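As a rough illustration of that reproduction step, the sketch below correlates the metric scores with human judgments using Kendall's tau. The field name "human_score" and the top-level list layout are assumptions, not the repo's actual schema; refer to example/flickr.json and run_metric.py lines 223-235 for the real procedure.

```python
import json
from scipy.stats import kendalltau

# ASSUMPTION: flickr.json is a list of dicts with "metric_result" and a
# human-judgment field here called "human_score"; check the file for the
# actual field names used in this repo.
with open("example/flickr.json", "r") as f:
    samples = json.load(f)

metric_scores = [s["metric_result"] for s in samples]
human_scores = [s["human_score"] for s in samples]  # hypothetical field name

tau, p_value = kendalltau(metric_scores, human_scores)
print(f"Kendall's tau vs. Flickr 8K expert judgments: {tau:.3f} (p={p_value:.3g})")
```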
Requirements:
- pytorch-pretrained-bert==0.6.2 (an old version of huggingface/transformers)
- torch==0.4.1
- bert_score==0.1.2 (already included in this repo)
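Assuming a pip-based environment, the first two pinned packages could be installed as follows (bert_score ships inside this repo, so it does not need to be installed separately):

```
pip install torch==0.4.1 pytorch-pretrained-bert==0.6.2
```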