REO-Relevance, Extraness, Omission: A Fine-grained Evaluation for Image Captioning

Ming Jiang; Junjie Hu; Qiuyuan Huang; Lei Zhang; Jana Diesner; Jianfeng Gao

doi:10.48550/arxiv.1909.02217

RDLNetworkEkosistem

Hakkımızda SSS

Giriş yap Başla

Hakkımızda SSS Gizlilik Şartlar İletişim

REO-Relevance, Extraness, Omission: A Fine-grained Evaluation for Image Captioning — Ming Jiang (2019) | RDL Network

Back

Home
Publications
REO-Relevance, Extraness, Omission: A Fine-grained Evaluation for Image Captioning

Shared by

Lei Zhang

REO-Relevance, Extraness, Omission: A Fine-grained Evaluation for Image Captioning

Preprint 2019 en

Authors

MJ
Ming Jiang
JH
Junjie Hu
QH
Qiuyuan Huang

Abstract

1 min read

Popular metrics used for evaluating image captioning systems, such as BLEU and CIDEr, provide a single score to gauge the system's overall effectiveness. This score is often not informative enough to indicate what specific errors are made by a given system. In this study, we present a fine-grained evaluation method REO for automatically measuring the performance of image captioning systems. REO assesses the quality of captions from three perspectives: 1) Relevance to the ground truth, 2) Extraness of the content that is irrelevant to the ground truth, and 3) Omission of the elements in the images and human references. Experiments on three benchmark datasets demonstrate that our method achieves a higher consistency with human judgments and provides more intuitive evaluation results than alternative metrics.

Discussion(0)

No comments yet. Be the first to comment.

Open reviews(0)

Public, signed peer feedback on this preprint.

No reviews yet.

Publication Info

DOI: 10.48550/arxiv.1909.02217
Year: 2019
Published: —
Language: en

Preprint Details

Link Of The Paper: http://arxiv.org/abs/1909.02217

Timeline

Created:June 19, 2026

Related publications

Article2023

IC3: Image Captioning by Committee Consensus

David W. Chan, Austin Myers, Sudheendra Vijayanarasimhan, David A. Ross, John F Canny

Preprint2023

IC3: Image Captioning by Committee Consensus

David M. Chan, Austin Myers, Sudheendra Vijayanarasimhan, David A. Ross, John F Canny

Preprint2024

Wolf: Dense Video Captioning with a World Summarization Framework

Boyi Li, Ligeng Zhu, Ran Tian, Shuhan Tan, Yuxiao Chen, Yao Lu, Yin Cui, Sushant Veer, Max Simon Ehrlich, Jonah Philion, Xinshuo Weng, Fuzhao Xue, Andrew Tao, Ming-Yu Liu, Sanja Fidler, Boris Ivanovic, Trevor Darrell, Jitendra Malik, Song Han, Marco Pavone

Article2023

CLAIR: Evaluating Image Captions with Large Language Models

David M. Chan, Suzanne Petryk, Joseph E. Gonzalez, Trevor Darrell, John F Canny

Preprint2023

CLAIR: Evaluating Image Captions with Large Language Models

David M. Chan, Suzanne Petryk, Joseph E. Gonzalez, Trevor Darrell, John F Canny