TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Factual Inconsistency Detection in Chart Captioning	CHOCOLATE	ChartVE	Kendall's Tau-c	0.178	# 1
Factual Inconsistency Detection in Chart Captioning	CHOCOLATE-FT	ChartVE	Kendall's Tau-c	0.215	# 2
Factual Inconsistency Detection in Chart Captioning	CHOCOLATE-LLM	ChartVE	Kendall's Tau-c	0.091	# 4
Factual Inconsistency Detection in Chart Captioning	CHOCOLATE-LVLM	ChartVE	Kendall's Tau-c	0.178	# 1

Do LVLMs Understand Charts? Analyzing and Correcting Factual Errors in Chart Captioning

15 Dec 2023 · Kung-Hsiang Huang, Mingyang Zhou, Hou Pong Chan, Yi R. Fung, Zhenhailong Wang, Lingyu Zhang, Shih-Fu Chang, Heng Ji

Recent advancements in large vision-language models (LVLMs) have led to significant progress in generating natural language descriptions for visual content and thus enhancing various applications. One issue with these powerful models is that they sometimes produce texts that are factually inconsistent with the visual input. While there has been some effort to mitigate such inconsistencies in natural image captioning, the factuality of generated captions for structured document images, such as charts, has not received as much scrutiny, posing a potential threat to information reliability in critical applications. This work delves into the factuality aspect by introducing a comprehensive typology of factual errors in generated chart captions. A large-scale human annotation effort provides insight into the error patterns and frequencies in captions crafted by various chart captioning models, ultimately forming the foundation of a novel dataset, CHOCOLATE. Our analysis reveals that even state-of-the-art models, including GPT-4V, frequently produce captions laced with factual inaccuracies. In response to this challenge, we establish the new task of Chart Caption Factual Error Correction and introduce CHARTVE, a model for visual entailment that outperforms proprietary and open-source LVLMs in evaluating factual consistency. Furthermore, we propose C2TFEC, an interpretable two-stage framework that excels at correcting factual errors. This work inaugurates a new domain in factual error correction for chart captions, presenting a novel evaluation mechanism, and demonstrating an effective approach to ensuring the factuality of generated chart captions.
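
The results reported for this paper use Kendall's Tau-c as the metric. As a rough illustration of what that number measures, the sketch below computes Stuart's Tau-c between a list of metric scores and a list of human factuality labels. The function name `kendall_tau_c` and the toy data are assumptions made for illustration and are not taken from the paper or its repository; the evaluation script the authors actually used may differ.

```python
from itertools import combinations


def kendall_tau_c(x, y):
    """Stuart's Tau-c: 2 * m * (C - D) / (n^2 * (m - 1)).

    C and D are the numbers of concordant and discordant pairs, n is the
    number of samples, and m is the smaller number of distinct values
    found in x and in y. Tied pairs count toward neither C nor D.
    """
    if len(x) != len(y):
        raise ValueError("x and y must have the same length")
    n = len(x)
    m = min(len(set(x)), len(set(y)))
    if n < 2 or m < 2:
        raise ValueError("need at least two samples and two distinct values")

    concordant = 0
    discordant = 0
    for (x1, y1), (x2, y2) in combinations(zip(x, y), 2):
        sign = (x1 - x2) * (y1 - y2)
        if sign > 0:
            concordant += 1
        elif sign < 0:
            discordant += 1

    return 2 * m * (concordant - discordant) / (n * n * (m - 1))


# Hypothetical toy data: consistency scores from some metric, and binary
# human labels (1 = caption judged factually consistent with the chart).
scores = [0.9, 0.2, 0.7, 0.1]
labels = [1, 0, 1, 0]
print(kendall_tau_c(scores, labels))  # 1.0
```

On the toy inputs, every pair of captions with different labels is ordered the same way by the scores, so the value is 1.0. With SciPy installed, `scipy.stats.kendalltau(x, y, variant="c")` should give the same statistic.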

Code

khuangaf/chocolate official

huggingface/transformers (124,984 GitHub stars)

Tasks

Factual Inconsistency Detection in Chart Captioning

Image Captioning

Visual Entailment

Datasets

Introduced in the Paper:

CHOCOLATE

Used in the Paper:

ChartQA

PlotQA

Chart-to-text

Results from the Paper

Ranked #1 on Factual Inconsistency Detection in Chart Captioning on CHOCOLATE-LVLM
