IFEval (Instruction Following Evaluation Dataset)

Introduced by Zhou et al. in "Instruction-Following Evaluation for Large Language Models"

This dataset evaluates the instruction-following ability of large language models. It contains more than 500 prompts built around verifiable instructions such as "write an article with more than 800 words" and "wrap your response with double quotation marks".
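
As a rough illustration of why instructions like these can be checked automatically, the two examples quoted above can be verified with a few lines of Python. This is a minimal sketch and not the benchmark's official checker: the function names, the whitespace-based word count, and the stripping of surrounding whitespace are all assumptions made for this example.

```python
from typing import Callable, Iterable


def has_more_than_n_words(response: str, n: int = 800) -> bool:
    """Return True if the response has more than n whitespace-separated words.

    Assumption: a "word" is any run of non-whitespace characters.
    """
    return len(response.split()) > n


def is_wrapped_in_double_quotes(response: str) -> bool:
    """Return True if the response, ignoring outer whitespace, starts and
    ends with a double quotation mark.
    """
    text = response.strip()
    return len(text) >= 2 and text.startswith('"') and text.endswith('"')


def follows_all(response: str, checks: Iterable[Callable[[str], bool]]) -> bool:
    """Return True only if every check accepts the response (hypothetical helper)."""
    return all(check(response) for check in checks)


# Usage: a quoted response of 801 words satisfies both example instructions.
response = '"' + " ".join(["word"] * 801) + '"'
print(follows_all(response, [has_more_than_n_words, is_wrapped_in_double_quotes]))  # True
```

Each check is a plain predicate over the response string, so scoring a prompt with several instructions reduces to running every predicate that applies to it.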

Benchmarks

Task	Dataset Variant	Best Model
Instruction Following	IFEval	GPT-4

Tasks

Instruction Following
Large Language Model

Similar Datasets

RewardBench

TheoremQA

MT-Bench

License

Unknown

Modalities

Texts

Languages

English
