Reasoning

127 benchmarks • 68 tasks • 180 datasets • 4184 papers with code

Classification

Classification

3240 papers with code

Text Classification

1104 papers with code

Graph Classification

380 papers with code

Audio Classification

131 papers with code

Medical Image Classification

122 papers with code

Question Answering

Question Answering

2883 papers with code

Open-Ended Question Answering

209 papers with code

Open-Domain Question Answering

195 papers with code

Conversational Question Answering

62 papers with code

Answer Selection

47 papers with code

Decision Making

Decision Making

2039 papers with code

Imitation Learning

520 papers with code

Natural Language Inference

Natural Language Inference

730 papers with code

Answer Generation

56 papers with code

Visual Entailment

27 papers with code

Cross-Lingual Natural Language Inference

16 papers with code

Logical Reasoning

Navigate

407 papers with code

Logical Reasoning

183 papers with code

Novel Concepts

51 papers with code

Temporal Sequences

51 papers with code

StrategyQA

13 papers with code

Multi-Label Classification

Multi-Label Classification

374 papers with code

Missing Labels

40 papers with code

Extreme Multi-Label Classification

29 papers with code

Medical Code Prediction

15 papers with code

Hierarchical Multi-label Classification

13 papers with code

General Reinforcement Learning

Offline RL

226 papers with code

Model-based Reinforcement Learning

195 papers with code

Conformal Prediction

147 papers with code

Text Simplification

117 papers with code

Music Source Separation

53 papers with code

Audio Source Separation

44 papers with code

Decision Making Under Uncertainty

44 papers with code

Common Sense Reasoning

Common Sense Reasoning

254 papers with code

Physical Commonsense Reasoning

6 papers with code

Riddle Sense

5 papers with code

Winowhy

4 papers with code

Anachronisms

3 papers with code

Visual Reasoning

Visual Reasoning

211 papers with code

Visual Commonsense Reasoning

29 papers with code

Program Synthesis

Program Synthesis

138 papers with code

Type prediction

42 papers with code

Program Repair

34 papers with code

Value prediction

15 papers with code

Enumerative Search

5 papers with code

Mathematical Reasoning

Mathematical Reasoning

114 papers with code

Math Word Problem Solving

61 papers with code

Formal Logic

11 papers with code

Geometry Problem Solving

8 papers with code

Abstract Algebra

3 papers with code

Video Question Answering

Video Question Answering

152 papers with code

Zero-Shot Video Question Answer

33 papers with code

Few-shot Video Question Answering

1 paper with code

Multi-Label Learning

Multi-Label Learning

81 papers with code

Missing Labels

40 papers with code

Mathematical Proofs

Automated Theorem Proving

70 papers with code

Mathematical Proofs

17 papers with code

Arithmetic Reasoning

Arithmetic Reasoning

70 papers with code

Program Repair

Program Repair

34 papers with code

Fault localization

15 papers with code

Variable misuse

9 papers with code

Exception type

2 papers with code

Function-docstring mismatch

1 paper with code

Math Word Problem Solving

Math Word Problem Solving

61 papers with code

Mathematical Question Answering

Math Word Problem Solving

61 papers with code

Systematic Generalization

Systematic Generalization

61 papers with code

Decision Making Under Uncertainty

Decision Making Under Uncertainty

44 papers with code

Uncertainty Visualization

3 papers with code

Video-based Generative Performance Benchmarking

Video-based Generative Performance Benchmarking (Consistency)

9 papers with code

Video-based Generative Performance Benchmarking (Contextual Understanding)

9 papers with code

Video-based Generative Performance Benchmarking (Correctness of Information)

9 papers with code

Video-based Generative Performance Benchmarking (Detail Orientation)

9 papers with code

Video-based Generative Performance Benchmarking (Temporal Understanding)

9 papers with code

Multimodal Reasoning

Multimodal Reasoning

37 papers with code

Natural Language Visual Grounding

Natural Language Visual Grounding

16 papers with code

Discrete Choice Models

Discrete Choice Models

14 papers with code

Generative Visual Question Answering

Video-based Generative Performance Benchmarking

13 papers with code

Causal Identification

Causal Identification

12 papers with code

Odd One Out

Odd One Out

10 papers with code

Geometry Problem Solving

Geometry Problem Solving

8 papers with code

Autonomous Navigation

Sequential Place Recognition

5 papers with code

Autonomous Flight (Dense Forest)

1 paper with code

Autonomous Web Navigation

Abstract Argumentation

Abstract Argumentation

4 papers with code

Analogical Similarity

Analogical Similarity

4 papers with code

Theory of Mind Modeling

Theory of Mind Modeling

4 papers with code

Anachronisms

Anachronisms

3 papers with code

Human Judgment Correlation

Human Judgment Correlation

3 papers with code

Human Judgment Classification

Human Judgment Classification

2 papers with code

Identify Odd Metaphor

Identify Odd Metaphor

2 papers with code

Commonsense Reasoning for RL

Commonsense Reasoning for RL

1 paper with code

Pre-election ratings estimation

Pre-election ratings estimation

1 paper with code