To analyze how these capabilities would mesh together in a natural conversation, and compare the performance of different architectures and training schemes.
28 PAPERS • 1 BENCHMARK
A large-scale collection of visually-grounded, task-oriented dialogues in English designed to investigate shared dialogue history accumulating during conversation.
10 PAPERS • NO BENCHMARKS YET
Taiga is a corpus, where text sources and their meta-information are collected according to popular ML tasks.
5 PAPERS • NO BENCHMARKS YET
The Metaphorical Connections dataset is a poetry dataset that contains annotations between metaphorical prompts and short poems. Each poem is annotated whether or not it successfully communicates the idea of the metaphorical prompt.
2 PAPERS • NO BENCHMARKS YET
ChatGPT Software Testing Study Dataset contains questions from a well-known software testing book by Ammann and Offutt. It uses all the textbook questions in Chapters 1 to 5 that have solutions available on the book’s official website. These solutions are made publicly available to help students learn. Questions that are not in the student solution are omitted because publishing our results might expose answers that the authors of the book do not intend to make public.
1 PAPER • NO BENCHMARKS YET
Finnish chat conversation corpus and includes unscripted conversations on seven topics from people of different ages.
Dataset Description Our dataset contains questions from a well-known software testing book Introduction to Software Testing 2nd Edition by Ammann and Offutt. We use all the text-book questions in Chapters 1 to 5 that have solutions available on the book’s official website.
0 PAPER • NO BENCHMARKS YET
The Customer Support on Twitter dataset is a large, modern corpus of tweets and replies to aid innovation in natural language understanding and conversational models, and for study of modern customer support practices and impact.