xCodeEval is one of the largest executable multilingual multitask benchmarks, covering 17 programming languages with execution-level parallelism. It features a total of seven tasks involving code understanding, generation, translation, and retrieval, and it employs execution-based evaluation instead of traditional lexical approaches. It also provides ExecEval, a test-case-based multilingual code execution engine that supports all the programming languages in xCodeEval.
6 PAPERS • NO BENCHMARKS YET
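As a rough illustration of what execution-based evaluation means in practice, a candidate program is judged by running it against test cases and comparing outputs rather than comparing tokens. This is only a minimal sketch: the command, test-case format, and verdict logic below are assumptions for illustration, not ExecEval's actual API or schema.

```python
import subprocess

def passes_test_cases(command, test_cases, timeout=5):
    """Run a candidate program against (stdin, expected_stdout) pairs.

    `command` is the invocation of the candidate solution
    (e.g. ["python3", "solution.py"]); the test-case format here is an
    assumption made for this sketch, not ExecEval's actual schema.
    """
    for stdin_text, expected in test_cases:
        try:
            result = subprocess.run(
                command,
                input=stdin_text,
                capture_output=True,
                text=True,
                timeout=timeout,
            )
        except subprocess.TimeoutExpired:
            return False  # time limit exceeded counts as a failure
        if result.returncode != 0:
            return False  # runtime error
        if result.stdout.strip() != expected.strip():
            return False  # wrong answer
    return True

# Example: the candidate must print the sum of two integers read from stdin.
tests = [("1 2\n", "3\n"), ("10 -4\n", "6\n")]
print(passes_test_cases(["python3", "solution.py"], tests))
```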
PyTorrent contains 218,814 Python package libraries from PyPI and the Anaconda environment. These sources were chosen because earlier studies have shown that much of the code found elsewhere is redundant, while Python packages from these environments are of better quality and well documented. PyTorrent enables users such as data scientists and students to build off-the-shelf machine learning models directly, without spending months of effort on large infrastructure.
4 PAPERS • NO BENCHMARKS YET
This dataset contains 304 manual evaluations of class-level software maintainability, drawn from five open-source projects: ArgoUML, Art of Illusion, Diary Management, JUnit 4, and JSweet. Each Java class is labelled along five axes: readability, understandability, complexity, modularity, and overall maintainability. Each class was assessed by several experts independently of its relation to other classes.
1 PAPER • NO BENCHMARKS YET
We introduce FixEval, a dataset for competitive-programming bug fixing along with a comprehensive test suite, and show the necessity of execution-based evaluation compared to suboptimal match-based evaluation metrics such as BLEU, CodeBLEU, Syntax Match, and Exact Match.
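To see why match-based metrics can be misleading for bug fixing, consider that two semantically identical fixes can score very differently under exact match while behaving identically under a test suite. The programs and the single test below are hypothetical, chosen only to make the contrast concrete; they are not drawn from FixEval.

```python
# Two candidate fixes for the same bug: identical behaviour, different tokens.
reference_fix = "def area(w, h):\n    return w * h\n"
candidate_fix = "def area(width, height):\n    return width * height\n"

# Match-based view: exact match fails even though the fix is correct.
print("exact match:", reference_fix == candidate_fix)  # False

# Execution-based view: both pass the same (hypothetical) test case.
def run_test(source):
    namespace = {}
    exec(source, namespace)          # define area() from the source string
    return namespace["area"](3, 4) == 12

print("reference passes:", run_test(reference_fix))  # True
print("candidate passes:", run_test(candidate_fix))  # True
```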
This repository holds two datasets: one with both the original binaries and the code sections extracted from them (“full dataset”), and one with only the code sections (“only code sections”). The code sections were extracted by carving out sections of the binary that were marked as executable. The binaries were scraped from Debian repositories.
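A minimal sketch of the kind of code-section carving described above, using pyelftools on ELF binaries. The dataset's actual extraction tooling is not specified here, so treating "marked as executable" as the SHF_EXECINSTR section flag is an assumption about one plausible approach.

```python
from elftools.elf.elffile import ELFFile
from elftools.elf.constants import SH_FLAGS

def extract_executable_sections(path):
    """Return {section_name: raw_bytes} for sections marked executable."""
    sections = {}
    with open(path, "rb") as f:
        elf = ELFFile(f)
        for section in elf.iter_sections():
            # SHF_EXECINSTR marks sections that contain machine instructions.
            if section["sh_flags"] & SH_FLAGS.SHF_EXECINSTR:
                sections[section.name] = section.data()
    return sections

# Example: carve the code sections (typically .text, .init, .plt) from a binary.
for name, data in extract_executable_sections("/usr/bin/ls").items():
    print(name, len(data), "bytes")
```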
TACO (Topics in Algorithmic COde generation dataset) is a dataset focused on algorithmic code generation, designed to provide a more challenging training set and evaluation benchmark for code generation models. It consists of programming competition problems that are more difficult and closer to real programming scenarios. It emphasizes improving or evaluating a model's understanding and reasoning abilities in practical application scenarios, rather than merely implementing predefined functions.
1 PAPER • 1 BENCHMARK