Benchmarking
1504 papers with code • 1 benchmark • 5 datasets
Most implemented papers
MMDetection: Open MMLab Detection Toolbox and Benchmark
In this paper, we introduce the various features of the MMDetection object detection toolbox.
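A minimal inference sketch, assuming MMDetection's v2.x Python API (`init_detector` / `inference_detector`); the config, checkpoint, and image paths are placeholder names from the model zoo, not files referenced on this page:

```python
# Run a model-zoo detector on one image with MMDetection's high-level API (v2.x style).
from mmdet.apis import init_detector, inference_detector

config_file = 'configs/faster_rcnn/faster_rcnn_r50_fpn_1x_coco.py'  # placeholder
checkpoint_file = 'checkpoints/faster_rcnn_r50_fpn_1x_coco.pth'     # placeholder

model = init_detector(config_file, checkpoint_file, device='cuda:0')
result = inference_detector(model, 'demo.jpg')  # per-class arrays of [x1, y1, x2, y2, score]
```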
Learning Transferable Visual Models From Natural Language Supervision
State-of-the-art computer vision systems are trained to predict a fixed set of predetermined object categories.
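A zero-shot classification sketch using the API from OpenAI's CLIP repository; the image path and candidate captions are illustrative assumptions:

```python
# Score an image against free-form text prompts with a pretrained CLIP model.
import torch
import clip
from PIL import Image

device = "cuda" if torch.cuda.is_available() else "cpu"
model, preprocess = clip.load("ViT-B/32", device=device)

image = preprocess(Image.open("dog.jpg")).unsqueeze(0).to(device)  # placeholder image
text = clip.tokenize(["a photo of a dog", "a photo of a cat"]).to(device)

with torch.no_grad():
    logits_per_image, _ = model(image, text)
    probs = logits_per_image.softmax(dim=-1)  # similarity of the image to each caption
print(probs)
```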
Fashion-MNIST: a Novel Image Dataset for Benchmarking Machine Learning Algorithms
We present Fashion-MNIST, a new dataset comprising 28x28 grayscale images of 70,000 fashion products from 10 categories, with 7,000 images per category.
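Because the dataset shares MNIST's exact format, existing MNIST loaders work unchanged; a sketch using `torchvision.datasets.FashionMNIST` (the download directory is an assumption):

```python
# Download Fashion-MNIST and inspect one sample via torchvision.
import torchvision
import torchvision.transforms as T

train_set = torchvision.datasets.FashionMNIST(
    root="./data", train=True, download=True, transform=T.ToTensor()
)
image, label = train_set[0]
print(image.shape, train_set.classes[label])  # torch.Size([1, 28, 28]), e.g. 'Ankle boot'
```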
CIDEr: Consensus-based Image Description Evaluation
We propose a novel paradigm for evaluating image descriptions that uses human consensus.
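A from-scratch toy of the consensus idea, not the published implementation: captions become TF-IDF vectors over n-grams, and a candidate is scored by its average cosine similarity to the human references. This sketch uses unigrams and a smoothed IDF only; the real metric averages n = 1..4 and adds stemming and a length penalty (CIDEr-D):

```python
# Toy CIDEr-style score: TF-IDF cosine similarity to reference captions.
from collections import Counter
import math

def tf(caption):
    return Counter(caption.lower().split())

def tfidf_vec(counts, doc_freq, num_images):
    # Smoothed IDF; the paper uses log(|I| / df) over the reference corpus.
    return {w: c * math.log((1.0 + num_images) / (1.0 + doc_freq[w]))
            for w, c in counts.items()}

def cosine(u, v):
    dot = sum(w * v.get(k, 0.0) for k, w in u.items())
    nu = math.sqrt(sum(w * w for w in u.values()))
    nv = math.sqrt(sum(w * w for w in v.values()))
    return dot / (nu * nv) if nu and nv else 0.0

def cider_1(candidate, references, doc_freq, num_images):
    c = tfidf_vec(tf(candidate), doc_freq, num_images)
    refs = [tfidf_vec(tf(r), doc_freq, num_images) for r in references]
    return sum(cosine(c, r) for r in refs) / len(refs)

# doc_freq counts how many images' reference sets mention each word.
refs_per_image = [["a dog runs on grass", "a brown dog running"],
                  ["a cat sleeps on a couch"]]
doc_freq = Counter(w for refs in refs_per_image
                   for w in {w for r in refs for w in r.split()})
print(cider_1("a dog running on grass", refs_per_image[0],
              doc_freq, len(refs_per_image)))
```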
The StarCraft Multi-Agent Challenge
In this paper, we propose the StarCraft Multi-Agent Challenge (SMAC) as a benchmark problem for cooperative multi-agent reinforcement learning.
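A random-agent rollout sketch following the API shown in the SMAC README; it assumes the `smac` package, StarCraft II, and the SMAC maps are installed:

```python
# One episode on the "8m" map with uniformly random legal actions per agent.
import numpy as np
from smac.env import StarCraft2Env

env = StarCraft2Env(map_name="8m")  # 8 Marines vs 8 Marines
n_agents = env.get_env_info()["n_agents"]

env.reset()
terminated = False
while not terminated:
    actions = []
    for agent_id in range(n_agents):
        avail = env.get_avail_agent_actions(agent_id)  # mask of legal actions
        actions.append(np.random.choice(np.nonzero(avail)[0]))
    reward, terminated, info = env.step(actions)
env.close()
```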
Benchmarking Graph Neural Networks
In the last few years, graph neural networks (GNNs) have become the standard toolkit for analyzing and learning from data on graphs.
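A from-scratch sketch of the graph-convolution building block such benchmarks compare; the paper's own pipeline builds on DGL, and `SimpleGCNLayer` here is an illustrative stand-in, not its API:

```python
# Mean-aggregation graph convolution over a dense adjacency matrix.
import torch
import torch.nn as nn

class SimpleGCNLayer(nn.Module):
    """h' = ReLU(W * mean of neighbor features, self-loop included)."""
    def __init__(self, in_dim, out_dim):
        super().__init__()
        self.linear = nn.Linear(in_dim, out_dim)

    def forward(self, h, adj):
        a = adj + torch.eye(adj.size(0), device=adj.device)  # add self-loops
        a = a / a.sum(dim=1, keepdim=True)                   # row-normalize
        return torch.relu(self.linear(a @ h))

h = torch.randn(5, 16)                       # 5 nodes, 16 features each
adj = (torch.rand(5, 5) > 0.5).float()       # random adjacency for the demo
print(SimpleGCNLayer(16, 32)(h, adj).shape)  # torch.Size([5, 32])
```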
Benchmarking Deep Reinforcement Learning for Continuous Control
Recently, researchers have made significant progress combining the advances in deep learning for learning feature representations with reinforcement learning.
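The paper's own suite is rllab; as a stand-in illustration of the same evaluation protocol (average return of a policy over episodes), here is a loop against the classic pre-0.26 Gym API with a random policy in place of a learned one:

```python
# Average undiscounted return of a random policy on a continuous-control task.
import gym

env = gym.make("Pendulum-v1")  # any continuous-action task works here
returns = []
for _ in range(10):
    obs = env.reset()          # pre-0.26 Gym: reset() returns obs only
    done, total = False, 0.0
    while not done:
        action = env.action_space.sample()  # stand-in for a learned policy
        obs, reward, done, info = env.step(action)
        total += reward
    returns.append(total)
print(sum(returns) / len(returns))
```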
Technical Report on the CleverHans v2.1.0 Adversarial Examples Library
An adversarial example library for constructing attacks, building defenses, and benchmarking both.
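A from-scratch sketch of FGSM, one of the attacks the library implements; it is written in plain PyTorch rather than against a specific CleverHans interface, since the library's API changed substantially after v2.x:

```python
# Fast Gradient Sign Method: one signed-gradient step of size eps.
import torch

def fgsm(model, x, y, eps, loss_fn=torch.nn.functional.cross_entropy):
    x = x.clone().detach().requires_grad_(True)
    loss = loss_fn(model(x), y)
    loss.backward()
    x_adv = x + eps * x.grad.sign()          # perturb toward higher loss
    return x_adv.clamp(0.0, 1.0).detach()    # keep pixels in the valid range
```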
AI Fairness 360: An Extensible Toolkit for Detecting, Understanding, and Mitigating Unwanted Algorithmic Bias
The toolkit's architectural design and abstractions enable researchers and developers to extend it with new algorithms and improvements, and to use it for performance benchmarking.
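A sketch of the toolkit's dataset/metric abstractions; the DataFrame, column names, and group definitions below are toy assumptions for illustration, not library data:

```python
# Measure group fairness of binary outcomes with AIF360's dataset metrics.
import pandas as pd
from aif360.datasets import BinaryLabelDataset
from aif360.metrics import BinaryLabelDatasetMetric

df = pd.DataFrame({
    "sex":   [0, 0, 1, 1, 1, 0],   # protected attribute (1 = privileged)
    "score": [1, 0, 1, 1, 0, 0],   # binary outcome label
})
dataset = BinaryLabelDataset(
    df=df, label_names=["score"], protected_attribute_names=["sex"],
    favorable_label=1, unfavorable_label=0,
)
metric = BinaryLabelDatasetMetric(
    dataset,
    privileged_groups=[{"sex": 1}],
    unprivileged_groups=[{"sex": 0}],
)
print(metric.statistical_parity_difference())  # 0 means parity between groups
```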
Benchmarking Neural Network Robustness to Common Corruptions and Perturbations
Alongside ImageNet-C for common corruptions, we propose a new dataset called ImageNet-P which enables researchers to benchmark a classifier's robustness to common perturbations.
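A from-scratch sketch of ImageNet-P's core statistic, the flip probability of a classifier's top-1 prediction across consecutive frames of a perturbation sequence (lower is more stable); `preds` is a hypothetical list of per-frame predictions:

```python
# Fraction of consecutive frame pairs where the top-1 prediction changes.
def flip_probability(preds):
    """preds: top-1 class indices over one perturbation sequence."""
    flips = sum(a != b for a, b in zip(preds, preds[1:]))
    return flips / (len(preds) - 1)

print(flip_probability([3, 3, 3, 7, 7, 3]))  # 2 flips over 5 transitions -> 0.4
```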