Object Recognition

486 papers with code • 7 benchmarks • 42 datasets

Object recognition is a computer vision technique for detecting + classifying objects in images or videos. Since this is a combined task of object detection plus image classification, the state-of-the-art tables are recorded for each component task here and here.

( Image credit: Tensorflow Object Detection API )

Benchmarks

Add a Result

These leaderboards are used to track progress in Object Recognition

Dataset	Best Model	Compare
MECCANO	Faster-RCNN	See all
CIFAR10-DVS	SSNN	See all
DVS128 Gesture	SSNN	See all
N-Caltech 101	SSNN	See all
ObjectNet (ImageNet classes, trained on ImageNet)	ObjectNet-Baseline	See all
ObjectNet (ImageNet classes)	ObjectNet-Baseline	See all
ObjectNet (All classes)	ObjectNet-Baseline	See all

Libraries

Use these libraries to find Object Recognition models and implementations

peymanbateni/simple-cnaps

3 papers

110

plai-group/simple-cnaps

3 papers

open-mmlab/mmdetection

2 papers

27,790

pytorch/vision

2 papers

15,438

See all 11 libraries.

Datasets

Subtasks

Most implemented papers

Most implemented Social Latest No code

Densely Connected Convolutional Networks

liuzhuang13/DenseNet • • CVPR 2017

Recent work has shown that convolutional networks can be substantially deeper, more accurate, and efficient to train if they contain shorter connections between layers close to the input and those close to the output.

143

Paper
Code

Going Deeper with Convolutions

worksheets/0xbcd424d2 • CVPR 2015

We propose a deep convolutional neural network architecture codenamed "Inception", which was responsible for setting the new state of the art for classification and detection in the ImageNet Large-Scale Visual Recognition Challenge 2014 (ILSVRC 2014).

Paper
Code

Striving for Simplicity: The All Convolutional Net

pytorch/captum • • 21 Dec 2014

Most modern convolutional neural networks (CNNs) used for object recognition are built using the same principles: Alternating convolution and max-pooling layers followed by a small number of fully connected layers.

Paper
Code

Microsoft COCO: Common Objects in Context

PaddlePaddle/PaddleDetection • • 1 May 2014

We present a new dataset with the goal of advancing the state-of-the-art in object recognition by placing the question of object recognition in the context of the broader question of scene understanding.

Paper
Code

Residual Attention Network for Image Classification

tengshaofeng/ResidualAttentionNetwork-pytorch • • CVPR 2017

In this work, we propose "Residual Attention Network", a convolutional neural network using attention mechanism which can incorporate with state-of-art feed forward network architecture in an end-to-end training fashion.

Paper
Code

Finding Tiny Faces

peiyunh/tiny • • CVPR 2017

We explore three aspects of the problem in the context of finding small faces: the role of scale invariance, image resolution, and contextual reasoning.

Paper
Code

Deep Predictive Coding Networks for Video Prediction and Unsupervised Learning

coxlab/prednet • • 25 May 2016

Here, we explore prediction of future frames in a video sequence as an unsupervised learning rule for learning about the structure of the visual world.

Paper
Code

Texture Synthesis Using Convolutional Neural Networks

leongatys/DeepTextures • NeurIPS 2015

Here we introduce a new model of natural textures based on the feature spaces of convolutional neural networks optimised for object recognition.

Paper
Code

Describing Textures in the Wild

deeplearning-wisc/knn-ood • • CVPR 2014

Patterns and textures are defining characteristics of many natural objects: a shirt can be striped, the wings of a butterfly can be veined, and the skin of an animal can be scaly.

Paper
Code

Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition

yhenon/keras-spp • • 18 Jun 2014

This requirement is "artificial" and may reduce the recognition accuracy for the images or sub-images of an arbitrary size/scale.

Paper
Code

Object Recognition

Benchmarks Add a Result

Libraries

Datasets

Subtasks

Most implemented papers

Content

Benchmarks

Add a Result