Neural Architecture Search
776 papers with code • 26 benchmarks • 27 datasets
Neural architecture search (NAS) is a technique for automating the design of artificial neural networks (ANNs), a widely used class of models in machine learning. NAS automates the process a human designer would otherwise carry out by hand, iteratively tweaking a network and learning what works well, in order to discover more complex architectures.
Image Credit: NAS with Reinforcement Learning
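At its core, any NAS method runs a search loop over a space of candidate architectures. The random-search sketch below illustrates that loop under simplifying assumptions: the search space, the `sample_architecture` helper, and the dummy `evaluate` function are all illustrative stand-ins, not part of any specific paper's method.

```python
# Minimal random-search sketch of a NAS loop: sample an architecture from a
# discrete search space, estimate its validation score, and keep the best.
import random

SEARCH_SPACE = {
    "num_layers": [4, 8, 12],
    "width": [32, 64, 128],
    "kernel_size": [3, 5, 7],
    "activation": ["relu", "swish"],
}

def sample_architecture(space):
    """Pick one value per decision, giving a single candidate network."""
    return {name: random.choice(choices) for name, choices in space.items()}

def evaluate(arch):
    """Placeholder for 'train the candidate and measure validation accuracy'."""
    # A real NAS system would train `arch` (fully or via a proxy task) and
    # return its validation metric; here we return a dummy score.
    return random.random()

best_arch, best_score = None, float("-inf")
for _ in range(20):                      # search budget (number of candidates)
    arch = sample_architecture(SEARCH_SPACE)
    score = evaluate(arch)
    if score > best_score:
        best_arch, best_score = arch, score

print("best architecture found:", best_arch)
```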
Libraries
Use these libraries to find Neural Architecture Search models and implementations.
Most implemented papers
Proximal Policy Optimization Algorithms
We propose a new family of policy gradient methods for reinforcement learning, which alternate between sampling data through interaction with the environment, and optimizing a "surrogate" objective function using stochastic gradient ascent.
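A minimal sketch of the clipped "surrogate" objective referenced above, assuming log-probabilities from the current and old policies plus advantage estimates are already available as tensors; the dummy inputs are only for illustration.

```python
import torch

def ppo_clip_loss(new_logp, old_logp, advantages, clip_eps=0.2):
    # Probability ratio r_t = pi_new(a|s) / pi_old(a|s)
    ratio = torch.exp(new_logp - old_logp)
    # Clipped surrogate: take the pessimistic (minimum) of the two terms
    unclipped = ratio * advantages
    clipped = torch.clamp(ratio, 1 - clip_eps, 1 + clip_eps) * advantages
    # Negate because optimizers minimize while PPO maximizes the surrogate
    return -torch.min(unclipped, clipped).mean()

# Usage with dummy data
new_logp = torch.randn(8, requires_grad=True)
old_logp = torch.randn(8)
adv = torch.randn(8)
loss = ppo_clip_loss(new_logp, old_logp, adv)
loss.backward()
```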
EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks
Convolutional Neural Networks (ConvNets) are commonly developed at a fixed resource budget, and then scaled up for better accuracy if more resources are available.
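The paper's answer to "how should a ConvNet be scaled up" is compound scaling: depth, width, and input resolution grow jointly with a single coefficient. A sketch is below; the coefficients are the ones reported in the paper, while the base depth/width/resolution values are illustrative placeholders rather than an actual EfficientNet configuration.

```python
# Compound scaling sketch: scale depth, width, and resolution together by phi.
ALPHA, BETA, GAMMA = 1.2, 1.1, 1.15   # depth, width, resolution factors

def compound_scale(phi, base_depth=18, base_width=32, base_resolution=224):
    """Return scaled (depth, width, resolution) for compound coefficient phi."""
    depth = round(base_depth * ALPHA ** phi)
    width = round(base_width * BETA ** phi)
    resolution = round(base_resolution * GAMMA ** phi)
    return depth, width, resolution

for phi in range(4):
    print(phi, compound_scale(phi))
```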
Searching for MobileNetV3
We achieve new state-of-the-art results for mobile classification, detection, and segmentation.
DARTS: Differentiable Architecture Search
This paper addresses the scalability challenge of architecture search by formulating the task in a differentiable manner.
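The differentiable formulation relaxes the discrete choice of operation on each edge into a softmax-weighted mixture over candidate operations, so architecture parameters can be trained with gradient descent. A minimal sketch, with placeholder candidate operations:

```python
import torch
import torch.nn as nn

class MixedOp(nn.Module):
    """One edge: a softmax-weighted mixture over candidate operations."""
    def __init__(self, channels):
        super().__init__()
        self.ops = nn.ModuleList([
            nn.Conv2d(channels, channels, 3, padding=1),  # 3x3 conv
            nn.Conv2d(channels, channels, 5, padding=2),  # 5x5 conv
            nn.Identity(),                                # skip connection
        ])
        # One learnable architecture parameter per candidate operation
        self.alpha = nn.Parameter(torch.zeros(len(self.ops)))

    def forward(self, x):
        weights = torch.softmax(self.alpha, dim=0)
        # Weighted sum over all candidates (the continuous relaxation);
        # after search, the op with the largest alpha would be kept.
        return sum(w * op(x) for w, op in zip(weights, self.ops))

x = torch.randn(1, 16, 8, 8)
print(MixedOp(16)(x).shape)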
Efficient Neural Architecture Search via Parameter Sharing
The controller is trained with policy gradient to select a subgraph that maximizes the expected reward on the validation set.
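A hedged sketch of training such a controller with a policy gradient (REINFORCE-style) update: the controller samples architecture decisions, the sampled subgraph is scored on the validation set, and the log-probability of the sampled decisions is scaled by the reward. The tiny logits-only controller and the dummy `validation_reward` are illustrative stand-ins, not the paper's RNN controller.

```python
import torch
import torch.nn as nn

NUM_DECISIONS, NUM_CHOICES = 4, 3          # e.g. 4 layers, 3 candidate ops each
controller_logits = nn.Parameter(torch.zeros(NUM_DECISIONS, NUM_CHOICES))
optimizer = torch.optim.Adam([controller_logits], lr=0.01)

def validation_reward(choices):
    # Stand-in for "evaluate the sampled subgraph on the validation set".
    return torch.rand(()).item()

baseline = 0.0
for step in range(100):
    dist = torch.distributions.Categorical(logits=controller_logits)
    choices = dist.sample()                       # one op index per decision
    reward = validation_reward(choices)
    baseline = 0.95 * baseline + 0.05 * reward    # moving-average baseline
    # REINFORCE: maximize E[(reward - baseline) * log p(choices)]
    loss = -(reward - baseline) * dist.log_prob(choices).sum()
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
```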
MnasNet: Platform-Aware Neural Architecture Search for Mobile
In this paper, we propose an automated mobile neural architecture search (MNAS) approach, which explicitly incorporates model latency into the main objective so that the search can identify a model that achieves a good trade-off between accuracy and latency.
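A sketch of how latency can be folded into the objective, assuming the soft-constraint reward form reported in the MnasNet paper, ACC(m) · [LAT(m)/T]^w; the target latency used here is an illustrative value, not the paper's measurement setup.

```python
def mnas_reward(accuracy, latency_ms, target_ms=80.0, w=-0.07):
    """Soft latency constraint: reward = ACC * (LAT / T) ** w."""
    return accuracy * (latency_ms / target_ms) ** w

# A model under the latency budget gets a small bonus; one over budget is penalized.
print(mnas_reward(accuracy=0.75, latency_ms=60.0))   # under budget
print(mnas_reward(accuracy=0.76, latency_ms=120.0))  # over budget
```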
ProxylessNAS: Direct Neural Architecture Search on Target Task and Hardware
We address the high memory consumption issue of differentiable NAS and reduce the computational cost (GPU hours and GPU memory) to the same level as regular training, while still allowing a large candidate set.
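One way memory can be kept at the level of regular training, sketched below under simplifying assumptions, is to activate only a single sampled candidate path per forward pass instead of executing every candidate operation on an edge; the candidate operations here are illustrative placeholders rather than the paper's search space.

```python
import torch
import torch.nn as nn

class SinglePathEdge(nn.Module):
    """Edge that samples one candidate op per forward pass to save memory."""
    def __init__(self, channels):
        super().__init__()
        self.ops = nn.ModuleList([
            nn.Conv2d(channels, channels, 3, padding=1),
            nn.Conv2d(channels, channels, 5, padding=2),
            nn.Identity(),
        ])
        self.alpha = nn.Parameter(torch.zeros(len(self.ops)))

    def forward(self, x):
        probs = torch.softmax(self.alpha, dim=0)
        # Sample one candidate; only its activations are materialized,
        # unlike a fully mixed op that runs every candidate.
        idx = torch.distributions.Categorical(probs).sample().item()
        return self.ops[idx](x)

x = torch.randn(1, 16, 8, 8)
print(SinglePathEdge(16)(x).shape)
```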
EfficientNetV2: Smaller Models and Faster Training
By pretraining on the same ImageNet21k, our EfficientNetV2 achieves 87.3% top-1 accuracy on ImageNet ILSVRC2012, outperforming the recent ViT by 2.0% accuracy while training 5x-11x faster using the same computing resources.
Progressive Neural Architecture Search
We propose a new method for learning the structure of convolutional neural networks (CNNs) that is more efficient than recent state-of-the-art methods based on reinforcement learning and evolutionary algorithms.
Learning Transferable Architectures for Scalable Image Recognition
In our experiments, we search for the best convolutional layer (or "cell") on the CIFAR-10 dataset and then apply this cell to the ImageNet dataset by stacking together more copies of it, each with its own parameters, to design a convolutional architecture named the "NASNet architecture".
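A minimal sketch of the cell-stacking idea described above: a single searched "cell" is defined once and then repeated, each copy with its own parameters, to build a deeper network for the larger dataset. The cell body below is a simple residual placeholder, not the actual NASNet cell, and the stem/head layers are illustrative.

```python
import torch
import torch.nn as nn

class Cell(nn.Module):
    """Placeholder for the searched convolutional cell."""
    def __init__(self, channels):
        super().__init__()
        self.body = nn.Sequential(
            nn.Conv2d(channels, channels, 3, padding=1),
            nn.BatchNorm2d(channels),
            nn.ReLU(),
        )

    def forward(self, x):
        return x + self.body(x)   # residual-style cell

def build_network(num_cells, channels=32, num_classes=1000):
    # Stacking more copies of the same cell (each with its own weights)
    # is how the searched architecture is scaled to a larger dataset.
    cells = [Cell(channels) for _ in range(num_cells)]
    return nn.Sequential(
        nn.Conv2d(3, channels, 3, padding=1),
        *cells,
        nn.AdaptiveAvgPool2d(1),
        nn.Flatten(),
        nn.Linear(channels, num_classes),
    )

net = build_network(num_cells=6)
print(net(torch.randn(1, 3, 32, 32)).shape)
```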