reinforcement-learning

We propose an algorithm for meta-learning that is model-agnostic, in the sense that it is compatible with any model trained with gradient descent and applicable to a variety of different learning problems, including classification, regression, and reinforcement learning.

Prioritized Experience Replay

labmlai/annotated_deep_learning_paper_implementations • 18 Nov 2015

Experience replay lets online reinforcement learning agents remember and reuse experiences from the past.
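
A minimal sketch of the idea, assuming a plain buffer with uniform sampling (not the prioritized scheme the paper proposes). The class name `ReplayBuffer` and its methods are illustrative and not taken from the listed repository.

```python
import random
from collections import deque


class ReplayBuffer:
    """Fixed-capacity store of transitions with uniform random sampling."""

    def __init__(self, capacity, seed=None):
        # A deque with maxlen drops the oldest transition once full.
        self.buffer = deque(maxlen=capacity)
        self.rng = random.Random(seed)

    def add(self, state, action, reward, next_state, done):
        # Remember one experience tuple.
        self.buffer.append((state, action, reward, next_state, done))

    def sample(self, batch_size):
        # Reuse past experiences: draw a batch without replacement.
        return self.rng.sample(list(self.buffer), batch_size)

    def __len__(self):
        return len(self.buffer)
```

A prioritized variant would replace the uniform draw in `sample` with one weighted by a per-transition priority.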

Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor

haarnoja/sac • ICML 2018

Model-free deep reinforcement learning (RL) algorithms have been demonstrated on a range of challenging decision making and control tasks.

Dueling Network Architectures for Deep Reinforcement Learning

labmlai/annotated_deep_learning_paper_implementations • 20 Nov 2015

In recent years there have been many successes of using deep representations in reinforcement learning.

Asynchronous Methods for Deep Reinforcement Learning

ray-project/ray • 4 Feb 2016

We propose a conceptually simple and lightweight framework for deep reinforcement learning that uses asynchronous gradient descent for optimization of deep neural network controllers.

Addressing Function Approximation Error in Actor-Critic Methods

sfujim/TD3 • ICML 2018

In value-based reinforcement learning methods such as deep Q-learning, function approximation errors are known to lead to overestimated value estimates and suboptimal policies.
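
The overestimation effect can be illustrated with a small simulation. This is a sketch under stated assumptions: every action has true value 0 and each estimate carries independent zero-mean Gaussian noise. The function name `mean_max_estimate` and its parameters are hypothetical and do not come from the paper or the listed repository.

```python
import random


def mean_max_estimate(num_actions, noise_std, trials, seed=0):
    """Average of max_a Q(a) when every true Q(a) is 0 and estimates are noisy."""
    rng = random.Random(seed)
    total = 0.0
    for _ in range(trials):
        # Each estimate is the true value (0) plus zero-mean noise.
        estimates = [rng.gauss(0.0, noise_std) for _ in range(num_actions)]
        # Taking the max over noisy estimates, as a Q-learning target does.
        total += max(estimates)
    return total / trials
```

With a single action the average stays near the true value of 0. With several actions it is clearly positive, because the maximum selects whichever estimate happens to have the largest positive error.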
