Deep Reinforcement Learning Agents

This repository contains a collection of reinforcement learning algorithms written in TensorFlow. The IPython notebooks here were written to accompany an ongoing tutorial series I am publishing on Medium. If you are new to reinforcement learning, I recommend reading the accompanying post for each algorithm.

The repository currently contains the following algorithms:

Q-Table - An implementation of Q-learning using tables to solve a stochastic environment problem.
Q-Network - A neural network implementation of Q-Learning to solve the same environment as in Q-Table.
Simple-Policy - An implementation of a policy-gradient method for stateless environments such as n-armed bandit problems.
Contextual-Policy - An implementation of a policy-gradient method for stateful environments such as contextual bandit problems.
Policy-Network - An implementation of a neural network policy-gradient agent that solves full RL problems with states, delayed rewards, and two opposite actions (e.g. CartPole or Pong).
Vanilla-Policy - An implementation of a neural network vanilla-policy-gradient agent that solves full RL problems with states, delayed rewards, and an arbitrary number of actions.
Model-Network - An addition to the Policy-Network algorithm that includes a separate network to model the environment dynamics.
Double-Dueling-DQN - An implementation of a Deep-Q Network with the Double DQN and Dueling DQN additions to improve stability and performance.
Deep-Recurrent-Q-Network - An implementation of a Deep Recurrent Q-Network which can solve reinforcement learning problems involving partial observability.
Q-Exploration - An implementation of DQN containing multiple action-selection strategies for exploration. Strategies include greedy, random, epsilon-greedy, Boltzmann, and Bayesian Dropout.
A3C-Doom - An implementation of the Asynchronous Advantage Actor-Critic (A3C) algorithm, which uses multiple agents to collectively improve a policy. This implementation can solve RL problems in 3D environments such as VizDoom challenges.
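
For readers who want a feel for the simplest entries in the list before opening the notebooks, here is a minimal sketch of tabular Q-learning with two of the exploration strategies named above (epsilon-greedy and Boltzmann). It is not the code from the notebooks: it uses plain Python instead of TensorFlow, and the five-cell corridor environment, the `slip` probability, and every function name below are assumptions made up for illustration.

```python
import math
import random

N_STATES = 5          # hypothetical corridor: states 0..4, goal at state 4
LEFT, RIGHT = 0, 1


def step(state, action, slip, rng):
    """Move one cell; with probability `slip` the chosen move is reversed."""
    if rng.random() < slip:
        action = 1 - action
    next_state = max(0, state - 1) if action == LEFT else state + 1
    done = next_state == N_STATES - 1
    return next_state, (1.0 if done else 0.0), done


def epsilon_greedy(q_row, epsilon, rng):
    """Random action with probability epsilon, otherwise a greedy one."""
    if rng.random() < epsilon:
        return rng.randrange(len(q_row))
    best = max(q_row)
    # Break ties at random so an all-zero table does not always pick action 0.
    return rng.choice([a for a, q in enumerate(q_row) if q == best])


def boltzmann_probs(q_row, temperature):
    """Softmax over Q-values; higher temperature means more exploration."""
    top = max(q_row)  # subtract the max for numerical stability
    weights = [math.exp((q - top) / temperature) for q in q_row]
    total = sum(weights)
    return [w / total for w in weights]


def q_update(q_table, s, a, r, s_next, done, lr, gamma):
    """One Q-learning step: move Q(s, a) toward r + gamma * max_a' Q(s', a')."""
    target = r if done else r + gamma * max(q_table[s_next])
    q_table[s][a] += lr * (target - q_table[s][a])


def train(episodes=500, lr=0.1, gamma=0.9, epsilon=0.1, slip=0.2, seed=0):
    """Run epsilon-greedy Q-learning on the corridor and return the Q-table."""
    rng = random.Random(seed)
    q_table = [[0.0, 0.0] for _ in range(N_STATES)]
    for _ in range(episodes):
        state = 0
        for _ in range(100):  # cap the episode length
            action = epsilon_greedy(q_table[state], epsilon, rng)
            next_state, reward, done = step(state, action, slip, rng)
            q_update(q_table, state, action, reward, next_state, done, lr, gamma)
            state = next_state
            if done:
                break
    return q_table


if __name__ == "__main__":
    for state, row in enumerate(train()):
        print(state, [round(q, 3) for q in row])
```

To try Boltzmann exploration instead, one option is to replace the `epsilon_greedy` call with `rng.choices([LEFT, RIGHT], weights=boltzmann_probs(q_table[state], temperature))[0]`, where `temperature` is a value you choose.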

Owner

Arthur Juliani
