Deeprl - Standard DQN and dueling network for simple games

Last update: Apr 12, 2020

Overview

DeepRL

This code implements the standard deep Q-learning and dueling network with experience replay (memory buffer) for playing simple games.

DQN algorithm implemented in this code is from the Google DeepMind's paper Playing Atari with Deep Reinforcement Learning[link].

Dueling network is from the paper Dueling Network Architectures for Deep Reinforcement Learning [link]

Requirement

DeepRL is implemented with Torch and the packages of its ecosystem. This code is well worked on my Mac Pro with CPU (I haven't tested it on Linux and GPU). Install Torch7 firstly, then you should install the following packages by luarocks

luarocks install nn
luarocks install image
luarocks install qt
luarocks install optim

Running

You can run this code by tapping the command in the project dir.

qlua main.lua

The result looks like

DQN: I got the accuracy of 93.2% (932 success of 1000 epochs).

Dueling: I got the accuracy of 99.2% (992 success of 1000 epochs).

Code

The envir.lua indicates the environment in reinforcement learning stage, which receives the action and produces the states and a reward for agent.

The agent.lua is the implementation of agent which receives the states and reward to produce the action directed by the policy network.

The learner.lua is the learning algorithm of DQN with experience replay as the following.

MISC

I completed this code when I was an intern at Horizon Robotics. I will greatly thank the article of Andrej Karpathy and other implementations:SeanNaren's code and EderSantana's gist.

LICENSE

MIT

Deeprl - Standard DQN and dueling network for simple games

Related tags

Overview

DeepRL

Requirement

Running

Code

MISC

LICENSE

Owner

Yao Zhou

Why Are You Weird? Infusing Interpretability in Isolation Forest for Anomaly Detection

A framework for attentive explainable deep learning on tabular data

Matching python environment code for Lux AI 2021 Kaggle competition, and a gym interface for RL models.

A machine learning library for spiking neural networks. Supports training with both torch and jax pipelines, and deployment to neuromorphic hardware.

CurriculumNet: Weakly Supervised Learning from Large-Scale Web Images

WSDM2022 Challenge - Large scale temporal graph link prediction

[CVPR2021 Oral] UP-DETR: Unsupervised Pre-training for Object Detection with Transformers

Decentralized Reinforcment Learning: Global Decision-Making via Local Economic Transactions (ICML 2020)

[ICML 2020] DrRepair: Learning to Repair Programs from Error Messages

TabNet for fastai

[ICCV'2021] Image Inpainting via Conditional Texture and Structure Dual Generation

Code for a real-time distributed cooperative slam(RDC-SLAM) system for ROS compatible platforms.

Tutorials, assignments, and competitions for MIT Deep Learning related courses.

Semi-supervised learning for object detection

Learning Skeletal Articulations with Neural Blend Shapes

Flickr-Faces-HQ (FFHQ) is a high-quality image dataset of human faces, originally created as a benchmark for generative adversarial networks (GAN)

[NeurIPS 2021] A weak-shot object detection approach by transferring semantic similarity and mask prior.

NAS-Bench-x11 and the Power of Learning Curves

Repo for flood prediction using LSTMs and HAND

TVNet: Temporal Voting Network for Action Localization