PyTorch implementation of Munchausen Reinforcement Learning based on DQN and SAC. Handles discrete and continuous action spaces.

Last update: Mar 10, 2022

Overview

Exploring Munchausen Reinforcement Learning

This is my team's project repository for the "Advanced Deep Learning for Robotics" course at TUM. Our project's topic is "Exploring Munchausen Reinforcement Learning", based on this paper.

For a detailed discussion, see the report and the final presentation.

Setup

Create a virtual environment and activate it.
Run pip3 install -r requirements.txt inside the environment to install the dependencies.

Code Structure

This repository is structured as follows:

The directories M-DQN and M-SAC contain the implementations of the RL agents DQN and SAC extended with the Munchausen term, respectively.
The directory rl-baselines3-zoo contains a copy of the upstream rl-baselines3-zoo repository, to which we added our implementation of M-DQN so that the M-DQN agent can easily be trained and tested on benchmark environments and compared with other classical agents. To do so, follow the steps described in the original repository and pass M-DQN as the agent argument.
The directory particles-env contains a modified version of this repository. The modified version contains code for a particles environment in which an agent tries to reach a goal while avoiding obstacles. It also includes an implementation of the M-SAC agent, so that M-SAC can be trained and compared with the classical SAC agent.
The directory action-gap contains callbacks for the experiment manager of rl-baselines3-zoo that log the action gap to TensorBoard.
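
To make the two quantities named above more concrete, here is a minimal sketch of the Munchausen-augmented DQN target and of the action gap. It is not taken from the M-DQN or action-gap directories. The function names, argument names and tensor shapes are assumptions made for illustration. The default values for tau, alpha and clip_min follow the settings commonly cited from the Munchausen RL paper. The action gap is assumed to be the difference between the largest and second-largest Q-value in a state.

```python
import torch
import torch.nn.functional as F


def munchausen_dqn_target(q_target_s, q_target_next, actions, rewards, dones,
                          gamma=0.99, tau=0.03, alpha=0.9, clip_min=-1.0):
    """Sketch of an M-DQN regression target (hypothetical helper).

    q_target_s:    target-network Q-values for the current states, shape (B, A)
    q_target_next: target-network Q-values for the next states, shape (B, A)
    actions:       taken actions, shape (B,), dtype long
    rewards:       rewards, shape (B,)
    dones:         1.0 where the episode ended, else 0.0, shape (B,)
    """
    # Softmax policy implied by the target network: pi = softmax(q / tau).
    log_pi = F.log_softmax(q_target_s / tau, dim=1)
    log_pi_a = log_pi.gather(1, actions.unsqueeze(1)).squeeze(1)

    # Munchausen term: scaled, clipped log-probability of the taken action.
    munchausen = alpha * torch.clamp(tau * log_pi_a, min=clip_min, max=0.0)

    # Soft (entropy-regularised) value of the next state.
    log_pi_next = F.log_softmax(q_target_next / tau, dim=1)
    pi_next = log_pi_next.exp()
    soft_value = (pi_next * (q_target_next - tau * log_pi_next)).sum(dim=1)

    return rewards + munchausen + gamma * (1.0 - dones) * soft_value


def action_gap(q_values):
    """Difference between the best and second-best Q-value per state."""
    top2 = q_values.topk(2, dim=1).values
    return top2[:, 0] - top2[:, 1]


if __name__ == "__main__":
    # Random batch of 4 transitions with 3 discrete actions (illustrative only).
    torch.manual_seed(0)
    q_s, q_next = torch.randn(4, 3), torch.randn(4, 3)
    a = torch.randint(0, 3, (4,))
    r, d = torch.ones(4), torch.zeros(4)
    print(munchausen_dqn_target(q_s, q_next, a, r, d))
    print(action_gap(q_s))
```

In a training loop, the online network's Q-value for the taken action would be regressed onto this target, and a callback could log the mean of action_gap over a batch of states. How the repository actually wires this up may differ.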

Owner

Mohamed Amine Ketata
