A PyTorch implementation of the multi-agent deep deterministic policy gradients (MADDPG) algorithm

Last update: Dec 28, 2022

Overview

Multi-Agent-Deep-Deterministic-Policy-Gradients

This is my implementation of the algorithm presented in the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments". You can find the paper here: https://arxiv.org/pdf/1706.02275.pdf
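For orientation, the centralized critic described in the paper is trained toward a one-step temporal-difference target, y = r_i + gamma * Q'_i(x', a'_1, ..., a'_N), where the next actions come from the target policies. The sketch below computes that target for a batch in plain Python. The function name `td_targets`, the `dones` mask for terminal transitions, and the default discount of 0.95 are illustrative assumptions, not code taken from this repo.

```python
from typing import List, Sequence


def td_targets(
    rewards: Sequence[float],
    next_q_values: Sequence[float],
    dones: Sequence[bool],
    gamma: float = 0.95,  # assumed discount factor, adjust as needed
) -> List[float]:
    """Compute y = r + gamma * Q'(x', a') for each transition in a batch.

    next_q_values are assumed to come from the target critic evaluated on
    the next state and the target actors' actions. Terminal transitions
    (done=True) do not bootstrap, so their target is just the reward.
    """
    if not (len(rewards) == len(next_q_values) == len(dones)):
        raise ValueError("rewards, next_q_values and dones must have equal length")
    # Build a new list instead of overwriting next_q_values in place.
    return [
        r + gamma * q * (0.0 if done else 1.0)
        for r, q, done in zip(rewards, next_q_values, dones)
    ]
```

The critic loss would then be the mean squared error between these targets and the critic's current estimates for the sampled states and joint actions.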

You will need to install the Multi Agent Particle Environment (MAPE), which you can find here: https://github.com/openai/multiagent-particle-envs

Make sure to create a virtual environment with the dependencies for the MAPE, since they are somewhat out of date. I also recommend running this with PyTorch version 1.4.0, as version 1.8 (the latest at the time of writing) seems to have an issue with an in-place operation I use in the calculation of the critic loss.
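If you want the script to warn early about a mismatched PyTorch install, a small version check can help. The following is a minimal sketch using only the standard library for the parsing; the helper names `parse_version`, `matches_recommended`, and `check_installed_torch` are hypothetical and not part of this repo.

```python
import re
from typing import Tuple

RECOMMENDED = (1, 4, 0)  # the PyTorch version recommended above


def parse_version(version: str) -> Tuple[int, int, int]:
    """Turn '1.4.0' or '1.8.1+cu111' into a (major, minor, patch) tuple."""
    match = re.match(r"(\d+)\.(\d+)(?:\.(\d+))?", version)
    if match is None:
        raise ValueError(f"unrecognized version string: {version!r}")
    major, minor, patch = match.groups()
    # A missing patch component (e.g. '1.4') is treated as 0.
    return int(major), int(minor), int(patch or 0)


def matches_recommended(version: str) -> bool:
    """Return True when the version equals the recommended release."""
    return parse_version(version) == RECOMMENDED


def check_installed_torch() -> str:
    """Report whether the installed PyTorch matches the recommendation."""
    try:
        import torch  # imported lazily so the helpers work without PyTorch
    except ImportError:
        return "PyTorch is not installed in this environment."
    if matches_recommended(str(torch.__version__)):
        return f"PyTorch {torch.__version__} matches the recommended version."
    return (
        f"PyTorch {torch.__version__} found; 1.4.0 is recommended because "
        "newer releases may reject the in-place operation in the critic loss."
    )
```

Calling `print(check_installed_torch())` at the top of the main script would surface the mismatch before training starts.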

It's probably easiest to just clone this repo into the same directory as the MAPE, as the main file requires the make_env function from that package.
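If you would rather keep the two repos in separate directories, one option is to put the MAPE checkout on `sys.path` before importing `make_env`. This is a sketch under the assumption that the MAPE checkout keeps `make_env.py` at its top level; the helper name `add_mape_to_path` and the example path are hypothetical.

```python
import sys
from pathlib import Path
from typing import Union


def add_mape_to_path(mape_dir: Union[str, Path]) -> Path:
    """Make `from make_env import make_env` resolvable from a MAPE checkout.

    Raises FileNotFoundError if make_env.py is not found in mape_dir.
    """
    mape_path = Path(mape_dir).expanduser().resolve()
    if not (mape_path / "make_env.py").is_file():
        raise FileNotFoundError(f"make_env.py not found in {mape_path}")
    path_str = str(mape_path)
    # Avoid adding the same directory twice on repeated calls.
    if path_str not in sys.path:
        sys.path.insert(0, path_str)
    return mape_path


# Example usage (the path and scenario name are placeholders):
# add_mape_to_path("../multiagent-particle-envs")
# from make_env import make_env
# env = make_env("simple_adversary")
```

Cloning this repo into the MAPE directory, as suggested above, avoids the need for any path handling.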

The video for this tutorial is found here: https://youtu.be/tZTQ6S9PfkE

Owner

Phil Tabor
