TeachMyAgent is a testbed platform for Automatic Curriculum Learning methods in Deep RL.

Last update: Dec 25, 2022

Overview

TeachMyAgent: a Benchmark for Automatic Curriculum Learning in Deep RL

TeachMyAgent is a testbed platform for Automatic Curriculum Learning (ACL) methods. We use procedurally generated Box2D environments to assess the performance of teacher algorithms in continuous task spaces. Our repository provides:

Two parametric Box2D environments: Stumps Tracks and Parkour
Multiple embodiments with different locomotion skills (e.g. bipedal walker, spider, climbing chimpanzee, fish)
Two Deep RL students: SAC and PPO
Several ACL algorithms: ADR, ALP-GMM, Covar-GMM, SPDL, GoalGAN, Setter-Solver, RIAC
Two benchmark experiments built from the elements above: a skill-specific comparison and a global performance assessment
Three notebooks for systematic analysis of results, using statistical tests along with visualization tools (plots, videos...) that make it possible to reproduce our figures

See our documentation for an exhaustive list.

Using these components, we benchmarked the ACL methods listed above; the results are reported in our paper. Additional visualizations are available on our website.

Installation

1- Get the repository

git clone https://github.com/flowersteam/TeachMyAgent
cd TeachMyAgent/

2- Install it, using Conda for example (use Python >= 3.6)

conda create --name teachMyAgent python=3.6
conda activate teachMyAgent
pip install -e .

Note: For Windows users, add -f https://download.pytorch.org/whl/torch_stable.html to the pip install -e . command.
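As a quick sanity check before running pip install -e ., the following minimal sketch verifies that the active interpreter satisfies the Python >= 3.6 requirement stated above. The helper name meets_minimum and the constant MIN_PYTHON are illustrative assumptions, not part of TeachMyAgent.

```python
import sys

# Minimum version taken from the installation step above (Python >= 3.6).
MIN_PYTHON = (3, 6)


def meets_minimum(version_info, minimum=MIN_PYTHON):
    """Return True if the (major, minor) part of version_info is at least `minimum`.

    Hypothetical helper for illustration; it is not provided by TeachMyAgent.
    """
    return tuple(version_info[:2]) >= tuple(minimum)


if __name__ == "__main__":
    if meets_minimum(sys.version_info):
        print("Python version is recent enough for the installation step.")
    else:
        print("Python >= 3.6 is required; activate a newer environment first.")
```

Running this inside the activated Conda environment confirms that the environment, rather than the system interpreter, is the one being used.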

Import baseline results from our paper

To benchmark methods against the ones we evaluated in our paper, you must first download our results:

Go to the notebooks folder
Make the download_baselines.sh script executable: chmod +x download_baselines.sh
Download results: ./download_baselines.sh

WARNING: This will download a zip file weighing approximately 4.5GB. Our script will then extract the zip file into TeachMyAgent/data. Once extracted, the results will take up approximately 15GB.
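Given these sizes, it may be worth checking free disk space before starting the download. The sketch below uses only the Python standard library. It assumes, conservatively, that the archive and the extracted results coexist on disk at peak (roughly 4.5GB + 15GB); whether the script deletes the zip after extraction is not stated here. The helper name has_enough_space is hypothetical and not part of TeachMyAgent.

```python
import shutil

# Decimal gigabyte, matching the approximate sizes quoted in the warning.
GB = 10 ** 9


def has_enough_space(path, required_gb):
    """Return True if the filesystem containing `path` has at least `required_gb` GB free.

    Hypothetical helper for illustration; it is not provided by TeachMyAgent.
    """
    free_bytes = shutil.disk_usage(path).free
    return free_bytes >= required_gb * GB


if __name__ == "__main__":
    # Assumed peak usage: ~4.5 GB archive plus ~15 GB of extracted results.
    required = 4.5 + 15
    if has_enough_space(".", required):
        print("Enough free space for the baseline results.")
    else:
        print(f"Less than {required} GB free; free up space before downloading.")
```

Run it from the directory (or drive) where TeachMyAgent/data will live, since shutil.disk_usage reports on the filesystem that contains the given path.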

Usage

See our documentation for details on how to use our platform to benchmark ACL methods.

Development

See CONTRIBUTING.md for details.

Citing

If you use TeachMyAgent in your work, please cite the accompanying paper:

@inproceedings{romac2021teachmyagent,
  author    = {Cl{\'{e}}ment Romac and
               R{\'{e}}my Portelas and
               Katja Hofmann and
               Pierre{-}Yves Oudeyer},
  title     = {TeachMyAgent: a Benchmark for Automatic Curriculum Learning in Deep
               {RL}},
  booktitle = {Proceedings of the 38th International Conference on Machine Learning,
               {ICML} 2021, 18-24 July 2021, Virtual Event},
  series    = {Proceedings of Machine Learning Research},
  volume    = {139},
  pages     = {9052--9063},
  publisher = {{PMLR}},
  year      = {2021}
}

Owner

Flowers Team
