Proto-RL: Reinforcement Learning with Prototypical Representations

Last update: Dec 06, 2022

Overview

Proto-RL: Reinforcement Learning with Prototypical Representations

This is a PyTorch implementation of Proto-RL from

Reinforcement Learning with Prototypical Representations by

Denis Yarats, Rob Fergus, Alessandro Lazaric, Lerrel Pinto.

[Paper]

Citation

If you use this repo in your research, please consider citing the paper as follows

@article{yarats2021proto,
    title={Reinforcement Learning with Prototypical Representations},
    author={Denis Yarats and Rob Fergus and Alessandro Lazaric and Lerrel Pinto},
    year={2021},
    eprint={2102.11271},
    archivePrefix={arXiv},
    primaryClass={cs.ML}
}

Requirements

We assume you have access to a gpu that can run CUDA 11. Then, the simplest way to install all required dependencies is to create an anaconda environment by running

conda env create -f conda_env.yml

After the instalation ends you can activate your environment with

conda activate proto

Instructions

In order to pretrain the agent you need to specify the number of task-agnostic environment steps by setting num_expl_steps, after that many steps, the agent will start receving the downstream task reward until it takes num_train_steps in total. For example, to pre-train the Proto-RL agent on Cheetah Run task unsupervisely for 500k environment steps and then train it further with the downstream reward for another 500k steps, you can run:

python train.py env=cheetah_run num_expl_steps=250000 num_train_steps=500000

Note that we divede the number of steps by action repeat, which is set to 2 for all the environments.

This will produce the exp_local folder, where all the outputs are going to be stored including train/eval logs, tensorboard blobs, and evaluation episode videos. To launch tensorboard run

tensorboard --logdir exp_local

Proto-RL: Reinforcement Learning with Prototypical Representations

Related tags

Overview

Proto-RL: Reinforcement Learning with Prototypical Representations

Citation

Requirements

Instructions

Owner

Denis Yarats

Reference code for the paper "Cross-Camera Convolutional Color Constancy" (ICCV 2021)

This is a repository for a Semantic Segmentation inference API using the Gluoncv CV toolkit

Second Order Optimization and Curvature Estimation with K-FAC in JAX.

Real time sign language recognition

Mind the Trade-off: Debiasing NLU Models without Degrading the In-distribution Performance

PyTorch Implementation of Spatially Consistent Representation Learning(SCRL)

View model summaries in PyTorch!

This repository is for our paper Exploiting Scene Graphs for Human-Object Interaction Detection accepted by ICCV 2021.

Online Multi-Granularity Distillation for GAN Compression (ICCV2021)

A faster pytorch implementation of faster r-cnn

Repository for GNSS-based position estimation using a Deep Neural Network

Exploring the link between uncertainty estimates obtained via "exact" Bayesian inference and out-of-distribution (OOD) detection.

AnimationKit: AI Upscaling & Interpolation using Real-ESRGAN+RIFE

The aim of the game, as in the original one, is to find a specific image from a group of different images of a person's face

Spectral Tensor Train Parameterization of Deep Learning Layers

The official code for PRIMER: Pyramid-based Masked Sentence Pre-training for Multi-document Summarization

Basics of 2D and 3D Human Pose Estimation.

Code and data for the paper "Hearing What You Cannot See"

Learn about Spice.ai with in-depth samples

Code for ICCV2021 paper PARE: Part Attention Regressor for 3D Human Body Estimation