Official code repository for Continual Learning In Environments With Polynomial Mixing Times

Last update: Dec 19, 2021

Related tags

Overview

Official code for Continual Learning In Environments With Polynomial Mixing Times

Continual Learning in Environments with Polynomial Mixing Times

This repository provides official code base for the paper "Continual Learning in Environments with Polynomial Mixing Times"

Basic Setup

Clone this repository and then follow this command

cd polynomial-mixing-times

Create either use a python virtualenv or a conda environment and activate it.

pip install virtualenv
virtualenv -p /usr/bin/python3.7 mixing-times
source mixing-times/bin/activate

To install all the relevant packages use the following command:

pip install -e .

Running the experiments

We provide a running script with all relevant hyperparameters used for both baselines and our proposed model. One can run run_bottleneck.sh to run all the models.

To run the experiments of the proposed models on the Example 2 Bottleneck MDP class with 4 rooms, "random" task evolution and a random seed of 1, use the following command

bash run_bottleneck.sh 1 4 "random"

Available Models

Online Q learning
Q learning with Replay
Q learning w/ Dyna
Model based n-step TD
Vanilla Policy Gradient
Onpolicy rho learning
Off-policy rho learning
rho Policy Gradient

List of Environments

ScaleClass-v0
NBottleneckClass-v0
NCycleClass-v0

System requirements

We used python 3.7 version to run all our experiments.

Official code repository for Continual Learning In Environments With Polynomial Mixing Times

Related tags

Overview

Continual Learning in Environments with Polynomial Mixing Times

Basic Setup

Running the experiments

Available Models

List of Environments

System requirements

Owner

Sharath Raparthy

A toy compiler that can convert Python scripts to pickle bytecode 🥒

Sdf sparse conv - Deep Learning on SDF for Classifying Brain Biomarkers

Bringing sanity to world of messed-up data

Graph parsing approach to structured sentiment analysis.

Equivariant CNNs for the sphere and SO(3) implemented in PyTorch

Creating multimodal multitask models

An updated version of virtual model making

Autoformer: Decomposition Transformers with Auto-Correlation for Long-Term Series Forecasting

Deepface is a lightweight face recognition and facial attribute analysis (age, gender, emotion and race) framework for python

TargetAllDomainObjects - A python wrapper to run a command on against all users/computers/DCs of a Windows Domain

Yoloxkeypointsegment - An anchor-free version of YOLO, with a simpler design but better performance

Repository containing detailed experiments related to the paper "Memotion Analysis through the Lens of Joint Embedding".

Implementing Vision Transformer (ViT) in PyTorch

Source code related to the article submitted to the International Conference on Computational Science ICCS 2022 in London

This is the source code of the 1st place solution for segmentation task (with Dice 90.32%) in 2021 CCF BDCI challenge.

Serverless proxy for Spark cluster

Applications using the GTN library and code to reproduce experiments in "Differentiable Weighted Finite-State Transducers"

YOLOv5🚀 reproduction by Guo Quanhao using PaddlePaddle

Benchmarks for the Optimal Power Flow Problem

Official PyTorch implementation of the paper "Deep Constrained Least Squares for Blind Image Super-Resolution", CVPR 2022.