Dynamic Bottleneck for Robust Self-Supervised Exploration

Last update: Nov 14, 2022

Related tags

Deep Learning DB

Overview

Dynamic Bottleneck

Introduction

This is a TensorFlow based implementation for our paper on

"Dynamic Bottleneck for Robust Self-Supervised Exploration". NeurIPS 2021

Prerequisites

python3.6 or 3.7, tensorflow-gpu 1.x, tensorflow-probability, openAI baselines, openAI Gym

Installation and Usage

Atari games

The following command should train a pure exploration agent on "Breakout" with default experiment parameters.

python run.py --env BreakoutNoFrameskip-v4

Atari games with Random-Box noise

The following command should train a pure exploration agent on "Breakout" with randomBox noise.

python run.py --env BreakoutNoFrameskip-v4 --randomBoxNoise

Atari games with Gaussian noise

The following command should train a pure exploration agent on "Breakout" with Gaussian noise.

python run.py --env BreakoutNoFrameskip-v4 --pixelNoise

Atari games with sticky actions

The following command should train a pure exploration agent on "sticky Breakout" with a probability of 0.25

python run.py --env BreakoutNoFrameskip-v4 --stickyAtari

Baselines

ICM: We use the official code of "Curiosity-driven Exploration by Self-supervised Prediction, ICML 2017" and "Large-Scale Study of Curiosity-Driven Learning, ICLR 2019".
Disagreement: We use the official code of "Self-Supervised Exploration via Disagreement, ICML 2019".
CB: We use the official code of "Curiosity-Bottleneck: Exploration by Distilling Task-Specific Novelty, ICML 2019".

Dynamic Bottleneck for Robust Self-Supervised Exploration

Related tags

Overview

Dynamic Bottleneck

Introduction

Prerequisites

Installation and Usage

Atari games

Atari games with Random-Box noise

Atari games with Gaussian noise

Atari games with sticky actions

Baselines

Owner

Bai Chenjia

Implementation of Monocular Direct Sparse Localization in a Prior 3D Surfel Map (DSL)

Prometheus exporter for Cisco Unified Computing System (UCS) Manager

本步态识别系统主要基于GaitSet模型进行实现

Deep and online learning with spiking neural networks in Python

A Joint Video and Image Encoder for End-to-End Retrieval

Sky Computing: Accelerating Geo-distributed Computing in Federated Learning

Implementation of the Triangle Multiplicative module, used in Alphafold2 as an efficient way to mix rows or columns of a 2d feature map, as a standalone package for Pytorch

This repository contains code and data for "On the Multimodal Person Verification Using Audio-Visual-Thermal Data"

PyTorch Implement of Context Encoders: Feature Learning by Inpainting

✨风纪委员会自动投票脚本，利用Github Action帮你进行裁决操作（为了让其他风纪委员有案件可判，本程序从中午12点才开始运行，有需要请自己修改运行时间）

This program will stylize your photos with fast neural style transfer.

CTF challenges from redpwnCTF 2021

AFLFast (extends AFL with Power Schedules)

An executor that performs image segmentation on fashion items

Plenoxels: Radiance Fields without Neural Networks, Code release WIP

Deep Learning ❤️ OneFlow

SurvITE: Learning Heterogeneous Treatment Effects from Time-to-Event Data

Medical image analysis framework merging ANTsPy and deep learning

A web porting for NVlabs' StyleGAN2, to facilitate exploring all kinds characteristic of StyleGAN networks

ACAV100M: Automatic Curation of Large-Scale Datasets for Audio-Visual Video Representation Learning. In ICCV, 2021.