A PyTorch replication of the model-based reinforcement learning algorithm MBPO

Last update: Jan 05, 2023

Overview

This is a PyTorch re-implementation of the model-based RL algorithm MBPO, as described in the paper "When to Trust Your Model: Model-Based Policy Optimization".

This code builds on an earlier NeurIPS reproducibility challenge paper, which reproduced the results with a TensorFlow ensemble model but reported a significant drop in performance with a PyTorch ensemble model. This code re-implements the ensemble dynamics model in PyTorch and closes the gap.
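To illustrate what an ensemble dynamics model does during a model rollout, here is a minimal sketch in plain Python. It is not the code from this repository: the toy members stand in for trained PyTorch networks, and the names `make_toy_member` and `ensemble_step` are hypothetical. It assumes the scheme described in the MBPO paper, where each step picks one ensemble member at random and samples the next state from that member's Gaussian prediction.

```python
import random
from typing import Callable, List, Sequence, Tuple

# One ensemble member maps (state, action) to the per-dimension
# mean and standard deviation of the predicted next state.
GaussianModel = Callable[
    [Sequence[float], Sequence[float]], Tuple[List[float], List[float]]
]


def make_toy_member(bias: float) -> GaussianModel:
    """Hypothetical stand-in for one trained network in the ensemble."""

    def predict(state: Sequence[float], action: Sequence[float]):
        mean = [s + a + bias for s, a in zip(state, action)]
        std = [0.1 for _ in state]
        return mean, std

    return predict


def ensemble_step(
    members: Sequence[GaussianModel],
    state: Sequence[float],
    action: Sequence[float],
    rng: random.Random,
) -> List[float]:
    """Pick one member uniformly at random and sample a next state from it."""
    member = rng.choice(members)
    mean, std = member(state, action)
    return [rng.gauss(m, s) for m, s in zip(mean, std)]


# Three toy members that disagree slightly, mimicking model uncertainty.
ensemble = [make_toy_member(b) for b in (-0.05, 0.0, 0.05)]
rng = random.Random(0)
next_state = ensemble_step(ensemble, [0.0, 1.0], [0.5, -0.5], rng)
```

Sampling a different member at each step is what lets disagreement between members show up as variation in the generated rollouts.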

Reproduced results

The comparison covers two tasks only; other tasks have not been tested. On these two tasks, the PyTorch implementation achieves performance similar to the official TensorFlow code.

Dependencies

MuJoCo 1.5 & MuJoCo 2.0

Usage

python main_mbpo.py --env_name 'Walker2d-v2' --num_epoch 300 --model_type 'pytorch'

python main_mbpo.py --env_name 'Hopper-v2' --num_epoch 300 --model_type 'pytorch'
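For readers who want to see how the three flags above could be consumed, the following is a rough sketch using `argparse`. It is an assumption about the command-line interface, not a copy of `main_mbpo.py`: the defaults, the help strings, and the `'tensorflow'` choice for `--model_type` are hypothetical, and the real script may accept more options.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Build a parser for the flags shown in the usage commands."""
    parser = argparse.ArgumentParser(description="MBPO training (sketch)")
    # Gym environment id, e.g. 'Walker2d-v2' or 'Hopper-v2'.
    parser.add_argument("--env_name", default="Hopper-v2",
                        help="environment to train on (default is assumed)")
    # Number of training epochs.
    parser.add_argument("--num_epoch", type=int, default=300,
                        help="number of epochs (default is assumed)")
    # Backend of the ensemble dynamics model; the choices are assumed.
    parser.add_argument("--model_type", default="pytorch",
                        choices=["pytorch", "tensorflow"],
                        help="ensemble model backend")
    return parser


# Parse the same arguments as the first usage command.
args = build_parser().parse_args(
    ["--env_name", "Walker2d-v2", "--num_epoch", "300", "--model_type", "pytorch"]
)
print(args.env_name, args.num_epoch, args.model_type)
```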

Reference

Official TensorFlow implementation: https://github.com/JannerM/mbpo
Code for the reproducibility challenge paper: https://github.com/jxu43/replication-mbpo

Owner

Xingyu Lin
