A clean and robust Pytorch implementation of PPO on continuous action space.

Last update: Dec 16, 2022

Related tags

Overview

PPO-Continuous-Pytorch

I found the current implementation of PPO on continuous action space is whether somewhat complicated or not stable.
And this is a clean and robust Pytorch implementation of PPO on continuous action space. Here is the result:

All the experiments are trained with same hyperparameters.

Dependencies

gym==0.18.3
box2d==2.3.10
numpy==1.21.2
pytorch==1.8.1

How to use my code

Play with trained model

run 'python main.py --write False --render True --Loadmodel True --ModelIdex 400'

Train from scratch

run 'python main.py', where the default enviroment is Pendulum-v0.

Change Enviroment

If you want to train on different enviroments, just run 'python main.py --EnvIdex 0'.
The --EnvIdex can be set to be 0~5, where
'--EnvIdex 0' for 'BipedalWalker-v3'
'--EnvIdex 1' for 'BipedalWalkerHardcore-v3'
'--EnvIdex 2' for 'LunarLanderContinuous-v2'
'--EnvIdex 3' for 'Pendulum-v0'
'--EnvIdex 4' for 'Humanoid-v2'
'--EnvIdex 5' for 'HalfCheetah-v2'

Visualize the training curve

You can use the tensorboard to visualize the training curve. History training curve is saved at '\runs'

Hyperparameter Setting

For more details of Hyperparameter Setting, please check 'main.py'

A clean and robust Pytorch implementation of PPO on continuous action space.

Related tags

Overview

PPO-Continuous-Pytorch

Dependencies

How to use my code

Play with trained model

Train from scratch

Change Enviroment

Visualize the training curve

Hyperparameter Setting

Owner

XinJingHao

CLIP2Video: Mastering Video-Text Retrieval via Image CLIP

Deep learning operations reinvented (for pytorch, tensorflow, jax and others)

for taichi voxel-challange event

a generic C++ library for image analysis

HCQ: Hybrid Contrastive Quantization for Efficient Cross-View Video Retrieval

The codes of paper 'Active-LATHE: An Active Learning Algorithm for Boosting the Error exponent for Learning Homogeneous Ising Trees'

😮The official implementation of "CoNeRF: Controllable Neural Radiance Fields" 😮

Learning Calibrated-Guidance for Object Detection in Aerial Images

A Blender python script for getting asset browser custom preview images for objects and collections.

An integration of several popular automatic augmentation methods, including OHL (Online Hyper-Parameter Learning for Auto-Augmentation Strategy) and AWS (Improving Auto Augment via Augmentation Wise Weight Sharing) by Sensetime Research.

This folder contains the python code of UR5E's advanced forward kinematics model.

Code to reproduce the results for Compositional Attention

This is a Tensorflow implementation of Learning to See in the Dark in CVPR 2018

Ascend your Jupyter Notebook usage

Accurate 3D Face Reconstruction with Weakly-Supervised Learning: From Single Image to Image Set (CVPRW 2019). A PyTorch implementation.

🏖 Keras Implementation of Painting outside the box

Weakly Supervised Dense Event Captioning in Videos, i.e. generating multiple sentence descriptions for a video in a weakly-supervised manner.

Activity image-based video retrieval

Uncertainty-aware Semantic Segmentation of LiDAR Point Clouds for Autonomous Driving

TensorFlow implementation of Deep Reinforcement Learning papers