A high-performance framework for reinforcement learning research utilizing NVIDIA's Isaac Gym physics simulation engine.
This framework implements state-of-the-art reinforcement learning algorithms in highly parallelized environments, leveraging NVIDIA's Isaac Gym for physics-based simulation. The implementation focuses on maximizing computational efficiency through GPU acceleration and vectorized environments.
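As a rough illustration of what vectorized, GPU-resident simulation means in practice, the sketch below keeps every environment quantity as a batched tensor so that thousands of environments advance in a single batched operation. This is a conceptual Python sketch under assumed names; it is not Isaac Gym's API and not this framework's internal code.

```python
import torch

class BatchedEnvSketch:
    """Conceptual stand-in for a vectorized GPU environment (not the real API)."""

    def __init__(self, num_envs=4096, obs_dim=48, act_dim=12, device="cuda:0"):
        self.device = torch.device(device if torch.cuda.is_available() else "cpu")
        self.obs = torch.zeros(num_envs, obs_dim, device=self.device)
        self.act_dim = act_dim

    def step(self, actions):
        # All environments advance together in batched tensor ops; there is no
        # per-environment Python loop, which is what makes GPU rollouts fast.
        self.obs = self.obs + 0.01 * torch.randn_like(self.obs)
        rewards = -actions.pow(2).sum(dim=-1)  # placeholder reward
        dones = torch.zeros(self.obs.shape[0], dtype=torch.bool, device=self.device)
        return self.obs, rewards, dones

env = BatchedEnvSketch()
actions = torch.zeros(4096, 12, device=env.device)
obs, rewards, dones = env.step(actions)
```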
- Operating System: Ubuntu 18.04 or 20.04 LTS
- NVIDIA GPU with CUDA support
- Python 3.7+
- NVIDIA Isaac Gym (Preview 4)
- PyTorch
- Weights & Biases (for experiment tracking)
- CUDA Toolkit
- Download Isaac Gym Preview 4 from the NVIDIA Developer Portal
- Follow the installation instructions in the Isaac Gym documentation
- Install additional dependencies:
pip install -r requirements.txt
- Proximal Policy Optimization (PPO)
- Soft Actor-Critic (SAC)
- Highly parallelized environment simulation
- GPU-accelerated physics computation
- Integrated experiment tracking via Weights & Biases
- Multi-GPU support for distributed training
PPO Implementation:
python3 train.py task=RLTask train=TaskPPO headless=True wandb_activate=True
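At the core of the PPO update is the clipped surrogate objective. The snippet below is a minimal PyTorch sketch of that loss; the tensor names and the clipping coefficient are illustrative assumptions rather than this repository's internals.

```python
import torch

def ppo_clipped_loss(new_log_probs, old_log_probs, advantages, clip_eps=0.2):
    # Probability ratio between the updated policy and the behavior policy.
    ratio = torch.exp(new_log_probs - old_log_probs)
    # Pessimistic (min) of the unclipped and clipped terms, negated so that
    # minimizing the loss maximizes the clipped surrogate objective.
    unclipped = ratio * advantages
    clipped = torch.clamp(ratio, 1.0 - clip_eps, 1.0 + clip_eps) * advantages
    return -torch.min(unclipped, clipped).mean()
```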
SAC Implementation:
python3 train.py task=RLTask train=TaskSAC headless=True wandb_activate=True
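SAC, in contrast, is an off-policy method that maximizes an entropy-regularized return. A hedged sketch of the actor loss is shown below; the critic, replay buffer, and temperature tuning are omitted, and all names are illustrative.

```python
import torch

def sac_actor_loss(log_probs, q_values, alpha=0.2):
    # Minimize E[ alpha * log pi(a|s) - Q(s, a) ]: push the policy toward
    # high-value actions while keeping its entropy (exploration) high.
    return (alpha * log_probs - q_values).mean()
```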
For distributed training across multiple GPUs:
torchrun --standalone --nnodes=1 --nproc_per_node=3 train.py \
multi_gpu=True \
task=RLTask \
train=TaskPPO \
headless=True \
wandb_activate=True
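When launched with torchrun, each worker process receives its rank through environment variables such as LOCAL_RANK, RANK, and WORLD_SIZE. The sketch below shows the typical bootstrap a torchrun-launched script performs; it is a generic PyTorch pattern, not necessarily what train.py does internally.

```python
import os
import torch
import torch.distributed as dist

# torchrun populates these variables for every spawned worker.
local_rank = int(os.environ.get("LOCAL_RANK", 0))
torch.cuda.set_device(local_rank)

# NCCL is the usual backend for multi-GPU training on NVIDIA hardware.
dist.init_process_group(backend="nccl")

# Each rank would then wrap its model for gradient averaging, e.g.:
# model = torch.nn.parallel.DistributedDataParallel(model, device_ids=[local_rank])
```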
To evaluate a trained model:
python3 train.py task=RLTask train=TaskSAC test=True \
checkpoint=runs/TaskSAC_xxx/nn/TaskSAC.pth
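Conceptually, evaluation amounts to loading the saved weights and running the policy without exploration noise. The following is only a sketch: the checkpoint key, network shape, and observation size are assumptions, not this repository's actual checkpoint format.

```python
import torch
import torch.nn as nn

# Hypothetical policy network; the real architecture is defined by the training config.
policy = nn.Sequential(nn.Linear(64, 256), nn.ELU(), nn.Linear(256, 8))

checkpoint = torch.load("runs/TaskSAC_xxx/nn/TaskSAC.pth", map_location="cpu")
policy.load_state_dict(checkpoint["model"])  # "model" key is an assumption
policy.eval()

with torch.no_grad():
    obs = torch.zeros(1, 64)   # placeholder observation batch
    actions = policy(obs)      # deterministic actions for evaluation
```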
A CPU-based implementation using Stable Baselines3 is available in the sb3_implementation directory for systems without NVIDIA GPU support. It preserves the algorithms faithfully, but it does not match the training throughput of the GPU-accelerated version.
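For orientation, a minimal Stable Baselines3 training loop on CPU looks roughly like the example below; the environment id is a placeholder, and the actual entry point lives in the sb3_implementation directory.

```python
import gymnasium as gym
from stable_baselines3 import PPO

# Placeholder environment; the repository registers its own task separately.
env = gym.make("Pendulum-v1")

model = PPO("MlpPolicy", env, verbose=1, device="cpu")
model.learn(total_timesteps=100_000)
model.save("ppo_pendulum_cpu")
```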
Our ongoing research focuses on several key areas:
- Policy Optimization: Investigation of advanced policy optimization techniques beyond vanilla implementations
- Experience Replay: Analysis of sophisticated replay buffer mechanisms for off-policy algorithms
- Reward Engineering: Development of more nuanced reward functions to guide policy learning
- Human Feedback Integration: Exploration of RLHF (Reinforcement Learning from Human Feedback) methodologies
This research builds upon the foundational work of the NVIDIA Isaac Gym team. Core functionalities are derived from the Isaac Gym Environments repository.
This project is released under the MIT License. See the LICENSE file for details.