You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+13-21Lines changed: 13 additions & 21 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -1,14 +1,15 @@
1
-
# RSLRL
1
+
# RSL-RL
2
2
3
-
A fast and simple implementation of RL algorithms, designed to run fully on GPU.
4
-
This code is an evolution of `rl-pytorch` provided with NVIDIA's Isaac Gym.
3
+
A fast and simple implementation of learning algorithms for robotics. For an overview of the library please have a look at https://arxiv.org/pdf/2509.10771.
5
4
6
5
Environment repositories using the framework:
7
6
8
7
***`Isaac Lab`** (built on top of NVIDIA Isaac Sim): https://github.com/isaac-sim/IsaacLab
9
-
***`Legged-Gym`** (built on top of NVIDIA Isaac Gym): https://leggedrobotics.github.io/legged_gym/
8
+
***`Legged Gym`** (built on top of NVIDIA Isaac Gym): https://leggedrobotics.github.io/legged_gym/
9
+
***`MuJoCo Playground`** (built on top of MuJoCo MJX and Warp): https://github.com/google-deepmind/mujoco_playground/
10
+
***`mjlab`** (built on top of MuJoCo Warp): https://github.com/mujocolab/mjlab
10
11
11
-
The main branch supports **PPO** and **Student-Teacher Distillation** with additional features from our research. These include:
12
+
The library currently supports **PPO** and **Student-Teacher Distillation** with additional features from our research. These include:
12
13
13
14
*[Random Network Distillation (RND)](https://proceedings.mlr.press/v229/schwarke23a.html) - Encourages exploration by adding
14
15
a curiosity driven intrinsic reward.
@@ -21,8 +22,6 @@ information.
21
22
**Affiliation**: Robotic Systems Lab, ETH Zurich & NVIDIA <br/>
22
23
**Contact**: cschwarke@ethz.ch
23
24
24
-
> **Note:** The `algorithms` branch supports additional algorithms (SAC, DDPG, DSAC, and more). However, it isn't currently actively maintained.
25
-
26
25
27
26
## Setup
28
27
@@ -56,8 +55,7 @@ For documentation, we adopt the [Google Style Guide](https://sphinxcontrib-napol
56
55
We use the following tools for maintaining code quality:
57
56
58
57
-[pre-commit](https://pre-commit.com/): Runs a list of formatters and linters over the codebase.
59
-
-[black](https://black.readthedocs.io/en/stable/): The uncompromising code formatter.
60
-
-[flake8](https://flake8.pycqa.org/en/latest/): A wrapper around PyFlakes, pycodestyle, and McCabe complexity checker.
58
+
-[ruff](https://github.com/astral-sh/ruff): An extremely fast Python linter and code formatter, written in Rust.
61
59
62
60
Please check [here](https://pre-commit.com/#install) for instructions to set these up. To run over the entire repository, please execute the following command in the terminal:
63
61
@@ -70,20 +68,14 @@ pre-commit run --all-files
70
68
71
69
## Citing
72
70
73
-
**We are working on writing a white paper for this library.** Until then, please cite the following work
74
-
if you use this library for your research:
71
+
If you use this library for your research, please cite the following work:
75
72
76
73
```text
77
-
@InProceedings{rudin2022learning,
78
-
title = {Learning to Walk in Minutes Using Massively Parallel Deep Reinforcement Learning},
79
-
author = {Rudin, Nikita and Hoeller, David and Reist, Philipp and Hutter, Marco},
80
-
booktitle = {Proceedings of the 5th Conference on Robot Learning},
81
-
pages = {91--100},
82
-
year = {2022},
83
-
volume = {164},
84
-
series = {Proceedings of Machine Learning Research},
0 commit comments