Resetting only the vectorized environments that are done? #73

kfu02 · 2023-12-26T17:35:49Z

Hi, sorry in advance if this isn't the right place to ask these kinds of questions.

I have been playing with VMAS in its vanilla form (no TorchRL/RLlib) to understand how to implement my own Scenarios, and I am confused about how VMAS handles resetting the environment. The reset() docstring states that it handles resetting "in a vectorized way". From my testing, it seems to reset all vectorized environments.

I was hoping "in a vectorized way" meant that it only reset the environments which were done and left the others alone. I would like it to behave this way to collect episode reward from episodes that are allowed to run until termination, for instance. Does VMAS have this functionality built-in? Am I misunderstanding reset()?

Thank you for the great library, by the way!

matteobettini · 2023-12-27T11:38:59Z

Hello. Thanks for this question; this is a point that is worth clarifying and improving upon.

The current situation

Currently, as you say, there are 2 ways to reset an environment:

env.reset(), which resets all environments
env.reset_at(index), which resets the single environment at the given index (an int)

The way that is currently available to reset done environments is to cycle through the done flags and reset only the done envs as:

# done: tensor of per-environment done flags, shape = [n_envs]
for i in range(n_envs):
    if done[i]:
        # reset only the sub-environment that has finished
        env.reset_at(i)
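
For the use case in the question (collecting the return of episodes that run until termination), the loop above can be combined with a per-environment accumulator. The following is a minimal sketch, not VMAS code: `StubEnv` is a hypothetical stand-in that only mimics `reset_at(index)`, and `reset_done_envs` and the flat per-environment reward lists are illustrative assumptions.

```python
class StubEnv:
    """Hypothetical stand-in for a vectorized env; only mimics reset_at()."""

    def __init__(self, n_envs):
        self.n_envs = n_envs
        self.reset_calls = []  # records which indices were reset

    def reset_at(self, index):
        self.reset_calls.append(index)


def reset_done_envs(env, done, running_returns, finished_returns):
    """Reset only the done sub-environments and bank their episode returns.

    done: sequence of per-environment done flags, length n_envs
    running_returns: returns accumulated so far, one entry per environment
    finished_returns: list that receives the return of each completed episode
    """
    for i, is_done in enumerate(done):
        if is_done:
            finished_returns.append(running_returns[i])
            running_returns[i] = 0.0  # start a fresh episode for this env
            env.reset_at(i)


env = StubEnv(n_envs=3)
running = [0.0, 0.0, 0.0]
finished = []

# Two fake steps: (per-env reward, per-env done flag)
steps = [
    ([1.0, 2.0, 3.0], [False, True, False]),
    ([1.0, 1.0, 1.0], [True, False, False]),
]
for rewards, done in steps:
    running = [r + x for r, x in zip(running, rewards)]
    reset_done_envs(env, done, running, finished)
```

Environments that are not done keep accumulating reward across calls, so each entry in `finished` corresponds to one episode that ran to its own termination.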

The ideal situation

To improve efficiency and avoid this for loop, it would be awesome if the reset_at function also accepted a mask.

Something like:

env.reset_at(done)
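
To illustrate the idea of a mask-based reset, here is a minimal sketch on a single batched tensor. It is not how VMAS implements resetting, and `masked_reset`, `state`, and `initial_state` are hypothetical names; it only shows how a boolean mask of shape [n_envs] can select which rows to restore without a Python loop.

```python
import torch


def masked_reset(state, initial_state, mask):
    """Return a copy of state where the rows selected by mask are reset.

    state, initial_state: tensors of shape [n_envs, dim]
    mask: boolean tensor of shape [n_envs]; True means "reset this env"
    """
    # Broadcast the mask over the feature dimension and pick values per row
    return torch.where(mask.unsqueeze(-1), initial_state, state)


n_envs, dim = 4, 2
state = torch.arange(n_envs * dim, dtype=torch.float32).reshape(n_envs, dim)
initial_state = torch.zeros(n_envs, dim)
done = torch.tensor([False, True, False, True])

# Rows 1 and 3 are restored to their initial values; rows 0 and 2 are kept
new_state = masked_reset(state, initial_state, done)
```

A real implementation would have to apply this kind of operation to every piece of batched state in the simulator and in each scenario.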

This would be amazing. The only problem is that the reset_at function of all current scenarios and a major bit of simulator logic will need to be rewritten. So it is not a quick or easy effort.

A consideration

What I do for some scenarios I create is to not implement a done function, so that all environments are done only after max_steps. Since all environments then finish at the same time, you can always call env.reset(). I understand that this does not fit all tasks, but I figured I would mention it in case it is helpful.
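
A minimal sketch of this fixed-horizon pattern, using a hypothetical stub rather than the VMAS API (`FixedHorizonStubEnv` and its simplified `step()` are assumptions made for illustration):

```python
class FixedHorizonStubEnv:
    """Hypothetical stand-in: every sub-environment ends after max_steps."""

    def __init__(self, n_envs, max_steps):
        self.n_envs = n_envs
        self.max_steps = max_steps
        self.t = 0
        self.n_resets = 0

    def reset(self):
        # Resets all sub-environments at once
        self.t = 0
        self.n_resets += 1

    def step(self):
        # No task-specific done condition: done depends only on the step count
        self.t += 1
        finished = self.t >= self.max_steps
        return [finished] * self.n_envs


env = FixedHorizonStubEnv(n_envs=4, max_steps=3)
env.reset()
for _ in range(7):
    done = env.step()
    if all(done):
        # All flags flip together, so a full reset never cuts an episode short
        env.reset()
```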

P.S. This change has long been on our TODOs https://github.com/proroklab/VectorizedMultiAgentSimulator?tab=readme-ov-file#todos

kfu02 · 2023-12-27T16:49:21Z

Thank you! Your answer makes sense. I will think over these options.

matteobettini pinned this issue Dec 27, 2023

matteobettini mentioned this issue Jun 26, 2024

[DO NOT CLOSE] Library TODOs and call for contributions #116

Open

5 tasks
