This is the code accompanying the paper: "Controlling Behavioral Diversity in Multi-Agent Reinforcement Learning".
- Create a virtual environment (e.g., conda) with Python 3.9.
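For example, with conda (a minimal sketch; the environment name `het_control` is an illustrative choice):

```bash
# Create and activate a fresh Python 3.9 environment (the name is illustrative)
conda create -n het_control python=3.9
conda activate het_control
```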
- Install our versions of VMAS, TensorDict, TorchRL, and BenchMARL:
```bash
git clone -b het_control https://github.com/proroklab/VectorizedMultiAgentSimulator.git
pip install -e VectorizedMultiAgentSimulator
git clone -b het_control https://github.com/matteobettini/tensordict.git
cd tensordict
python setup.py develop
cd ..
git clone -b het_control https://github.com/matteobettini/rl.git
cd rl
python setup.py develop
cd ..
git clone -b het_control https://github.com/matteobettini/BenchMARL.git
pip install -e BenchMARL
```
- Install the optional dependencies for logging:

```bash
pip install wandb moviepy
```
- Install this project:

```bash
git clone https://github.com/proroklab/ControllingBehavioralDiversity.git
pip install -e ControllingBehavioralDiversity
```
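You can sanity-check the installation by importing the package (a hypothetical check; `het_control` as the import name is an assumption based on the repository layout):

```bash
# Hypothetical smoke test: the import name het_control is assumed, not confirmed
python -c "import het_control"
```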
- Try running a script (it will ask about CUDA and WandB; you can change these values in `ControllingBehavioralDiversity/het_control/conf/experiment/het_control_experiment.yaml`):

```bash
python ControllingBehavioralDiversity/het_control/run_scripts/run_navigation_ippo.py model.desired_snd=0.1
```
The `het_control/run_scripts` folder contains the files to run the various experiments included in the paper.

To run an experiment, just do:

```bash
python run_navigation_ippo.py model.desired_snd=0.3
```
You can run the same experiment over multiple config values, such as desired diversity values and seeds, using Hydra's multirun flag `-m` (this launches one run per combination of values, nine in the example below):

```bash
python run_navigation_ippo.py -m model.desired_snd=-1,0,0.3 seed=0,1,2
```

`model.desired_snd=-1` instructs the script to run unconstrained heterogeneous policies (no diversity constraint is applied).
For more information on the running syntax and options, check out the BenchMARL guide.
The configuration for the codebase is available in the `het_control/conf` folder. This configuration overrides the default BenchMARL configuration, contained in `benchmarl/conf`. So, if a value is not present in the configuration of this project, it takes the default value from that folder.

At the top level of the `het_control/conf` folder, you can find a config file for each experiment, for example `navigation_ippo_config.yaml`.
These files define:
- an algorithm
- a policy model (which, in this repository, is always our proposed method)
- a critic model
- a task
- an experiment hyperparameter configuration
The values of these fields determine which configuration to load for each of these components. The configurations are loaded from the sub-folders: `het_control/conf/algorithm`, `het_control/conf/model`, `het_control/conf/task`, and `het_control/conf/experiment`.
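Putting this together, the configuration tree looks roughly as follows (a sketch assembled from the paths mentioned in this README; the exact contents of each sub-folder may differ):

```
het_control/conf/
├── navigation_ippo_config.yaml   # one top-level config file per experiment
├── algorithm/                    # algorithm configurations
├── model/                        # policy and critic model configurations
├── task/                         # task configurations
└── experiment/                   # experiment hyperparameters (e.g., het_control_experiment.yaml)
```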
The top-level experiment file sometimes overrides certain values of these components.
All the configuration attributes in these sub-folders are documented to explain their meaning. Docs on these values are also available in BenchMARL.
You can override any of these values from the command line using the basic Hydra syntax. For example:

```bash
python run_navigation_ippo.py model.desired_snd=0.3 seed=1 experiment.max_n_frames=1_000_000 algorithm.lmbda=0.8
```
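Such overrides compose with multirun too; for instance, a sketch that sweeps diversity values and seeds while also capping the number of frames (the specific values here are illustrative, the keys are the ones used above):

```bash
# Sweeps 3 desired_snd values x 3 seeds = 9 runs, each capped at 1M frames
python run_navigation_ippo.py -m model.desired_snd=0,0.3,0.6 seed=0,1,2 experiment.max_n_frames=1_000_000
```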
If you use this code in your work, please cite:

```bibtex
@inproceedings{bettini2024controlling,
  title={Controlling Behavioral Diversity in Multi-Agent Reinforcement Learning},
  author={Bettini, Matteo and Kortvelesy, Ryan and Prorok, Amanda},
  booktitle={Forty-first International Conference on Machine Learning},
  year={2024},
  url={https://openreview.net/forum?id=qQjUgItPq4}
}
```