This repository contains the source code used to produce the results obtained in Multi-Agent Reinforcement Learning via Distributed MPC as a Function Approximator published in Automatica.
In this work we propose the use of a distributed model predictive control scheme as a function approximator for multi-agent reinforcement learning. We consider networks of linear dynamical systems.
If you find the paper or this repository helpful in your publications, please consider citing it.
@article{mallick2024multi,
title={Multi-agent reinforcement learning via distributed MPC as a function approximator},
author={Mallick, Samuel and Airaldi, Filippo and Dabiri, Azita and De Schutter, Bart},
journal={Automatica},
volume={167},
pages={111803},
year={2024},
publisher={Elsevier}
}
The code was created with Python 3.11
. To access it, clone the repository
git clone https://github.com/SamuelMallick/dmpcrl-concept.git
cd dmpcrl-concept
and then install the required packages by, e.g., running
pip install -r requirements.txt
The repository code is structured in the following way
data
contains the .pkl data files that have been generated for the paper Multi-Agent Reinforcement Learning via Distributed MPC as a Function Approximator.plotting
contains scripts that are used for generating the images used in the paper Multi-Agent Reinforcement Learning via Distributed MPC as a Function Approximator.power_system
contains contains all files relating to the power system example in the paper Multi-Agent Reinforcement Learning via Distributed MPC as a Function Approximator. q_learning_power.py runs the MARL training algorithm.example_1
contains contains all files relating to the power system example in the paper Multi-Agent Reinforcement Learning via Distributed MPC as a Function Approximator. example_1.py.py runs the MARL training algorithm.
The repository is provided under the GNU General Public License. See the LICENSE file included with this repository.
Samuel Mallick, PhD Candidate [[email protected] | [email protected]]
Delft Center for Systems and Control in Delft University of Technology
This research is part of a project that has received funding from the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation programme (Grant agreement No. 101018826 - CLariNet).
Copyright (c) 2023 Samuel Mallick.
Copyright notice: Technische Universiteit Delft hereby disclaims all copyright interest in the program “dmpcrl-concept” (Multi-Agent Reinforcement Learning via Distributed MPC as a Function Approximator) written by the Author(s). Prof. Dr. Ir. Fred van Keulen, Dean of 3mE.