The code is the part of Maytus Piriyajitakonkij's Individual Project (Dissertation), the half of MSc in Computing (AI and Machine Learning Specialism) programme at Imperial College London.
Single-agent training
python main_rl.py --train --task counter --render
Multi-agent training
python main_multi_agent.py --train --render