CA | ES | EN
Cooperative Control of Environmental Extremes: A Multi-Agent Reinforcement Learning Study on Collaboration and Human Biases

In this talk, we explore the emergence of collaboration in multiagent reinforcement learning (MARL) through various examples, highlighting the role of human biases such as loss aversion and their impact on collaborative dynamics. I will then focus on a recent paper co-authored by Ricard Sole and Clement Moulin-Frier (https://arxiv.org/abs/2212.02395), in which we model adaptive dynamics in complex ecosystems. Building on the Forest-Fire model, the study uses fire as an external, fluctuating force in a simulated environment. Agents must balance tree harvesting and fire avoidance, resulting in the evolution of an ecological engineering strategy that optimally maintains biomass while suppressing large fires. We will discuss the implications of these findings for AI management of complex ecosystems, emphasizing the potential benefits of incorporating MARL and collaboration strategies into environmental management and conservation efforts.

Martí Sánchez-Fibla is a Tenure Track researcher at UPF and has recently been granted a research scientist position at IIIA, CSIC. He is currently leading a research industrial project red.es (http://red.es/) with the company BMAT and he has previously been principal investigator of the Plan Nacional INSOCO. His research is focused on the areas of Constraint Optimization (inference and search algorithms for problem solving), and NeuroRobotics (cognitive architectures, sensorimotor learning, adaptability), and Complex Systems.