Effects of communication by RL-agents in a single-patch foraging environment

Keywords

Loading...
Thumbnail Image

Issue Date

2025-08-17

Language

en

Document type

Journal Title

Journal ISSN

Volume Title

Publisher

Title

ISSN

Volume

Issue

Startpage

Endpage

DOI

Abstract

Abstract This thesis examines how active, cost-aware communication influences cooperation in a two agent, singlepatch foraging environment under partial observability. A dual-channel mechanism is introduced, enabling agents to independently control message sending (“willingness”) and reception (“attention”), with metabolic penalties applied to both. Agents are trained using the Deep Deterministic Policy Gradient (DDPG) algorithm, and evaluated across varying communication costs, resource abundance, and proportions of social welfare in the reward function. Communication declined almost linearly as costs increased, with a small unexplored performance spike at low non-zero costs. Resource abundance produced a non-monotonic effect: in resource-scarce settings, communication reduced performance, while in resource-rich settings it improved it, with the tipping point well above the level at which communication first emerged. Social welfare weighting had a strong influence. Minimal communication occurred with purely individual rewards, rising to a maximum at moderate-to-high welfare proportions before plateauing. Overall, social welfare effects were stronger than those of direct communication channels, indicating that reward structure can act as a strong indirect communication mechanism. Findings suggest that the value of communication depends on environmental constraints and reward composition, and that jointly designing both is key to fostering efficient, stable cooperation in multi-agent reinforcement learning.

Description

Citation

Faculty

Faculteit der Sociale Wetenschappen