Empirical Robustness Analysis of Learning to Incentivize Other Self-interested Agents

Bengisu Guresti*, Abdullah Vanlioglu, Nazim Kemal Ure

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Sequential Social Dilemmas are gaining attention in recent years. The current trends either focus on engineering incentive functions for modifying rewards to reach general welfare, or develop learning based approaches to modify the reward function by accounting for the impact of the incentive on policy updates. One of the most significant works in the learning based approach is LIO, which enables independent self-interested agents to incentivize each other by an additive incentive reward and demonstrates the method’s success in several sequential social dilemma environments. We investigate LIO’s performance under a variety of different setups in public goods game Cleanup in order to analyse its robustness against necessity of including inductive bias in incentive function, randomness in initial agent position with an option of asymmetric incentive potential, and assess its stability under frozen incentive functions after agents’ explorations are reset. We observe and demonstrate empirically that LIO is indeed sensitive to these settings and it is not reliable for obtaining good incentives that would let the system stay stable when it is static. We conclude with some research directions that would improve the robustness of the method and incentive learning research.

Original languageEnglish
Title of host publicationComputational Science and Computational Intelligence - 11th International Conference, CSCI 2024, Proceedings
EditorsHamid R. Arabnia, Leonidas Deligiannidis, Farzan Shenavarmasouleh, Soheyla Amirian, Farid Ghareh Mohammadi
PublisherSpringer Science and Business Media Deutschland GmbH
Pages116-125
Number of pages10
ISBN (Print)9783031995880
DOIs
Publication statusPublished - 2025
Event11th International Conference on Computational Science and Computational Intelligence, CSCI 2024 - Las Vegas, United States
Duration: 11 Dec 202413 Dec 2024

Publication series

NameCommunications in Computer and Information Science
Volume2512 CCIS
ISSN (Print)1865-0929
ISSN (Electronic)1865-0937

Conference

Conference11th International Conference on Computational Science and Computational Intelligence, CSCI 2024
Country/TerritoryUnited States
CityLas Vegas
Period11/12/2413/12/24

Bibliographical note

Publisher Copyright:
© The Author(s), under exclusive license to Springer Nature Switzerland AG 2025.

Keywords

  • Meta-gradient Learning
  • Multi-agent Reinforcement Learning
  • Sequential Social Dilemmas

Fingerprint

Dive into the research topics of 'Empirical Robustness Analysis of Learning to Incentivize Other Self-interested Agents'. Together they form a unique fingerprint.

Cite this