skip to main content
10.1145/3583131.3590497acmconferencesArticle/Chapter ViewAbstractPublication PagesgeccoConference Proceedingsconference-collections
research-article
Public Access

Leveraging Fitness Critics To Learn Robust Teamwork

Published: 12 July 2023 Publication History

Abstract

Co-evolutionary algorithms have successfully trained agent teams for tasks such as autonomous exploration or robot soccer. However generally, such approaches seek a single strong team, whereas many real-world applications require agents to effectively cooperate across multiple teams. To adapt to different teammates, agents need to learn more general teamwork skills rather than a single team-specific role. Previous work primarily frames this as a fitness-shaping problem, providing high-quality but expensive evaluation methods to isolate an agent's contribution. In this work, we introduce Learned Evaluations for Robust Teaming (LERT), an approach that provides a local evaluation that leverages state trajectories of agents to better quantify their impact across multiple teams. The key insight of this work is that agent state trajectories and previous experiences carry sufficient information to map agent abilities to team performance. As a result, LERT cooperatively co-evolves agents to work together across arbitrary teams. While only using local information and significantly fewer team evaluations, LERT performs as well as-if not better than-current methods.

References

[1]
Adrian K Agogino and Kagan Tumer. 2008. Analyzing and Visualizing Multiagent Rewards in Dynamic and Stochastic Domains. Autonomous Agents and Multi-Agent Systems 17, 2 (2008), 320--338.
[2]
Stefano V Albrecht and Peter Stone. 2018. Autonomous Agents Modelling Other Agents: A Comprehensive Survey and Open Problems. Artificial Intelligence 258 (2018), 66--95.
[3]
Samuel Barrett and Peter Stone. 2015. Cooperating with Unknown Teammates in Complex Domains: A Robot Soccer Case Study of Ad Hoc Teamwork. In AAAI, Vol. 15. Citeseer, 2010--2016.
[4]
Samuel Barrett, Peter Stone, Sarit Kraus, and Avi Rosenfeld. 2012. Learning Teammate Models For Ad Hoc Teamwork. In AAMAS Adaptive Learning Agents (ALA) Workshop. 57--63.
[5]
Samuel Barrett, Peter Stone, Sarit Kraus, and Avi Rosenfeld. 2013. Teamwork with Limited Knowledge of Teammates. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 27.
[6]
Yu-Han Chang, Tracey Ho, and Leslie P Kaelbling. 2004. All Learning is Local: Multi-agent Learning in Global Reward Games. In Advances in neural information processing systems. 807--814.
[7]
Shuo Chen, Ewa Andrejczuk, Zhiguang Cao, and Jie Zhang. 2020. AATEAM: Achieving the Ad Hoc Teamwork by Employing the Attention Mechanism. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 34. 7095--7102.
[8]
Shuo Chen, Ewa Andrejczuk, Athirai Aravazhi Irissappane, and Jie Zhang. 2019. ATSIS: Achieving the Ad Hoc Teamwork by Sub-task Inference and Selection. (2019).
[9]
Jen Jen Chung, Carrie Rebhuhn, Connor Yates, Geoffrey A Hollinger, and Kagan Tumer. 2019. AMultiagent Framework for Learning Dynamic Traffic Management Strategies. Autonomous Robots 43, 6 (2019), 1375--1391.
[10]
Mitchell Colby, Jen Jen Chung, and Kagan Tumer. 2015. Implicit Adaptive Multi-Robot Coordination in Dynamic Environments. In 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS). IEEE, 5168--5173.
[11]
Mitchell Colby, Theodore Duchow-Pressley, Jen Jen Chung, and Kagan Tumer. 2016. Local Approximation of Difference Evaluation Functions. In Proceedings of the 2016 International Conference on Autonomous Agents & Multiagent Systems. 521--529.
[12]
Joshua Cook and Kagan Tumer. 2022. Fitness Shaping For Multiple Teams. In Proceedings of the Genetic and Evolutionary Computation Conference. 332--340.
[13]
George Cybenko. 1989. Approximation by Superpositions of a Sigmoidal Function. Mathematics of control, signals and systems 2, 4 (1989), 303--314.
[14]
Ping-an Gao, Zi-xing Cai, and Ling-li Yu. 2009. Evolutionary Computation Approach to Decentralized Multi-Robot Task Allocation. In 2009 Fifth International Conference on Natural Computation, Vol. 5. IEEE, 415--419.
[15]
Katie Long Genter, Noa Agmon, and Peter Stone. 2011. Role-Based Ad Hoc Teamwork. In Plan, Activity, and Intent Recognition. Citeseer, 1782--1783.
[16]
Xavier Glorot and Yoshua Bengio. 2010. Understanding The Difficulty of Training Deep Feedforward Neural Networks. In Proceedings of the Thirteenth International Conference on Artificial Intelligence and Statistics (Proceedings of Machine Learning Research, Vol. 9), Yee Whye Teh and Mike Titterington (Eds.). PMLR, Chia Laguna Resort, Sardinia, Italy, 249--256. https://proceedings.mlr.press/v9/glorot10a.html
[17]
Jorge Gomes, Miguel Duarte, Pedro Mariano, and Anders Lyhne Christensen. 2016. Cooperative Coevolution of Control for a Real Multirobot System. In International Conference on Parallel Problem Solving from Nature. Springer, 591--601.
[18]
Jorge Gomes, Pedro Mariano, and Anders Lyhne Christensen. 2017. Dynamic Team Heterogeneity in Cooperative Coevolutionary Algorithms. IEEE Transactions on Evolutionary Computation 22, 6 (2017), 934--948.
[19]
Andreas Kallinteris, Stavros Orfanoudakis, and Georgios Chalkiadakis. 2022. The Performance Impact of Combining Agent Factorization with Different Learning Algorithms for Multiagent Coordination. In Proceedings of the 12th Hellenic Conference on Artificial Intelligence. 1--10.
[20]
Xin Ma, Qin Zhang, and Yibin Li. 2007. Genetic Algorithm-Based Multi-Robot Cooperative Exploration. In 2007 IEEE International Conference on Control and Automation. IEEE, 1018--1023.
[21]
P Mariano, AL Christensen, and J Gomes. 2014. Avoiding Convergence in Cooperative Coevolution with Novelty Search. Avoiding convergence in cooperative coevolution with novelty search (2014), 1149--1156.
[22]
Volodymyr Mnih, Koray Kavukcuoglu, David Silver, Alex Graves, Ioannis Antonoglou, Daan Wierstra, and Martin Riedmiller. 2013. Playing Atari With Deep Reinforcement Learning. arXiv preprint arXiv:1312.5602 (2013).
[23]
M. J. Norooz Oliaee, B. Hamdaoui, and K. Tumer. 2013. Efficient Objective Functions for Coordinated Learning in Large-Scale Distributed OSA Systems. IEEE Transactions on Mobile Computing 12 (2013), 5.
[24]
Liviu Panait, R Paul Wiegand, and Sean Luke. 2004. A Sensitivity Analysis of a Cooperative Coevolutionary Algorithm Biased for Optimization. In Genetic and Evolutionary Computation Conference. Springer, 573--584.
[25]
Mitchell A Potter and Kenneth A De Jong. 1994. A Cooperative Coevolutionary Approach to Function Optimization. In International Conference on Parallel Problem Solving from Nature. Springer, 249--257.
[26]
Mitchell A Potter and Kenneth A De Jong. 2000. Cooperative Coevolution: An Architecture For Evolving Coadapted Subcomponents. Evolutionary computation 8, 1 (2000), 1--29.
[27]
Muhammad A Rahman, Niklas Hopner, Filippos Christianos, and Stefano V Albrecht. 2021. Towards Open Ad Hoc Teamwork Using Graph-based Policy Learning. In International Conference on Machine Learning. PMLR, 8776--8786.
[28]
Aida Rahmattalabi, Jen Jen Chung, Mitchell Colby, and Kagan Tumer. 2016. D++: Structural Credit Assignment in Tightly Coupled Multiagent Domains. In 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS). IEEE, 4424--4429.
[29]
Carrie Rebhuhn, Ryan Skeele, Jen Jen Chung, Geoffrey A Hollinger, and Kagan Tumer. 2015. Learning to Trick Cost-Based Planners Into Cooperative Behavior. In 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS). IEEE, 4627--4633.
[30]
Golden Rockefeller, Shauharda Khadka, and Kagan Tumer. 2020. Multi-level Fitness Critics for Cooperative Coevolution. In AAMAS Conference proceedings.
[31]
Golden Rockefeller, Patrick Mannion, and Kagan Tumer. 2019. Fitness Critics for Multiagent Learning. In 2019 International Symposium on Multi-Robot and Multi-Agent Systems (MRS). IEEE, 222--224.
[32]
Michael Schmidt and Hod Lipson. 2005. Coevolution of Fitness Maximizers and Fitness Predictors. GECCO Late Breaking Paper (2005).
[33]
Peter Stone, Gal A Kaminka, Sarit Kraus, Jeffrey S Rosenschein, et al. 2010. Ad Hoc Autonomous Agent Teams: Collaboration without Pre-Coordination.
[34]
Peter Stone, Gal A Kaminka, Sarit Kraus, Jeffrey S Rosenschein, and Noa Agmon. 2013. Teaching and Leading an Ad Hoc Teammate: Collaboration Without Pre-coordination. Artificial Intelligence 203 (2013), 35--65.
[35]
Pedro Trueba, Abraham Prieto, and Francisco Bellas. 2017. Embodied Evolution Versus Cooperative Coevolution in Multi-Robot Optimization: A Practical Comparison. In Proceedings of the Genetic and Evolutionary Computation Conference Companion. 79--80.
[36]
K. Tumer and A. K. Agogino. 2007. Coordinating Multi-Rover Systems: Evaluation Functions for Dynamic and Noisy Environments. In Evolutionary Computation in Dynamic and Uncertain Environments, S. Yang (Ed.). Springer, 371--388.
[37]
Kagan Tumer, Zachary T. Welch, and Adrian Agogino. 2008. Aligning Social Welfare and Agent Preferences to Alleviate Traffic Congestion. AAMAS (2008).
[38]
Eiji Uchibe, Masateru Nakamura, and Minoru Asada. 1999. Cooperative and Competitive Behavior Acquisition for Mobile Robots Through Co-evolution. In Proc. of the Genetic and Evolutionary Computation Conference. Citeseer, 1406--1413.
[39]
Connor Yates, Reid Christopher, and Kagan Tumer. 2020. Multi-Fitness Learning for Behavior-Driven Cooperation. In Proceedings of the 2020 Genetic and Evolutionary Computation Conference. 453--461.
[40]
Nicholas Zerbel and Kagan Tumer. 2020. The Power of Suggestion. In Proceedings of the 19th International Conference on Autonomous Agents and MultiAgent Systems. 1602--1610.

Cited By

View all
  • (2024)Interpersonal skills: a comparative study of the fitness industry and the technological sectorQainar Journal of Social Science10.58732/2958-7212-2024-3-54-713:3(54-71)Online publication date: 8-Dec-2024
  • (2024)Indirect Credit Assignment in a Multiagent SystemProceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems10.5555/3635637.3663136(2288-2290)Online publication date: 6-May-2024
  • (2024)Multidimensional Archive Of The State SpaceProceedings of the Genetic and Evolutionary Computation Conference Companion10.1145/3638530.3654352(267-270)Online publication date: 14-Jul-2024
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

GECCO '23: Proceedings of the Genetic and Evolutionary Computation Conference
July 2023
1667 pages
ISBN:9798400701191
DOI:10.1145/3583131
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 12 July 2023

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. multiagent learning
  2. fitness approximation
  3. evolutionary machine learning

Qualifiers

  • Research-article

Funding Sources

Conference

GECCO '23
Sponsor:

Acceptance Rates

Overall Acceptance Rate 1,669 of 4,410 submissions, 38%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)76
  • Downloads (Last 6 weeks)4
Reflects downloads up to 20 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2024)Interpersonal skills: a comparative study of the fitness industry and the technological sectorQainar Journal of Social Science10.58732/2958-7212-2024-3-54-713:3(54-71)Online publication date: 8-Dec-2024
  • (2024)Indirect Credit Assignment in a Multiagent SystemProceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems10.5555/3635637.3663136(2288-2290)Online publication date: 6-May-2024
  • (2024)Multidimensional Archive Of The State SpaceProceedings of the Genetic and Evolutionary Computation Conference Companion10.1145/3638530.3654352(267-270)Online publication date: 14-Jul-2024
  • (2024)Learning Aligned Local Evaluations For Better Credit Assignment In Cooperative CoevolutionProceedings of the Genetic and Evolutionary Computation Conference10.1145/3638529.3654157(286-294)Online publication date: 14-Jul-2024

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Login options

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media

pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy