|
|
Table
of Contents
AAMAS'22
Chairs Welcome
Piotr Faliszewski
Viviana Mascardi
Catherine Pelachaud
Matthew E. Taylor
Conference
Organization
Area
Chairs
Senior
Programme Committee
Programme
Committee
Auxiliary
Reviewers
Special
Track Reviewers
Awards
Sponsors
& Supporters
|
|
|
|
Main
Track
Using
Agent-Based Simulator to Assess Interventions Against COVID-19 in a Small
Community Generated from Map Data (Page
1)
Mitsuteru Abe (University of Tsukuba)
Fabio Tanaka (University of Tsukuba)
Jair Pereira Junior (University of Tsukuba)
Anna Bogdanova (University of Tsukuba)
Tetsuya Sakurai (University of Tsukuba)
Claus Aranha (University of Tsukuba)
Multi-Objective
Reinforcement Learning with Non-Linear Scalarization (Page
9)
Mridul Agarwal (Purdue University)
Vaneet Aggarwal (Purdue University)
Tian Lan (George Washington University)
Be
Considerate: Avoiding Negative Side Effects in Reinforcement Learning (Page
18)
Parand Alizadeh Alamdari (University of Toronto & Vector Institute)
Toryn Q. Klassen (University of Toronto & Vector Institute)
Rodrigo Toro Icarte (Pontificia Universidad Católica de Chile & Vector
Institute)
Sheila A. McIlraith (University of Toronto & Vector Institute)
Hacking
the Colony: On the Disruptive Effect of Misleading Pheromone and How to
Defend against It (Page
27)
Ashay Aswale (Worcester Polytechnic Institute)
Antonio López (Worcester Polytechnic Institute)
Aukkawut Ammartayakun (Worcester Polytechnic Institute)
Carlo Pinciroli (Worcester Polytechnic Institute) |
(Return
to Top) |
State
Supervised Steering Function for Sampling-based Kinodynamic Planning (Page
35)
Pranav Atreya (University of Texas at Austin)
Joydeep Biswas (University of Texas at Austin)
Unbiased
Asymmetric Reinforcement Learning under Partial Observability (Page
44)
Andrea Baisero (Northeastern University)
Christopher Amato (Northeastern University)
Multi-Agent
Heterogeneous Digital Twin Framework with Dynamic Responsibility Allocation
for Complex Task Simulation (Page
53)
Adrian Simon Bauer (German Aerospace Center (DLR) & Robotics and Mechatronics
Center (RMC))
Anne Köpken (German Aerospace Center (DLR) & Robotics and Mechatronics
Center (RMC))
Daniel Leidner (German Aerospace Center (DLR) & Robotics and Mechatronics
Center (RMC))
Reasoning
about Human-Friendly Strategies in Repeated Keyword Auctions (Page
62)
Francesco Belardinelli (Université d'Evry)
Wojtek Jamroga (University of Luxembourg & Institute of Computer Science,
Polish Academy of Sciences)
Vadim Malvone (Télécom Paris)
Munyque Mittelmann (Université de Toulouse - IRIT)
Aniello Murano (University of Naples Federico II)
Laurent Perrussel (Université de Toulouse - IRIT)
COPALZ:
A Computational Model of Pathological Appraisal Biases for an Interactive
Virtual Alzheimer Patient (Page
72)
Amine Benamara (CNRS-LISN, Université Paris-Saclay)
Jean-Claude Martin (CNRS-LISN, Université Paris-Saclay)
Elise Prigent (CNRS-LISN, Université Paris-Saclay)
Laurence Chaby (Sorbonne Université)
Mohamed Chetouani (Sorbonne Université)
Jean Zagdoun (Sorbonne Université)
Hélène Vanderstichel (CIREL - EA 4354, Université de Lille)
Sébastien Dacunha (Hôpitaux de Paris)
Brian Ravenet (CNRS-LISN, Université Paris-Saclay) |
(Return
to Top) |
Computing
Balanced Solutions for Large International Kidney Exchange Schemes (Page
82)
Márton Benedek (KRTK, Institute of Economics)
Péter Biró (KRTK, Institute of Economics)
Walter Kern (University of Twente)
Daniël Paulusma (Durham University)
Agent-based
Modeling and Simulation for Malware Spreading in D2D Networks (Page
91)
Ziyad Benomar (Orange Labs)
Chaima Ghribi (Orange Labs)
Elie Cali (Orange Labs)
Alexander Hinsen (Weierstrass Institute for Applied Analysis and Stochastics)
Benedikt Jahnel (Weierstrass Institute for Applied Analysis and Stochastics)
Quantitative
Group Trust: A Two-Stage Verification Approach (Page
100)
Jamal Bentahar (Concordia University)
Nagat Drawel (Concordia University)
Abdeladim Sadiki (Concordia University)
Asynchronous
Opinion Dynamics in Social Networks (Page
109)
Petra Berenbrink (Universität Hamburg)
Martin Hoefer (Goethe University Frankfurt)
Dominik Kaaser (Universität Hamburg)
Pascal Lenzner (Hasso Plattner Institute)
Malin Rau (Universität Hamburg)
Daniel Schmand (Universität Bremen)
Interpretable
Preference-based Reinforcement Learning with Tree-Structured Reward Functions (Page
118)
Tom Bewley (University of Bristol)
Freddy Lecue (CortAIx, Thales)
Multivariate
Algorithmics for Eliminating Envy by Donating Goods (Page
127)
Niclas Boehmer (TU Berlin)
Robert Bredereck (Humboldt-Universität zu Berlin)
Klaus Heeger (TU Berlin)
Dušan Knop (Czech Technical University in Prague)
Junjie Luo (Nanyang Technological University)
Proportional
Representation in Matching Markets: Selecting Multiple Matchings under
Dichotomous Preferences (Page
136)
Niclas Boehmer (TU Berlin)
Markus Brill (TU Berlin)
Ulrike Schmidt-Kraepelin (TU Berlin) |
(Return
to Top) |
A
Hierarchical Bayesian Process for Inverse RL in Partially-Controlled Environments (Page
145)
Kenneth Bogert (University of North Carolina at Asheville)
Prashant Doshi (University of Georgia)
Little
House (Seat) on the Prairie: Compactness, Gerrymandering, and Population
Distribution (Page
154)
Allan Borodin (University of Toronto)
Omer Lev (Ben-Gurion University of the Negev)
Nisarg Shah (University of Toronto)
Tyrone Strangway (Ben-Gurion University of the Negev)
Knowledge
Transmission and Improvement Across Generations do not Need Strong Selection (Page
163)
Yasser Bourahla (University Grenoble Alpes, Inria, CNRS, Grenoble INP,
LIG)
Manuel Atencia (Universidad de Málaga)
Jérôme Euzenat (University Grenoble Alpes, Inria, CNRS, Grenoble INP,
LIG)
Explainability
in Multi-Agent Path/Motion Planning: User-study-driven Taxonomy and Requirements (Page
172)
Martim Brandao (King's College London)
Masoumeh Mansouri (University of Birmingham)
Areeb Mohammed (King's College London)
Paul Luff (King's College London)
Amanda Coles (King's College London)
Relaxed
Notions of Condorcet-Consistency and Efficiency for Strategyproof Social
Decision Schemes (Page
181)
Felix Brandt (Technical University of Munich)
Patrick Lederer (Technical University of Munich)
René Romen (Technical University of Munich)
Fair
Stable Matching Meets Correlated Preferences (Page
190)
Angelina Brilliantova (Rochester Institute of Technology)
Hadi Hosseini (The Pennsylvania State University)
Exploiting
Causal Structure for Transportability in Online, Multi-Agent Environments (Page
199)
Axel Browne (Loyola Marymount University)
Andrew Forney (Loyola Marymount University) |
(Return
to Top) |
Beyond
Cake Cutting: Allocating Homogeneous Divisible Goods (Page
208)
Ioannis Caragiannis (Aarhus University)
Vasilis Gkatzelis (Drexel University)
Alexandros Psomas (Perdue University)
Daniel Schoepflin (Drexel University)
Planning,
Execution, and Adaptation for Multi-Robot Systems using Probabilistic
and Temporal Planning (Page
217)
Yaniel Carreno (Heriot-Watt University & The University of Edinburgh)
Jun Hao Alvin Ng (Heriot-Watt University & The University of Edinburgh)
Yvan Petillot (Heriot-Watt University & The University of Edinburgh)
Ron Petrick (Heriot-Watt University & The University of Edinburgh)
Bayesian
Persuasion Meets Mechanism Design: Going Beyond Intractability with Type
Reporting (Page 226)
Matteo Castiglioni (Politecnico di Milano)
Alberto Marchesi (Politecnico di Milano)
Nicola Gatti (Politecnico di Milano)
Best-Response
Bayesian Reinforcement Learning with Bayes-adaptive POMDPs for Centaurs (Page
235)
Mustafa Mert Çelikok (Aalto University)
Frans A. Oliehoek (Delft University of Technology)
Samuel Kaski (Aalto University & University of Manchester)
Anomaly
Guided Policy Learning from Imperfect Demonstrations (Page
244)
Zi-Xuan Chen (Nanjing University)
Xin-Qiang Cai (Nanjing University)
Yuan Jiang (Nanjing University)
Zhi-Hua Zhou (Nanjing University)
Individual-Level
Inverse Reinforcement Learning for Mean Field Games (Page
253)
Yang Chen (The University of Auckland)
Libo Zhang (The University of Auckland)
Jiamou Liu (The University of Auckland)
Shuyue Hu (National University of Singapore)
Simulating
Multiwinner Voting Rules in Judgment Aggregation (Page
263)
Julian Chingoma (University of Amsterdam)
Ulle Endriss (University of Amsterdam)
Ronald de Haan (University of Amsterdam)
Coordinated
Multi-Agent Pathfinding for Drones and Trucks over Road Networks (Page
272)
Shushman Choudhury (Stanford University)
Kiril Solovey (Stanford University)
Mykel Kochenderfer (Stanford University)
Marco Pavone (Stanford University) |
(Return
to Top) |
Pippi:
Practical Protocol Instantiation (Page
281)
Samuel H. Christie (North Carolina State University)
Amit K. Chopra (Lancaster University)
Munindar P. Singh (North Carolina State University)
Optimizing
Multi-Agent Coordination via Hierarchical Graph Probabilistic Recursive
Reasoning (Page 290)
Saar Cohen (Bar-Ilan University)
Noa Agmon (Bar-Ilan University)
Pareto
Optimal and Popular House Allocation with Lower and Upper Quotas (Page
300)
Ágnes Cseh (Institute of Economics, Centre for Economic and Regional
Studies)
Tobias Friedrich (Hasso Plattner Institute, University of Potsdam)
Jannik Peters (TU Berlin)
Three-Dimensional
Popular Matching with Cyclic Preferences (Page
309)
Ágnes Cseh (Institute of Economics, Centre for Economic and Regional
Studies)
Jannik Peters (TU Berlin)
Poincaré-Bendixson
Limit Sets in Multi-Agent Learning (Page
318)
Aleksander Czechowski (Delft University of Technology)
Georgios Piliouras (Singapore University of Technology and Design)
A
Distributed Differentially Private Algorithm for Resource Allocation in
Unboundedly Large Settings (Page
327)
Panayiotis Danassis (École Polytechnique Fédérale de Lausanne (EPFL))
Aleksei Triastcyn (École Polytechnique Fédérale de Lausanne (EPFL))
Boi Faltings (École Polytechnique Fédérale de Lausanne (EPFL))
Computation
and Bribery of Voting Power in Delegative Simple Games (Page
336)
Gianlorenzo D'Angelo (Gran Sasso Science Institute)
Esmaeil Delfaraz (Gran Sasso Science Institute)
Hugo Gilbert (Université Paris-Dauphine, Université PSL, CNRS, LAMSADE) |
(Return
to Top) |
Budgeted
Combinatorial Multi-Armed Bandits (Page
345)
Debojit Das (International Institute of Information Technology, Hyderabad)
Shweta Jain (Indian Institute of Technology, Ropar)
Sujit Gujar (International Institute of Information Technology, Hyderabad)
Efficient
Approximation Algorithms for the Inverse Semivalue Problem (Page
354)
Ilias Diakonikolas (University of Wisconsin-Madison)
Chrystalla Pavlou (TurinTech AI)
John Peebles (Princeton University)
Alistair Stewart (Web 3 Foundation)
Multiagent
Dynamics of Gradual Argumentation Semantics (Page
363)
Louise Dupuis de Tarlé (Université Paris-Dauphine)
Elise Bonzon (Université de Paris)
Nicolas Maudet (Sorbonne Université, CNRS)
How
to Fairly Allocate Easy and Difficult Chores (Page
372)
Soroush Ebadian (University of Toronto)
Dominik Peters (University of Toronto)
Nisarg Shah (University of Toronto)
Scalable
Multi-Agent Model-Based Reinforcement Learning (Page
381)
Vladimir Egorov (JetBrains Research & HSE University)
Alexei Shpilman (JetBrains Research & HSE University)
Facility
Location With Approval Preferences: Strategyproofness and Fairness (Page
391)
Edith Elkind (University of Oxford)
Minming Li (City University of Hong Kong)
Houyu Zhou (City University of Hong Kong)
Betweenness
Centrality in Multi-Agent Path Finding (Page
400)
Eric Ewing (University of Southern California)
Jingyao Ren (University of Southern California)
Dhvani Kansara (University of Southern California)
Vikraman Sathiyanarayanan (University of Southern California)
Nora Ayanian (University of Southern California) |
(Return
to Top) |
Welfare
vs. Representation in Participatory Budgeting (Page
409)
Roy Fairstein (Ben Gurion University of the Negev)
Dan Vilenchik (Ben Gurion University of the Negev)
Reshef Meir (Technion-Israel Institute of Technology)
Kobi Gal (Ben Gurion University of the Negev & University of Edinburgh)
A
Path-following Polynomial Equations Systems Approach for Computing Nash
Equilibria (Page
418)
Hélène Fargier (Université de Toulouse, IRIT)
Paul Jourdan (Université de Toulouse, INRAE-MIAT)
Régis Sabbadin (Université de Toulouse, INRAE-MIAT)
Ensemble
and Incremental Learning for Norm Violation Detection (Page
427)
Thiago Freitas dos Santos (Artificial Intelligence Research Institute
(IIIA-CSIC) & Universitat Autònoma de Barcelona)
Nardine Osman (Artificial Intelligence Research Institute (IIIA-CSIC))
Marco Schorlemmer (Artificial Intelligence Research Institute (IIIA-CSIC))
The
Price of Majority Support (Page
436)
Robin Fritsch (ETH Zürich)
Roger Wattenhofer (ETH Zürich)
A
Symbolic Representation for Probabilistic Dynamic Epistemic Logic (Page
445)
Sébastien Gamblin (Normandie University, UNICAEN, ENSICAEN, CNRS, GREYC)
Alexandre Niveau (Normandie University, UNICAEN, ENSICAEN, CNRS, GREYC)
Maroua Bouzid (Normandie University, UNICAEN, ENSICAEN, CNRS, GREYC)
Fully-Autonomous,
Vision-based Traffic Signal Control: From Simulation to Reality (Page
454)
Deepeka Garg (Aston University)
Maria Chli (Aston University)
George Vogiatzis (Aston University)
One-Sided
Matching Markets with Endowments: Equilibria and Algorithms (Page
463)
Jugal Garg (University of Illinois at Urbana-Champaign)
Thorben Tröbst (University of California, Irvine)
Vijay V. Vazirani (University of California, Irvine) |
(Return
to Top) |
Negotiated
Path Planning for Non-Cooperative Multi-Robot Systems (Page
472)
Anna Gautier (University of Oxford)
Alex Stephens (University of Oxford)
Bruno Lacerda (University of Oxford)
Nick Hawes (University of Oxford)
Michael Wooldridge (University of Oxford)
Refined
Hardness of Distance-Optimal Multi-Agent Path Finding (Page
481)
Tzvika Geft (Tel Aviv University)
Dan Halperin (Tel Aviv University)
Concave
Utility Reinforcement Learning: The Mean-field Game Viewpoint (Page
489)
Matthieu Geist (Google)
Julien Pérolat (Deepmind)
Mathieu Laurière (Google)
Romuald Elie (Deepmind)
Sarah Perrin (Univ. Lille, CNRS, Inria, Centrale Lille)
Oliver Bachem (Google)
Rémi Munos (Deepmind)
Olivier Pietquin (Google)
D3C:
Reducing the Price of Anarchy in Multi-Agent Learning (Page
498)
Ian Gemp (DeepMind)
Kevin R. McKee (DeepMind)
Richard Everett (DeepMind)
Edgar Duéñez-Guzmán (DeepMind)
Yoram Bachrach (DeepMind)
David Balduzzi (XTX Markets)
Andrea Tacchetti (DeepMind)
Sample-based
Approximation of Nash in Large Many-Player Games via Gradient Descent (Page
507)
Ian Gemp (DeepMind)
Rahul Savani (University of Liverpool)
Marc Lanctot (DeepMind)
Yoram Bachrach (DeepMind)
Thomas Anthony (DeepMind)
Richard Everett (DeepMind)
Andrea Tacchetti (DeepMind)
Tom Eccles (DeepMind)
János Kramár (DeepMind)
Building
Contrastive Explanations for Multi-Agent Team Formation (Page
516)
Athina Georgara (Artificial Intelligence Research Institute (IIIA-CSIC)
& Enzyme Advising Group)
Juan A. Rodriguez Aguilar (Artificial Intelligence Research Institute
(IIIA-CSIC))
Carles Sierra (Artificial Intelligence Research Institute (IIIA-CSIC))
Long-Term
Resource Allocation Fairness in Average Markov Decision Process (AMDP)
Environment (Page
525)
Ganesh Ghalme (Technion Israel Institute of Technology)
Vineet Nair (Technion Israel Institute of Technology)
Vishakha Patil (Indian Institute of Science)
Yilun Zhou (Massachusetts Institute of Technology) |
(Return
to Top) |
Fair
and Truthful Mechanism with Limited Subsidy (Page
534)
Hiromichi Goko (Toyota Motor Corporation)
Ayumi Igarashi (National Institute of Informatics)
Yasushi Kawase (University of Tokyo)
Kazuhisa Makino (Kyoto University)
Hanna Sumita (Tokyo Institute of Technology)
Akihisa Tamura (Keio University)
Yu Yokoi (National Institute of Informatics)
Makoto Yokoo (Kyushu University)
Robust
No-Regret Learning in Min-Max Stackelberg Games (Page
543)
Denizalp Goktas (Brown University)
Jiayi Zhao (Pomona College)
Amy Greenwald (Brown University)
Multi-Agent
Curricula and Emergent Implicit Signaling (Page
553)
Niko A. Grupen (Cornell University)
Daniel D. Lee (Cornell Tech)
Bart Selman (Cornell University)
Intention-Aware
Navigation in Crowds with Extended-Space POMDP Planning (Page
562)
Himanshu Gupta (University of Colorado Boulder)
Bradley Hayes (University of Colorado Boulder)
Zachary Sunberg (University of Colorado Boulder)
Multiagent
Model-based Credit Assignment for Continuous Control (Page
571)
Dongge Han (University of Oxford)
Chris Xiaoxuan Lu (University of Edinburgh)
Tomasz Michalak (University at Warsaw & IDEAS NCBR)
Michael Wooldridge (University of Oxford)
Hierarchical
Value Decomposition for Effective On-demand Ride-Pooling (Page
580)
Jiang Hao (Singapore Management University)
Pradeep Varakantham (Singapore Management University)
Computing
Nash Equilibria for District-based Nominations (Page
588)
Paul Harrenstein (University of Oxford)
Paolo Turrini (University of Warwick)
Ordinal
Maximin Share Approximation for Chores (Page
597)
Hadi Hosseini (The Pennsylvania State University)
Andrew Searns (Johns Hopkins University)
Erel Segal-Halevi (Ariel University) |
(Return
to Top) |
A
Mean Field Game Model of Spatial Evolutionary Games (Page
606)
Vincent Hsiao (University of Maryland)
Dana Nau (University of Maryland)
The
Dynamics of Q-learning in Population Games: A Physics-inspired Continuity
Equation Model (Page
615)
Shuyue Hu (National University of Singapore)
Chin-Wing Leung (The Chinese University of Hong Kong)
Ho-fung Leung (The Chinese University of Hong Kong)
Harold Soh (National University of Singapore)
Reduction-based
Solving of Multi-agent Pathfinding on Large Maps Using Graph Pruning (Page
624)
Matej Husár (Charles University)
Jiří Švancara (Charles University)
Philipp Obermeier (University of Potsdam)
Roman Barták (Charles University)
Torsten Schaub (Potassco Solutions & University of Potsdam)
Autonomous
Swarm Shepherding Using Curriculum-Based Reinforcement Learning (Page
633)
Aya Hussein (University of New South Wales)
Eleni Petraki (University of Canberra)
Sondoss Elsawah (University of New South Wales)
Hussein A. Abbass (University of New South Wales)
Cascades
and Overexposure in Social Networks: The Budgeted Case (Page
642)
Mohammad T. Irfan (Bowdoin College)
Kim Hancock (IBM)
Laura M. Friel (Bowdoin College)
Being
Central on the Cheap: Stability in Heterogeneous Multiagent Centrality
Games (Page 651)
Gabriel Istrate (West University of Timişoara)
Cosmin Bonchiş (West University of Timişoara)
A
Declarative Framework for Maximal k-plex Enumeration Problems (Page
660)
Said Jabbour (CRIL CNRS - Université d'Artois)
Nizar Mhadhbi (INSY2S)
Badran Raddaoui (Télécom SudParis & Institut Polytechnique de Paris)
Lakhdar Sais (CRIL CNRS - Université d'Artois) |
(Return
to Top) |
Lazy-MDPs:
Towards Interpretable RL by Learning When to Act (Page
669)
Alexis Jacq (Google Research)
Johan Ferret (Google Research, Inria, & Université de Lille)
Olivier Pietquin (Google Research)
Matthieu Geist (Google Research)
Balancing
Fairness and Efficiency in Traffic Routing via Interpolated Traffic Assignment (Page
678)
Devansh Jalota (Stanford University)
Kiril Solovey (Technion - Israel Institute of Technology)
Matthew Tsao (Stanford University)
Stephen Zoepf (Lacuna AI)
Marco Pavone (Stanford University)
Selecting
PhD Students and Projects with Limited Funding (Page
687)
Jatin Jindal (Google)
Jérôme Lang (CNRS, PSL)
Katarína Cechlárová (Pavol Jozef Šafárik University)
Julien Lesca (Huawei Technologies)
Optimal
Matchings with One-Sided Preferences: Fixed and Cost-Based Quotas (Page
696)
Santhini K. A. (Indian Institute of Technology Madras)
Govind S. Sankar (Duke University)
Meghana Nasre (Indian Institute of Technology Madras)
Planning
Not to Talk: Multiagent Systems that are Robust to Communication Loss (Page
705)
Mustafa O. Karabag (The University of Texas at Austin)
Cyrus Neary (The University of Texas at Austin)
Ufuk Topcu (The University of Texas at Austin)
How
Hard is Safe Bribery? (Page
714)
Neel Karia (Microsoft Research)
Faraaz Mallick (Indian Institute of Technology, Kharagpur)
Palash Dey (Indian Institute of Technology, Kharagpur)
BADDr:
Bayes-Adaptive Deep Dropout RL for POMDPs (Page
723)
Sammie Katt (Northeastern University)
Hai Nguyen (Northeastern University)
Frans A. Oliehoek (Delft University of Technology)
Christopher Amato (Northeastern University) |
(Return
to Top) |
Translating
Omega-Regular Specifications to Average Objectives for Model-Free Reinforcement
Learning (Page 732)
Milad Kazemi (Newcastle University)
Mateo Perez (University of Colorado Boulder)
Fabio Somenzi (University of Colorado Boulder)
Sadegh Soudjani (Newcastle University)
Ashutosh Trivedi (University of Colorado Boulder)
Alvaro Velasquez (Air Force Research Laboratory)
Tactile
Pose Estimation and Policy Learning for Unknown Object Manipulation (Page
742)
Tarik Kelestemur (Northeastern University)
Robert Platt (Northeastern University)
Taskin Padir (Northeastern University)
Disentangling
Successor Features for Coordination in Multi-agent Reinforcement Learning (Page
751)
Seung Hyun Kim (University of Illinois at Urbana Champaign)
Neale Van Stralen (University of Illinois at Urbana-Champaign)
Girish Chowdhary (University of Illinois at Urbana-Champaign)
Huy T. Tran (University of Illinois at Urbana-Champaign)
Equilibria
in Schelling Games: Computational Hardness and Robustness (Page
761)
Luca Kreisel (TU Berlin)
Niclas Boehmer (TU Berlin)
Vincent Froese (TU Berlin)
Rolf Niedermeier (TU Berlin)
Multimodal
Analysis of the Predictability of Hand-gesture Properties (Page
770)
Taras Kucherenko (KTH Royal Institute of Technology)
Rajmund Nagy (KTH Royal Institute of Technology)
Michael Neff (University of California, Davis)
Hedvig Kjellström (KTH Royal Institute of Technology)
Gustav Eje Henter (KTH Royal Institute of Technology)
Towards
Pluralistic Value Alignment: Aggregating Value Systems Through lp-Regression (Page
780)
Roger Lera-Leri (IIIA-CSIC)
Filippo Bistaffa (IIIA-CSIC)
Marc Serramia (IIIA-CSIC)
Maite Lopez-Sanchez (Universitat de Barcelona)
Juan Rodriguez-Aguilar (IIIA-CSIC)
Deploying
Vaccine Distribution Sites for Improved Accessibility and Equity to Support
Pandemic Response (Page
789)
George Z. Li (University of Maryland)
Ann Li (University of Virginia)
Madhav Marathe (University of Virginia)
Aravind Srinivasan (University of Maryland)
Leonidas Tsepenekas (University of Maryland)
Anil Vullikanti (University of Virginia) |
(Return
to Top) |
ASM-PPO:
Asynchronous and Scalable Multi-Agent PPO for Cooperative Charging (Page
798)
Yongheng Liang (Sun Yat-sen University)
Hejun Wu (Sun Yat-sen University)
Haitao Wang (Sun Yat-sen University)
Equilibrium
Computation For Knockout Tournaments Played By Groups (Page
807)
Grzegorz Lisowski (University of Warwick)
M. S. Ramanujan (University of Warwick)
Paolo Turrini (University of Warwick)
Residual
Entropy-based Graph Generative Algorithms (Page
816)
Wencong Liu (Beijing Institute of Technology & Southeast Institute
of Information Technology)
Jiamou Liu (The University of Auckland)
Zijian Zhang (Beijing Institute of Technology & Southeast Institute
of Information Technology)
Yiwei Liu (Defence Industry Secrecy Examination and Certification Center)
Liehuang Zhu (Beijing Institute of Technology)
The
Spoofing Resistance of Frequent Call Markets (Page
825)
Buhong Liu (King's College London)
Maria Polukarov (King's College London)
Carmine Ventre (King's College London)
Lingbo Li (Turing Intelligence Technology)
Leslie Kanthan (Turing Intelligence Technology)
Fan Wu (Turing Intelligence Technology)
Michail Basios (Turing Intelligence Technology)
Logical
Theories of Collective Attitudes and the Belief Base Perspective (Page
833)
Emiliano Lorini (IRIT, CNRS, Toulouse University)
Éloan Rapion (ENS Rennes)
Lyapunov
Exponents for Diversity in Differentiable Games (Page
842)
Jonathan Lorraine (University of Toronto)
Paul Vicol (University of Toronto)
Jack Parker-Holder (University of Oxford)
Tal Kachman (Radboud University)
Luke Metz (Google Research)
Jakob Foerster (University of Oxford)
Any-Play:
An Intrinsic Augmentation for Zero-Shot Coordination (Page
853)
Keane Lucas (Carnegie Mellon University)
Ross E. Allen (Massachusetts Institute of Technology) |
(Return
to Top) |
Coalition
Formation Games and Social Ranking Solutions (Page
862)
Roberto Lucchetti (Politecnico di Milano)
Stefano Moretti (Université Paris-Dauphine & Université PSL)
Tommaso Rea (Politecnico di Milano)
On
Parameterized Complexity of Binary Networked Public Goods Game (Page
871)
Arnab Maiti (Indian Institute of Technology Kharagpur)
Palash Dey (Indian Institute of Technology Kharagpur)
Efficient
Algorithms for Finite Horizon and Streaming Restless Multi-Armed Bandit
Problems (Page 880)
Aditya S. Mate (Harvard University)
Arpita Biswas (Harvard University)
Christoph Siebenbrunner (Harvard University)
Susobhan Ghosh (Harvard University)
Milind Tambe (Harvard University)
CAPS:
Comprehensible Abstract Policy Summaries for Explaining Reinforcement
Learning Agents (Page
889)
Joe McCalmon (Wake Forest University)
Thai Le (The Pennsylvania State University)
Sarra Alqahtani (Wake Forest University)
Dongwon Lee (The Pennsylvania State University)
Warmth
and Competence in Human-Agent Cooperation (Page
898)
Kevin R. McKee (DeepMind)
Xuechunzi Bai (Princeton University)
Susan T. Fiske (Princeton University)
Cooperation
and Learning Dynamics under Risk Diversity and Financial Incentives (Page
908)
Ramona Merhej (Instituto Superior Tecnico & Sorbonne University)
Fernando P. Santos (University of Amsterdam)
Francisco S. Melo (INESC-ID and Instituto Superior Tecnico, Universidade
de Lisboa)
Mohamed Chetouani (Sorbonne University)
Francisco C. Santos (INESC-ID and Instituto Superior Tecnico, Universidade
de Lisboa)
Preference-Based
Goal Refinement in BDI Agents (Page
917)
Mostafa Mohajeriparizi (University of Amsterdam)
Giovanni Sileno (University of Amsterdam)
Tom van Engers (University of Amsterdam) |
(Return
to Top) |
Learning
Equilibria in Mean-Field Games: Introducing Mean-Field PSRO (Page
926)
Paul Muller (Deepmind)
Mark Rowland (Deepmind)
Romuald Elie (Deepmind)
Georgios Piliouras (Singapore University of Technology and Design)
Julien Perolat (Deepmind)
Mathieu Lauriere (Google Brain)
Raphael Marinier (Google Brain)
Olivier Pietquin (Google Brain)
Karl Tuyls (Deepmind)
A
Graph-Based Algorithm for the Automated Justification of Collective Decisions (Page
935)
Oliviero Nardi (University of Amsterdam)
Arthur Boixel (University of Amsterdam)
Ulle Endriss (University of Amsterdam)
Deep
Reinforcement Learning for Active Wake Control (Page
944)
Grigory Neustroev (Delft University of Technology)
Sytze P. E. Andringa (Delft University of Technology)
Remco A. Verzijlbergh (Delft University of Technology & Whiffle)
Mathijs M. De Weerdt (Delft University of Technology)
Learning
Theory of Mind via Dynamic Traits Attribution (Page
954)
Dung Nguyen (Deakin University)
Phuoc Nguyen (Deakin University)
Hung Le (Deakin University)
Kien Do (Deakin University)
Svetha Venkatesh (Deakin University)
Truyen Tran (Deakin University)
Learning
to Transfer Role Assignment Across Team Sizes (Page
963)
Dung Nguyen (Deakin University)
Phuoc Nguyen (Deakin University)
Svetha Venkatesh (Deakin University)
Truyen Tran (Deakin University)
CTRMs:
Learning to Construct Cooperative Timed Roadmaps for Multi-agent Path
Planning in Continuous Spaces (Page
972)
Keisuke Okumura (Tokyo Institute of Technology)
Ryo Yonetani (OMRON SINIC X)
Mai Nishimura (OMRON SINIC X)
Asako Kanezaki (Tokyo Institute of Technology)
Factorial
Agent Markov Model: Modeling Other Agents' Behavior in presence of Dynamic
Latent Decision Factors (Page
982)
Liubove Orlov-Savko (Rice University)
Abhinav Jain (Rice University)
Gregory M. Gremillion (CCDC Army Research Lab)
Catherine E. Neubauer (CCDC Army Research Lab)
Jonroy D. Canady (CCDC Army Research Lab)
Vaibhav Unhelkar (Rice University) |
(Return
to Top) |
Networked
Restless Multi-Armed Bandits for Mobile Interventions (Page
1001)
Han-Ching Ou (Harvard University)
Christoph Siebenbrunner (Harvard University)
Jackson Killian (Harvard University)
Meredith B. Brooks (Harvard University)
David Kempe (University of Southern California)
Yevgeniy Vorobeychik (University of Washington in St. Louis)
Milind Tambe (Harvard University)
Characterizing
Attacks on Deep Reinforcement Learning (Page
1010)
Xinlei Pan (University of California, Berkeley)
Chaowei Xiao (NVIDIA & Arizona State University)
Warren He (University of California, Berkeley)
Shuang Yang (Alibaba)
Jian Peng (University of Illinois at Urbana-Champaign)
Mingjie Sun (Carnegie Mellon University)
Mingyan Liu (University of Michigan, Ann Arbor)
Bo Li (University of Illinois at Urbana-Champaign)
Dawn Song (University of California, Berkeley)
BOID*:
Autonomous Goal Deliberation through Abduction (Page
1019)
Stipe Pandžić (Utrecht University)
Jan Broersen (Utrecht University)
Henk Aarts (Utrecht University)
Scaling
Mean Field Games by Online Mirror Descent (Page
1028)
Julien Pérolat (DeepMind)
Sarah Perrin (University Lille, CNRS, Inria, Centrale Lille, UMR 9189
CRIStAL)
Romuald Elie (DeepMind)
Mathieu Laurière (Google Research)
Georgios Piliouras (Singapore University of Technology and Design)
Matthieu Geist (Google Research)
Karl Tuyls (DeepMind)
Olivier Pietquin (Google Research)
MORAL:
Aligning AI with Human Norms through Multi-Objective Reinforced Active
Learning (Page 1038)
Markus Peschl (Delft University of Technology)
Arkady Zgonnikov (Delft University of Technology)
Frans A. Oliehoek (Delft University of Technology)
Luciano C. Siebert (Delft University of Technology)
Emergent
Cooperation from Mutual Acknowledgment Exchange (Page
1047)
Thomy Phan (LMU Munich)
Felix Sommer (LMU Munich)
Philipp Altmann (LMU Munich)
Fabian Ritz (LMU Munich)
Lenz Belzner (Technische Hochschule Ingolstadt)
Claudia Linnhoff-Popien (LMU Munich) |
(Return
to Top) |
Auction-based
and Distributed Optimization Approaches for Scheduling Observations in
Satellite Constellations with Exclusive Orbit Portions (Page
1056)
Gauthier Picard (ONERA/DTIS, Université de Toulouse)
Trajectory
Coordination based on Distributed Constraint Optimization Techniques in
Unmanned Air Traffic Management (Page
1065)
Gauthier Picard (ONERA/DTIS, Université de Toulouse)
Learning
Heuristics for Combinatorial Assignment by Optimally Solving Subproblems (Page
1074)
Fredrik Präntare (Linköping University)
Herman Appelgren (Linköping University)
Mattias Tiger (Linköping University)
David Bergström (Linköping University)
Fredrik Heintz (Linköping University)
Evaluating
the Role of Interactivity on Improving Transparency in Autonomous Agents (Page
1083)
Peizhu Qian (Rice University)
Vaibhav Unhelkar (Rice University)
Revenue
and User Traffic Maximization in Mobile Short-Video Advertising (Page
1092)
Dezhi Ran (Peking University)
Weiqiang Zheng (Yale University)
Yunqi Li (Peking University)
Kaigui Bian (Peking University)
Jie Zhang (University of Southampton)
Xiaotie Deng (Peking University)
Automated
Configuration and Usage of Strategy Portfolios Mixed-Motive Bargaining (Page
1101)
Bram M. Renting (Leiden University & Delft University of Technology)
Holger H. Hoos (RWTH Aachen & Leiden University)
Catholijn M. Jonker (Delft University of Technology & Leiden University)
Pareto
Conditioned Networks (Page
1110)
Mathieu Reymond (Vrije Universiteit Brussel)
Eugenio Bargiacchi (Vrije Universiteit Brussel)
Ann Nowé (Vrije Universiteit Brussel) |
(Return
to Top) |
Testing
Requirements via User and System Stories in Agent Systems (Page
1119)
Sebastian Rodriguez (RMIT University)
John Thangarajah (RMIT University)
Michael Winikoff (Victoria University of Wellington)
Dhirendra Singh (RMIT University)
GCS:
Graph-Based Coordination Strategy for Multi-Agent Reinforcement Learning (Page
1128)
Jingqing Ruan (Institute of Automation, Chinese Academy of Sciences
& University of Chinese Academy of Sciences)
Yali Du (King's College London)
Xuantang Xiong (Institute of Automation, Chinese Academy of Sciences)
Dengpeng Xing (Institute of Automation, Chinese Academy of Sciences)
Xiyun Li (Institute of Automation, Chinese Academy of Sciences)
Linghui Meng (Institute of Automation, Chinese Academy of Sciences)
Haifeng Zhang (Institute of Automation, Chinese Academy of Sciences)
Jun Wang (University College London)
Bo Xu (Institute of Automation, Chinese Academy of Sciences)
REMAX:
Relational Representation for Multi-Agent Exploration (Page
1137)
Heechang Ryu (Samsung Research)
Hayong Shin (Korea Advanced Institute of Science and Technology)
Jinkyoo Park (Korea Advanced Institute of Science and Technology)
Decoupled
Reinforcement Learning to Stabilise Intrinsically-Motivated Exploration (Page
1146)
Lukas Schäfer (University of Edinburgh)
Filippos Christianos (University of Edinburgh)
Josiah P. Hanna (University of Wisconsin - Madison)
Stefano V. Albrecht (University of Edinburgh)
Group
Fairness in Bandits with Biased Feedback (Page
1155)
Candice Schumann (University of Maryland)
Zhi Lang (University of Maryland)
Nicholas Mattei (Tulane University)
John P. Dickerson (University of Maryland)
Sympathy-based
Reinforcement Learning Agents (Page
1164)
Manisha Senadeera (Deakin University)
Thommen George Karimpanal (Deakin University)
Sunil Gupta (Deakin University)
Santu Rana (Deakin University) |
(Return
to Top) |
Learning
Efficient Diverse Communication for Cooperative Heterogeneous Teaming (Page
1173)
Esmaeil Seraj (Georgia Institute of Technology)
Zheyuan Wang (Georgia Institute of Technology)
Rohan Paleja (Georgia Institute of Technology)
Daniel Martin (Georgia Institute of Technology)
Matthew Sklar (Georgia Institute of Technology)
Anirudh Patel (Sandia National Laboratory)
Matthew Gombolay (Georgia Institute of Technology)
Using
Deep Learning to Bootstrap Abstractions for Hierarchical Robot Planning (Page
1183)
Naman Shah (Arizona State University)
Siddharth Srivastava (Arizona State University)
ACuTE:
Automatic Curriculum Transfer from Simple to Complex Environments (Page
1192)
Yash Shukla (Tufts University)
Christopher Thierauf (Tufts University)
Ramtin Hosseini (Tufts University)
Gyan Tatiya (Tufts University)
Jivko Sinapov (Tufts University)
Anti-Malware
Sandbox Games (Page
1201)
Sujoy Sikdar (Binghamton University)
Sikai Ruan (Rensselaer Polytechnic Institute)
Qishen Han (Rensselaer Polytechnic Institute)
Paween Pitimanaaree (SCB Securities Co. Ltd.)
Jeremy Blackthorne (Boston Cybernetics Institute)
Bulent Yener (Rensselaer Polytechnic Institute)
Lirong Xia (Rensselaer Polytechnic Institute)
Properties
of Reputation Lag Attack Strategies (Page
1210)
S. Sirur (University of Oxford)
Tim Muller (University of Nottingham)
The
Generalized Magician Problem under Unknown Distributions and Related Applications (Page
1219)
Aravind Srinivasan (University of Maryland, College Park)
Pan Xu (New Jersey Institute of Technology)
Context-Aware
Modelling for Multi-Robot Systems Under Uncertainty (Page
1228)
Charlie Street (University of Oxford)
Bruno Lacerda (University of Oxford)
Michal Staniaszek (University of Oxford)
Manuel Mühlig (Honda Research Institute Europe GmbH)
Nick Hawes (University of Oxford) |
(Return
to Top) |
Off-Policy
Evolutionary Reinforcement Learning with Maximum Mutations (Page
1237)
Karush Suri (University of Toronto)
Justifying
Social-Choice Mechanism Outcome for Improving Participant Satisfaction (Page
1246)
Sharadhi Alape Suryanarayana (Bar-Ilan University)
David Sarne (Bar-Ilan University)
Sarit Kraus (Bar-Ilan University)
Descriptive
and Prescriptive Visual Guidance to Improve Shared Situational Awareness
in Human-Robot Teaming (Page
1256)
Aaquib Tabrez (University of Colorado Boulder)
Matthew B. Luebbers (University of Colorado Boulder)
Bradley Hayes (University of Colorado Boulder)
How
Hard is Bribery in Elections with Randomly Selected Voters (Page
1265)
Liangde Tao (Zhejiang University)
Lin Chen (Texas Tech University)
Lei Xu (University of Texas Rio Grande Valley)
Weidong Shi (University of Houston)
Ahmed Sunny (Texas Tech University)
Md Mahabub Uz Zaman (Texas Tech University)
Socially
Supervised Representation Learning: The Role of Subjectivity in Learning
Efficient Representations (Page
1274)
Julius Taylor (Inria & Université de Bordeaux)
Eleni Nisioti (Inria & Université de Bordeaux)
Clément Moulin-Frier (Inria & Université de Bordeaux)
Corruption
in Auctions: Social Welfare Loss in Hybrid Multi-Unit Auctions (Page
1283)
Andries van Beek (Tilburg University)
Ruben Brokkelkamp (Centrum Wiskunde & Informatica)
Guido Schäfer (Centrum Wiskunde, Informatica ILLC, & University of
Amsterdam)
Coaching
Agent: Making Recommendations for Behavior Change. A Case Study on Improving
Eating Habits (Page
1292)
Jules Vandeputte (UMR MIA-Paris, AgroParisTech, INRAe, Université Paris-Saclay)
Antoine Cornuéjols (UMR MIA-Paris, AgroParisTech, INRAe, Université
Paris-Saclay)
Nicolas Darcel (UMR PNCA, AgroParisTech, INRAe, Université Paris-Saclay)
Fabien Delaere (Danone Nutricia Research)
Christine Martin (UMR MIA-Paris, AgroParisTech, INRAe, Université Paris-Saclay) |
(Return
to Top) |
How
to Sense the World: Leveraging Hierarchy in Multimodal Perception
for Robust Reinforcement Learning Agents (Page
1301)
Miguel Vasco (INESC-ID & Universidade de Lisboa)
Hang Yin (KTH Royal Institute of Technology)
Francisco S. Melo (INESC-ID & Universidade de Lisboa)
Ana Paiva (INESC-ID & Universidade de Lisboa)
Controller
Synthesis for Omega-Regular and Steady-State Specifications (Page
1310)
Alvaro Velasquez (Air Force Research Laboratory)
Ismail Alkhouri (University of Central Florida)
Andre Beckus (Air Force Research Laboratory)
Ashutosh Trivedi (University of Colorado Boulder)
George Atia (University of Central Florida)
Graphical
Representation Enhances Human Compliance with Principles for Graded Argumentation
Semantics (Page 1319)
Srdjan Vesic (CNRS, Université d'Artois, CRIL)
Bruno Yun (University of Aberdeen)
Predrag Teovanovic (University of Belgrade)
Epistemic
Reasoning in Jason (Page
1328)
Michael Vezina (Carleton University)
Babak Esfandiari (Carleton University)
Robust
Learning from Observation with Model Misspecification (Page
1337)
Luca Viano (LIONS, EPFL)
Yu-Ting Huang (EPFL)
Parameswaran Kamalaruban (The Alan Turing Institute)
Craig Innes (The University of Edinburgh)
Subramanian Ramamoorthy (The University of Edinburgh)
Adrian Weller (University of Cambridge & The Alan Turing Institute)
Evaluating
Strategy Exploration in Empirical Game-Theoretic Analysis (Page
1346)
Yongzhao Wang (University of Michigan)
Qiurui Ma (Harvard University)
Michael P. Wellman (University of Michigan)
FCMNet:
Full Communication Memory Net for Team-Level Cooperation in Multi-Agent
Systems (Page 1355)
Yutong Wang (National University of Singapore)
Guillaume Sartoretti (National University of Singapore) |
(Return
to Top) |
Online
Collective Multiagent Planning by Offline Policy Reuse with Applications
to City-Scale Mobility-on-Demand Systems (Page
1364)
Wanyuan Wang (Southeast University)
Gerong Wu (Southeast University)
Weiwei Wu (Southeast University)
Yichuan Jiang (Southeast University)
Bo An (Nanyang Technological University)
Position-Based
Matching with Multi-Modal Preferences (Page
1373)
Yinghui Wen (Shandong University)
Aizhong Zhou (Ocean University of China)
Jiong Guo (Shandong University)
Empirical
Estimates on Hand Manipulation are Recoverable: A Step Towards Individualized
and Explainable Robotic Support in Everyday Activities (Page
1382)
Alexander Wich (University of Bremen)
Holger Schultheis (University of Bremen)
Michael Beetz (University of Bremen)
Agent-Temporal
Attention for Reward Redistribution in Episodic Multi-Agent Reinforcement
Learning (Page 1391)
Baicen Xiao (University of Washington)
Bhaskar Ramasubramanian (Western Washington University)
Radha Poovendran (University of Washington)
SIDE:
State Inference for Partially Observable Cooperative Multi-Agent Reinforcement
Learning (Page 1400)
Zhiwei Xu (Institute of Automation, Chinese Academy of Sciences & University
of Chinese Academy of Sciences)
Yunpeng Bai (Institute of Automation, Chinese Academy of Sciences &
University of Chinese Academy of Sciences)
Dapeng Li (Institute of Automation, Chinese Academy of Sciences & University
of Chinese Academy of Sciences)
Bin Zhang (Institute of Automation, Chinese Academy of Sciences & University
of Chinese Academy of Sciences)
Guoliang Fan (Institute of Automation, Chinese Academy of Sciences
& University of Chinese Academy of Sciences)
Spiking
Pitch Black: Poisoning an Unknown Environment to Attack Unknown Reinforcement
Learners (Page 1409)
Hang Xu (Nanyang Technological University)
Xinghua Qu (ByteDance AI Lab)
Zinovi Rabinovich (Nanyang Technological University)
Mis-spoke
or mis-lead: Achieving Robustness in Multi-Agent Communicative Reinforcement
Learning (Page 1418)
Wanqi Xue (Nanyang Technological University)
Wei Qiu (Nanyang Technological University)
Bo An (Nanyang Technological University)
Zinovi Rabinovich (Nanyang Technological University)
Svetlana Obraztsova (Nanyang Technological University)
Chai Kiat Yeo (Nanyang Technological University) |
(Return
to Top) |
Standby-Based
Deadlock Avoidance Method for Multi-Agent Pickup and Delivery Tasks (Page
1427)
Tomoki Yamauchi (Waseda University)
Yuki Miyashita (Waseda University)
Toshiharu Sugawara (Waseda University)
Adaptive
Incentive Design with Multi-Agent Meta-Gradient Reinforcement Learning (Page
1436)
Jiachen Yang (Georgia Institute of Technology)
Ethan Wang (Georgia Institute of Technology)
Rakshit Trivedi (Harvard University)
Tuo Zhao (Georgia Institute of Technology)
Hongyuan Zha (Chinese University of Hong Kong, Shenzhen)
Strategy-Proof
House Allocation with Existing Tenants over Social Networks (Page
1446)
Bo You (Kyushu University)
Ludwig Dierks (Kyushu University & University of Zurich)
Taiki Todo (Kyushu University)
Minming Li (City University of Hong Kong)
Makoto Yokoo (Kyushu University)
Segregation
in Social Networks of Heterogeneous Agents Acting under Incomplete Information (Page
1455)
D. Kai Zhang (Imperial College London)
Alexander Carver (Imperial College London)
Multi-Agent
Path Finding for Precedence-Constrained Goal Sequences (Page
1464)
Han Zhang (University of Southern California)
Jingkai Chen (Massachusetts Institute of Technology)
Jiaoyang Li (University of Southern California)
Brian C. Williams (Massachusetts Institute of Technology)
Sven Koenig (University of Southern California)
The
Competition and Inefficiency in Urban Road Last-Mile Delivery (Page
1473)
Keyang Zhang (Imperial College London)
Jose Javier Escribano Macias (Imperial College London)
Dario Paccagnan (Imperial College London)
Panagiotis Angeloudis (Imperial College London) |
(Return
to Top) |
Tracking
Truth by Weighting Proxies in Liquid Democracy (Page
1482)
Yuzhe Zhang (University of Groningen)
Davide Grossi (University of Groningen & University of Amsterdam)
A
Deeper Look at Discounting Mismatch in Actor-Critic Algorithms (Page
1491)
Shangtong Zhang (University of Oxford)
Romain Laroche (Microsoft Research Montreal)
Harm van Seijen (Microsoft Research Montreal)
Shimon Whiteson (University of Oxford)
Remi Tachet des Combes (Microsoft Research Montreal)
Centralized
Model and Exploration Policy for Multi-Agent RL (Page
1500)
Qizhen Zhang (University of Toronto & Vector Institute)
Chris Lu (University of Oxford)
Animesh Garg (University of Toronto, Vector Institute, & NVIDIA)
Jakob Foerster (University of Oxford)
Incentives
to Invite Others to Form Larger Coalitions (Page
1509)
Yao Zhang (ShanghaiTech University)
Dengji Zhao (ShanghaiTech University)
Extended
Abstracts
R-CHECK:
A Model Checker for Verifying Reconfigurable MAS (Page
1518)
Yehia Abd Alrahman (University of Gothenburg)
Shaun Azzopardi (University of Gothenburg)
Nir Piterman (University of Gothenburg)
RASS:
Risk-Aware Swarm Storage (Page
1521)
Samuel Arseneault (Polytechnique Montréal)
David Vielfaure (Polytechnique Montréal)
Giovanni Beltrame (Polytechnique Montréal)
Local
Advantage Networks for Cooperative Multi-Agent Reinforcement Learning (Page
1524)
Raphaël Avalos (Vrije Universiteit Brussel)
Mathieu Reymond (Vrije Universiteit Brussel)
Ann Nowé (Vrije Universiteit Brussel)
Diederik M. Roijers (Vrije Universiteit Brussel (BE) & HU University
of Applied Science Utrecht (NL)) |
(Return
to Top) |
Advising
Agent for Service-Providing Live-Chat Operators (Page
1527)
Aviram Aviv (Bar Ilan University)
Yaniv Oshrat (Bar Ilan University)
Samuel Assefa (US Bank AI Innovation)
Toby Mustapha (J.P. Morgan AI Research)
Daniel Borrajo (J.P. Morgan AI Research)
Manuela Veloso (J.P. Morgan AI Research)
Sarit Kraus (Bar Ilan University)
Status-quo
Policy Gradient in Multi-Agent Reinforcement Learning (Page
1530)
Pinkesh Badjatiya (Microsoft)
Mausoom Sarkar (Adobe)
Nikaash Puri (Adobe)
Jayakumar Subramanian (Adobe)
Abhishek Sinha (Waymo)
Siddharth Singh (University of Maryland)
Balaji Krishnamurthy (Adobe)
Deep
Learnable Strategy Templates for Multi-Issue Bilateral Negotiation (Page
1533)
Pallavi Bagga (Royal Holloway, University of London)
Nicola Paoletti (Royal Holloway, University of London)
Kostas Stathis (Royal Holloway, University of London)
Can
Algorithms be Explained Without Compromising Efficiency? The Benefits
of Detection and Imitation in Strategic Classification (Page
1536)
Flavia Barsotti (ING Analytics & University of Amsterdam)
Rüya Gökhan Koçer (ING Analytics)
Fernando P. Santos (University of Amsterdam)
A
New Porous Structure for Modular Robots (Page
1539)
Jad Bassil (University of Bourgogne Franche-Comté)
Benoît Piranda (University of Bourgogne Franche-Comté)
Abdallah Makhoul (University of Bourgogne Franche-Comté)
Julien Bourgeois (University of Bourgogne Franche-Comté)
On
the Average-Case Complexity of Predicting Round-Robin Tournaments (Page
1542)
Dorothea Baumeister (Heinrich-Heine-Universität Düsseldorf)
Tobias Hogrebe (Heinrich-Heine-Universität Düsseldorf) |
(Return
to Top) |
The
Evolutionary Dynamics of Soft-Max Policy Gradient in Multi-Agent Settings (Page
1545)
Martino Bernasconi (Politecnico di Milano)
Federico Cacciamani (Politecnico di Milano)
Simone Fioravanti (Gran Sasso Science Institute)
Nicola Gatti (Politecnico di Milano)
Francesco Trovò (Politecnico di Milano)
A
Refined Complexity Analysis of Fair Districting over Graphs (Page
1548)
Niclas Boehmer (TU Berlin)
Tomohiro Koana (TU Berlin)
Rolf Niedermeier (TU Berlin)
Contrastive
Explanations for Argumentation-Based Conclusions (Page
1551)
AnneMarie Borg (Utrecht University)
Floris Bex (Utrecht University & Tilburg University)
Voting
for Centrality (Page
1554)
Ulrik Brandes (ETH Zürich)
Christian Laußmann (Heinrich-Heine-University Düsseldorf)
Jörg Rothe (Heinrich-Heine-University Düsseldorf)
Solving
N-Player Dynamic Routing Games with Congestion: A Mean-Field Approach (Page
1557)
Theophile Cabannes (University of California, Berkeley & Google)
Mathieu Laurière (Google Research)
Julien Perolat (DeepMind)
Raphael Marinier (Google Research)
Sertan Girgin (Google Research)
Sarah Perrin (University Lille, CNRS, Inria, Centrale Lille, UMR 9189
CRIStAL)
Olivier Pietquin (Google Research)
Alexandre M. Bayen (University of California, Berkeley)
Eric Goubault (LIX, CNRS, Ecole Polytechnique, IPP)
Romuald Elie (DeepMind)
On
Fair and Efficient Solutions for Budget Apportionment (Page
1560)
Pierre Cardi (Université Paris-Dauphine, Université PSL, CNRS, LAMSADE)
Laurent Gourves (Université Paris-Dauphine, Université PSL, CNRS, LAMSADE)
Julien Lesca (Huawei Technologies) |
(Return
to Top) |
Optimal
Local Bayesian Differential Privacy over Markov Chains (Page
1563)
Darshan Chakrabarti (Carnegie Mellon University)
Jie Gao (Rutgers University)
Aditya Saraf (University of Washington)
Grant Schoenebeck (University of Michigan)
Fang-Yi Yu (Harvard University)
Augmented
Reality Visualizations using Imitation Learning for Collaborative Warehouse
Robots (Page 1566)
Kishan Chandan (SUNY Binghamton)
Jack Albertson (SUNY Binghamton)
Shiqi Zhang (SUNY Binghamton)
Multi-unit
Double Auctions: Equilibrium Analysis and Bidding Strategy using DDPG
in Smart-grids (Page
1569)
Sanjay Chandlekar (International Institute of Information Technology,
Hyderabad)
Easwar Subramanian (TCS Innovation Labs)
Sanjay Bhat (TCS Innovation Labs)
Praveen Paruchuri (International Institute of Information Technology,
Hyderabad)
Sujit Gujar (International Institute of Information Technology, Hyderabad)
Multi-agent
Covering Option Discovery through Kronecker Product of Factor Graphs (Page
1572)
Jiayu Chen (Purdue University)
Jingdi Chen (The George Washington University)
Tian Lan (The George Washington University)
Vaneet Aggarwal (Purdue University)
Priced
Gerrymandering (Page
1575)
Palash Dey (Indian Institute of Technology)
Behavior
Exploration and Team Balancing for Heterogeneous Multiagent Coordination (Page
1578)
Gaurav Dixit (Oregon State University)
Kagan Tumer (Oregon State University) |
(Return
to Top) |
Multi-Agent
Adversarial Attacks for Multi-Channel Communications (Page
1580)
Juncheng Dong (Duke University)
Suya Wu (Duke University)
Mohammadreza Soltani (Duke University)
Vahid Tarokh (Duke University)
Rawlsian
Fairness in Online Bipartite Matching: Two-sided, Group, and Individual (Page
1583)
Seyed A. Esmaeili (University of Maryland, College Park)
Sharmila Duppala (University of Maryland, College Park)
Vedant Nanda (University of Maryland, College Park)
Aravind Srinivasan (University of Maryland, College Park)
John P. Dickerson (University of Maryland, College Park)
Approaching
the Overbidding Puzzle in All-Pay Auctions: Explaining Human Behavior
through Bayesian Optimization and Equilibrium Learning (Page
1586)
Markus Ewert (Technical University of Munich)
Stefan Heidekrüger (Technical University of Munich)
Martin Bichler (Technical University of Munich)
Safety
Shields, an Automated Failure Handling Mechanism for BDI Agents (Page
1589)
Angelo Ferrando (University of Genova)
Rafael C. Cardoso (The University of Manchester)
Beyond
Uninformed Search: Improving Branch-and-bound Based Acceleration Algorithms
for Belief Propagation via Heuristic Strategies (Page
1592)
Junsong Gao (Chongqing University)
Ziyu Chen (Chongqing University)
Dingding Chen (Chongqing University)
Wenxin Zhang (Chongqing University)
Stable
Matching Games (Page
1595)
Felipe Garrido-Lucero (LAMSADE (CNRS, UMR 7243), Université Paris Dauphine)
Rida Laraki (LAMSADE (CNRS, UMR 7243), Université Paris Dauphine &
University of Liverpool)
An
Anytime Heuristic Algorithm for Allocating Many Teams to Many Tasks (Page
1598)
Athina Georgara (Artificial Intelligence Research Institute (IIIA-CSIC)
& & Enzyme Advising Group)
Juan A. Rodríguez-Aguilar (Artificial Intelligence Research Institute
(IIIA-CSIC))
Carles Sierra (Artificial Intelligence Research Institute (IIIA-CSIC))
Ornella Mich (Fondazione Bruno Kessler (FBK))
Raman Kazhamiakin (Fondazione Bruno Kessler (FBK))
Alessio Palmero Aprosio (Fondazione Bruno Kessler (FBK))
Jean-Christophe Pazzaglia (SAP) |
(Return
to Top) |
Influencing
Emergent Self-Assembled Structures in Robotic Collectives Through Traffic
Control (Page 1601)
Everardo Gonzalez (Oregon State University)
Lucie Houel (Ecole Polytechnique Fédérale de Lausanne)
Radhika Nagpal (Harvard University)
Melinda Malley (Olin College of Engineering)
Minimizing
Robot Navigation Graph for Position-Based Predictability by Humans (Page
1604)
Sriram Gopalakrishnan (Arizona State University)
Subbarao Kambhampati (Arizona State University)
A
Graph Neural Network Reasoner for Game Description Language (Page
1607)
Alvaro Gunawan (Auckland University of Technology)
Ji Ruan (Auckland University of Technology)
Xiaowei Huang (University of Liverpool)
Adaptive
Aggregation Weight Assignment for Federated Learning: A Deep Reinforcement
Learning Approach (Page
1610)
Enwei Guo (South China University of Technology)
Xiumin Wang (South China University of Technology)
Weiwei Wu (Southeast University)
Proof-of-Work
as a Stigmergic Consensus Algorithm (Page
1613)
Önder Gürcan (Université Paris-Saclay, CEA, List)
Capacitated
Network Design Games on a Generalized Fair Allocation Model (Page
1616)
Tesshu Hanaka (Nagoya University)
Toshiyuki Hirose (KDDI Corporation)
Hirotaka Ono (Nagoya University) |
(Return
to Top) |
Multi-agent
Task Allocation for Fruit Picker Team Formation (Page
1618)
Helen Harman (University of Lincoln)
Elizabeth I. Sklar (University of Lincoln)
Decision-Theoretic
Planning for the Expected Scalarised Returns (Page
1621)
Conor F. Hayes (National University of Ireland Galway)
Diederik M. Roijers (Vrije Universiteit Brussel & & HU University of
Applied Science Utrecht)
Enda Howley (National University of Ireland Galway)
Patrick Mannion (National University of Ireland Galway)
Implementation
of Actual Data for Artificial Market Simulation (Page
1624)
Masanori Hirano (The University of Tokyo)
Kiyoshi Izumi (The University of Tokyo)
Hiroki Sakaji (The University of Tokyo)
Intelligent
Communication over Realistic Wireless Networks in Multi-Agent Cooperative
Games (Page 1627)
Diyi Hu (University of Southern California)
Chi Zhang (University of Southern California)
Viktor Prasanna (University of Southern California)
Bhaskar Krishnamachari (University of Southern California)
Multiagent
Q-learning with Sub-Team Coordination (Page
1630)
Wenhan Huang (Shanghai Jiao Tong University)
Kai Li (Shanghai Jiao Tong University)
Kun Shao (Huawei Noah's Ark Lab)
Tianze Zhou (Beijing Institute of Technology)
Jun Luo (Huawei Noah's Ark Lab)
Dongge Wang (EPFL)
Hangyu Mao (Huawei Noah's Ark Lab)
Jianye Hao (Huawei Noah's Ark Lab)
Jun Wang (University College London)
Xiaotie Deng (Peking University)
Guaranteeing
Half-Maximin Shares Under Cardinality Constraints (Page
1633)
Halvard Hummel (Norwegian University of Science and Technology)
Magnus Lie Hetland (Norwegian University of Science and Technology) |
(Return
to Top) |
Argumentative
Forecasting (Page
1636)
Benjamin Irwin (Imperial College London)
Antonio Rago (Imperial College London)
Francesca Toni (Imperial College London)
Data-driven
Agent-based Models for Optimal Evacuation of Large Metropolitan Areas
for Improved Disaster Planning (Page
1639)
Kazi Ashik Islam (Biocomplexity Institute and Initiative & University
of Virginia)
Madhav Marathe (Biocomplexity Institute and Initiative & University
of Virginia)
Henning Mortveit (Biocomplexity Institute and Initiative & University
of Virginia)
Samarth Swarup (Biocomplexity Institute and Initiative & University
of Virginia)
Anil Vullikanti (Biocomplexity Institute and Initiative & University
of Virginia)
Near-Optimal
Reviewer Splitting in Two-Phase Paper Reviewing and Conference Experiment
Design (Page 1642)
Steven Jecmen (Carnegie Mellon University)
Hanrui Zhang (Carnegie Mellon University)
Ryan Liu (Carnegie Mellon University)
Fei Fang (Carnegie Mellon University)
Vincent Conitzer (Duke University)
Nihar B. Shah (Carnegie Mellon University)
Learning
to Advise and Learning from Advice in Cooperative Multiagent Reinforcement
Learning (Page 1645)
Yue Jin (Tsinghua University)
Shuangqing Wei (Louisiana State University)
Jian Yuan (Tsinghua University)
Xudong Zhang (Tsinghua University)
REFORM:
Reputation Based Fair and Temporal Reward Framework for Crowdsourcing (Page
1648)
Samhita Kanaparthy (International Institute of Information Technology,
Hyderabad)
Sankarshan Damle (International Institute of Information Technology,
Hyderabad)
Sujit Gujar (International Institute of Information Technology, Hyderabad)
Forgiving
Debt in Financial Network Games (Page
1651)
Panagiotis Kanellopoulos (University of Essex)
Maria Kyropoulou (University of Essex)
Hao Zhou (University of Essex) |
(Return
to Top) |
How
to Train Your Agent: Active Learning from Human Preferences and Justifications
in Safety-critical Environments (Page
1654)
Ilias Kazantzidis (University of Southampton)
Timothy J. Norman (University of Southampton)
Yali Du (King's College London)
Christopher T. Freeman (University of Southampton)
Popularity
and Strict Popularity in Altruistic Hedonic Games and Minimum-Based Altruistic
Hedonic Games (Page
1657)
Anna Maria Kerkmann (Heinrich-Heine-Universität Düsseldorf)
Jörg Rothe (Heinrich-Heine-Universität Düsseldorf)
Minimizing
Expected Intrusion Detection Time in Adversarial Patrolling (Page
1660)
David Klaška (Masaryk University)
Antonín Kučera (Masaryk University)
Vit Musil (Masaryk University)
Vojtěch Řehák (Masaryk University)
Learning
Generalizable Multi-Lane Mixed-Autonomy Behaviors in Single Lane Representations
of Traffic (Page
1663)
Abdul Rahman Kreidieh (University of California, Berkeley)
Yibo Zhao (University of California, Berkeley)
Samyak Parajuli (University of California, Berkeley)
Alexandre M. Bayen (University of California, Berkeley)
Measuring
Resilience in Collective Robotic Algorithms (Page
1666)
Jennifer Leaf (Oregon State University)
Julie A. Adams (Oregon State University)
Automated
Story Sifting Using Story Arcs (Page
1669)
Wilkins Leong (RMIT University)
Julie Porteous (RMIT University)
John Thangarajah (RMIT University) |
(Return
to Top) |
Theoretical
Models and Preliminary Results for Contact Tracing and Isolation (Page
1672)
George Z. Li (University of Maryland)
Arash Haddadan (University of Virginia)
Ann Li (University of Virginia)
Madhav V. Marathe (University of Virginia)
Aravind Srinivasan (University of Maryland)
Anil Vullikanti (University of Virginia)
Zeyu Zhao (University of Maryland)
Improving
Generalization with Cross-State Behavior Matching in Deep Reinforcement
Learning (Page 1675)
Guan-Ting Liu (National Taiwan University)
Guan-Yu Lin (National Taiwan University)
Pu-Jen Cheng (National Taiwan University)
(Almost)
Envy-Free, Proportional and Efficient Allocations of an Indivisible Mixed
Manna (Page 1678)
Vasilis Livanos (University of Illinois at Urbana-Champaign)
Ruta Mehta (University of Illinois at Urbana-Champaign)
Aniket Murhekar (University of Illinois at Urbana-Champaign)
Modeling
Affective Reaction in Multi-agent Systems (Page
1681)
Jieting Luo (Zhejiang University)
Mehdi Dastani (Utrecht University)
Multimodal
Reinforcement Learning with Effective State Representation Learning (Page
1684)
Jinming Ma (University of Science and Technology of China)
Yingfeng Chen (Netease Fuxi AI Lab)
Feng Wu (University of Science and Technology of China)
Xianpeng Ji (Netease Fuxi AI Lab)
Yu Ding (Netease Fuxi AI Lab)
Group-level
Fairness Maximization in Online Bipartite Matching (Page
1687)
Will Ma (Columbia University)
Pan Xu (New Jersey Institute of Technology)
Yifan Xu (Southeast University)
A
Simulation Based Online Planning Algorithm for Multi-Agent Cooperative
Environments (Page
1690)
Rafid Ameer Mahmud (University of Dhaka)
Fahim Faisal (University of Dhaka)
Saaduddin Mahmud (University of Massachusetts, Amherst)
Md. Mosaddek Khan (University of Dhaka) |
(Return
to Top) |
Parameterized
Algorithms for Kidney Exchange (Page
1693)
Arnab Maiti (Indian Institute of Technology Kharagpur)
Palash Dey (Indian Institute of Technology Kharagpur)
Active
Generation of Logical Rules for POMCP Shielding (Page
1696)
Giulio Mazzi (Università degli Studi di Verona)
Alberto Castellini (Università degli Studi di Verona)
Alessandro Farinelli (Università degli Studi di Verona)
Reinforcement
Learning for Traffic Signal Control Optimization: A Concept for Real-World
Implementation (Page
1699)
Henri Meess (Fraunhofer IVI)
Jeremias Gerner (Technische Hochschule Ingolstadt)
Daniel Hein (GEVAS software GmbH)
Stefanie Schmidtner (Technische Hochschule Ingolstadt)
Gordon Elger (Fraunhofer IVI)
Towards
Assume-Guarantee Verification of Strategic Ability (Page
1702)
Łukasz Mikulski (Nicolaus Copernicus University & Institute of Computer
Science, Polish Academy of Sciences)
Wojciech Jamroga (Institute of Computer Science, Polish Academy of
Sciences & University of Luxembourg)
Damian Kurpiewski (Institute of Computer Science Polish Academy of
Sciences & Nicolaus Copernicus University)
On
Achieving Leximin Fairness and Stability in Many-to-One Matchings (Page
1705)
Shivika Narang (Indian Institute of Science)
Arpita Biswas (Harvard University)
Yadati Narahari (Indian Institute of Science)
Towards
an Enthymeme-Based Communication Framework (Page
1708)
Alison R. Panisson (Universidade Federal de Santa Catarina)
Peter McBurney (King's College London)
Rafael H. Bordini (Pontifical Catholic University of Rio Grande do
Sul) |
(Return
to Top) |
I
Will Have Order! Optimizing Orders for Fair Reviewer Assignment (Page
1711)
Justin Payan (University of Massachusetts, Amherst)
Yair Zick (University of Massachusetts, Amherst)
Concise
Representations and Complexity of Combinatorial Assignment Problems (Page
1714)
Fredrik Präntare (Linköping University)
George Osipov (Linköping University)
Leif Eriksson (Linköping University)
A
Stit Logic of Responsibility (Page
1717)
Aldo Iván Ramírez Abarca (Utrecht University)
Jan Broersen (Utrecht University)
Behavior
vs Appearance: What Type of Adaptations are More Socially Motivated? (Page
1720)
Diogo Rato (INESC-ID & Universidade de Lisboa)
Marta Couto (INESC-ID)
Rui Prada (INESC-ID & Universidade de Lisboa)
Agent-Time
Attention for Sparse Rewards Multi-Agent Reinforcement Learning (Page
1723)
Jennifer She (Stanford University)
Jayesh K. Gupta (Microsoft)
Mykel J. Kochenderfer (Stanford University)
Environment
Guided Interactive Reinforcement Learning: Learning from Binary Feedback
in High-Dimensional Robot Task Environments (Page
1726)
Isaac Sheidlower (Tufts University)
Elaine Schaertl Short (Tufts University)
Allison Moore (Tufts University)
Pre-trained
Language Models as Prior Knowledge for Playing Text-based Games (Page
1729)
Ishika Singh (Indian Institute of Technology Kanpur)
Gargi Singh (Indian Institute of Technology Kanpur)
Ashutosh Modi (Indian Institute of Technology Kanpur) |
(Return
to Top) |
Resource-Aware
Adaptation of Heterogeneous Strategies for Coalition Formation (Page
1732)
Anusha Srikanthan (University of Pennsylvania)
Harish Ravichandar (Georgia Institute of Technology)
Speeding
up Deep Reinforcement Learning through Influence-Augmented Local Simulators (Page
1735)
Miguel Suau (Delft University of Technology)
Jinke He (Delft University of Technology)
Matthijs T. J. Spaan (Delft University of Technology)
Frans A. Oliehoek (Delft University of Technology)
Maximizing
Resource Allocation Likelihood with Minimum Compromise (Page
1738)
Yohai Trabelsi (Bar-Ilan University)
Abhijin Adiga (University of Virginia)
Sarit Kraus (Bar Ilan University)
S. S. Ravi (University of Virginia & University of Albany - SUNY)
Max-sum
with Quadtrees for Continuous DCOPs with Application to Lane-Free Autonomous
Driving (Page 1741)
Dimitrios Troullinos (Technical University of Crete)
Georgios Chalkiadakis (Technical University of Crete)
Vasilis Samoladas (Technical University of Crete)
Markos Papageorgiou (Technical University of Crete)
Autonomous
Flight Arcade Challenge: Single- and Multi-Agent Learning Environments
for Aerial Vehicles (Page
1744)
Paul Tylkin (Massachusetts Institute of Technology)
Tsun-Hsuan Wang (Massachusetts Institute of Technology)
Tim Seyde (Massachusetts Institute of Technology)
Kyle Palko (U.S. Air Force Artificial Intelligence Accelerator)
Ross Allen (Massachusetts Institute of Technology)
Alexander Amini (Massachusetts Institute of Technology)
Daniela Rus (Massachusetts Institute of Technology)
Non-Parametric
Neuro-Adaptive Coordination of Multi-Agent Systems (Page
1747)
Christos K. Verginis (University of Texas at Austin)
Zhe Xu (Arizona State University)
Ufuk Topcu (University of Texas at Austin)
Moving
Target Defense under Uncertainty for Web Applications (Page
1750)
Vignesh Viswanathan (University of Massachusetts, Amherst)
Megha Bose (International Institute of Information Technology, Hyderabad)
Praveen Paruchuri (International Institute of Information Technology,
Hyderabad) |
(Return
to Top) |
The
Ethical Acceptability of Artificial Social Agents (Page
1753)
Ravi Vythilingam (Macquarie University)
Deborah Richards (Macquarie University)
Paul Formosa (Macquarie University)
Near
On-Policy Experience Sampling in Multi-Objective Reinforcement Learning (Page
1756)
Shang Wang (University of Washington)
Mathieu Reymond (Vrije Universiteit Brussel)
Athirai A. Irissappane (University of Washington)
Diederik M. Roijers (Vrije Universiteit Brussel & HU University of
Applied Sciences Utrecht)
On
Agent Incentives to Manipulate Human Feedback in Multi-Agent Reward Learning
Scenarios (Page 1759)
Francis Rhys Ward (Imperial College London)
Francesca Toni (Imperial College London)
Francesco Belardinelli (Imperial College London)
How
to Train PointGoal Navigation Agents on a (Sample and Compute) Budget (Page
1762)
Erik Wijmans (Georgia Institute of Technology & Facebook AI Research)
Irfan Essa (Georgia Institute of Technology & Google Atlanta)
Dhruv Batra (Georgia Institute of Technology & Facebook AI Research)
Performance
of Deep Reinforcement Learning for High Frequency Market Making on Actual
Tick Data (Page 1765)
Ziyi Xu (Peking University)
Xue Cheng (Peking University)
Yangbo He (Peking University)
On
the Complexity of Controlling Amendment and Successive Winners (Page
1768)
Yongjie Yang (Saarland University)
On-the-fly
Strategy Adaptation for ad-hoc Agent Coordination (Page
1771)
Jaleh Zand (University of Oxford)
Jack Parker-Holder (University of Oxford)
Stephen J. Roberts (University of Oxford) |
(Return
to Top) |
Off-Policy
Correction For Multi-Agent Reinforcement Learning (Page
1774)
Michał Zawalski (University of Warsaw)
Błażej Osiński (University of Warsaw)
Henryk Michalewski (Google Research)
Piotr Miłoś (Polish Academy of Sciences)
An
Agent-based Model for Emergency Evacuation from a Multi-floor Building (Page
1777)
Xiaoyan Zhang (Newcastle University)
Graham Coates (Newcastle University)
Sarah Dunn (Newcastle University)
Jean Hall (Newcastle University)
Irrational
Behaviour and Globalisation (Page
1780)
Yuanzi Zhu (King's College London)
Carmine Ventre (King's College London)
Blue
Sky Ideas Track
Robots
Teaching Humans: A New Communication Paradigm via Reverse Teleoperation (Page
1783)
Rika Antonova (Stanford University)
Ankur Handa (NVIDIA)
Social
Choice Around the Block: On the Computational Social Choice of Blockchain (Page
1788)
Davide Grossi (University of Groningen & University of Amsterdam)
Augmented
Democratic Deliberation: Can Conversational Agents Boost Deliberation
in Social Media? (Page
1794)
Rafik Hadfi (Kyoto University)
Takayuki Ito (Kyoto University)
Towards
Anomaly Detection in Reinforcement Learning (Page
1799)
Robert Müller (LMU Munich)
Steffen Illium (LMU Munich)
Thomy Phan (LMU Munich)
Tom Haider (Fraunhofer IKS)
Claudia Linnhoff-Popien (LMU Munich) |
(Return
to Top) |
The
Holy Grail of Multi-Robot Planning: Learning to Generate Online-Scalable
Solutions from Offline-Optimal Experts (Page
1804)
Amanda Prorok (University of Cambridge)
Jan Blumenkamp (University of Cambridge)
Qingbiao Li (University of Cambridge)
Ryan Kortvelesy (University of Cambridge)
Zhe Liu (University of Cambridge)
Ethan Stump (DEVCOM Army Research Laboratory)
"Go
to the Children": Rethinking Intelligent Agent Design and Programming
in a Developmental Learning Perspective (Page
1809)
Alessandro Ricci (Università di Bologna)
Foundations
for Grassroots Democratic Metaverse (Page
1814)
Ehud Shapiro (Weizmann Institute of Science & Columbia University)
Nimrod Talmon (Ben-Gurion University)
Agent-Assisted
Life-Long Education and Learning (Page
1819)
Tomas Trescak (Western Sydney University)
Roger Lera-Leri (Artificial Intelligence Research Institute (IIIA-CSIC))
Filippo Bistaffa (Artificial Intelligence Research Institute (IIIA-CSIC))
Juan A. Rodriguez-Aguilar (Artificial Intelligence Research Institute
(IIIA-CSIC))
Macro
Ethics for Governing Equitable Sociotechnical Systems (Page
1824)
Jessica Woodgate (The University of Bristol)
Nirav Ajmeri (The University of Bristol)
Doctoral
Consortium
Exploration
and Communication for Partially Observable Collaborative Multi-Agent Reinforcement
Learning (Page 1829)
Raphaël Avalos (Vrije Universiteit Brussel) |
(Return
to Top) |
Manipulation
of Machine Learning Algoirhtms (Page
1833)
Nicholas Bishop (University of Southampton)
Collaborative
Training of Multiple Autonomous Agents (Page
1836)
Filippos Christianos (University of Edinburgh)
Towards
Multi-Agent Interactive Reinforcement Learning for Opportunistic Software
Composition in Ambient Environments (Page
1839)
Kevin Delcourt (IRIT, Université de Toulouse, CNRS, Toulouse INP, UT3)
Online
Learning against Strategic Adversary (Page
1841)
Le Cong Dinh (University of Southampton)
Non-Cooperative
Multi-Robot Planning Under Shared Resources (Page
1843)
Anna Gautier (University of Oxford)
Incentive
Design for Equitable Resource Allocation: Artificial Currencies and Allocation
Constraints (Page
1846)
Devansh Jalota (Stanford University) |
(Return
to Top) |
Model-free
and Model-based Reinforcement Learning, the Intersection of Learning and
Planning (Page 1849)
Piotr Januszewski (Gdańsk University of Technology)
Data-driven
Approaches for Formal Synthesis of Dynamical Systems (Page
1852)
Milad Kazemi (Newcastle University)
Budget
Feasible Mechanisms in Auction Markets: Truthfulness, Diffusion and Fairness (Page
1854)
Xiang Liu (Southeast University)
Fair
Allocation Problems in Reviewer Assignment (Page
1857)
Justin Payan (University of Massachusetts, Amherst)
Designing
Mechanisms for Participatory Budgeting (Page
1860)
Simon Rey (University of Amsterdam)
Task
Generalisation in Multi-Agent Reinforcement Learning (Page
1863)
Lukas Schäfer (University of Edinburgh)
Empathetic
Reinforcement Learning Agents (Page
1866)
Manisha Senadeera (Deakin University) |
(Return
to Top) |
Embodied
Team Intelligence in Multi-Robot Systems (Page
1869)
Esmaeil Seraj (Georgia Institute of Technology)
The
Reputation Lag Attack (Page
1872)
Sean Sirur (University of Oxford)
Using
Multi-objective Optimization to Generate Timely Responsive BDI Agents (Page
1875)
Márcio Fernando Stabile Junior (Universidade de São Paulo)
Engineering
Normative and Cognitive Agents with Emotions and Values (Page
1878)
Sz-Ting Tzeng (North Carolina State University)
The
Coaching Scenario: Recommender Systems with a Long Term Goal. A Case Study
in Changing Dietary Habits (Page
1881)
Jules Vandeputte (UMR MIA-Paris, AgroParisTech, INRAe, Université Paris-Saclay)
Transferable
Environment Poisoning: Training-time Attack on Reinforcement Learner with
Limited Prior Knowledge (Page
1884)
Hang Xu (Nanyang Technological University)
Demonstration
Track |
(Return
to Top) |
Chameleon
- A Framework for Developing Conversational Agents for Medical Training
Purposes (Page 1887)
Al-Hussein Abutaleb (University of Aberdeen)
Bruno Yun (University of Aberdeen)
An
Agent-Based Simulator for Maritime Transport Decarbonisation (Page
1890)
Jan Buermann (University of Southampton)
Dimitar Georgiev (University of Southampton)
Enrico H. Gerding (University of Southampton)
Lewis Hill (University of Southampton)
Obaid Malik (University of Southampton)
Alexandru Pop (University of Southampton)
Matthew Pun (Shell Shipping & Maritime)
Sarvapali D. Ramchurn (University of Southampton)
Elliot Salisbury (University of Southampton)
Ivan Stojanovic (Shell Shipping & Maritime)
AdLeap-MAS:
An Open-source Multi-Agent Simulator for Ad-hoc Reasoning (Page
1893)
Matheus Aparecido do Carmo Alves (Lancaster University)
Amokh Varma (Indian Institute of Technology)
Yehia Elkhatib (University of Glasgow)
Leandro Soriano Marcolino (Lancaster University)
KnowLedger
- A Multi-Agent System Blockchain for Smart Cities Data (Page
1896)
Bruno Fernandes (University of Minho)
André Diogo (University of Minho)
Fábio Silva (Polytechnic Institute of Porto)
José Neves (University of Minho)
Cesar Analide (University of Minho)
A
Multi-Agent System for Automated Machine Learning (Page
1899)
Bruno Fernandes (University of Minho)
Paulo Novais (University of Minho)
Cesar Analide (University of Minho)
Demonstrating
the Rapid Integration & Development Environment (RIDE): Embodied Conversational
Agent (ECA) and Multiagent Capabilities (Page
1902)
Arno Hartholt (University of Southern California Institute for Creative
Technologies)
Ed Fast (University of Southern California Institute for Creative Technologies)
Andrew Leeds (University of Southern California Institute for Creative
Technologies)
Kevin Kim (University of Southern California Institute for Creative
Technologies)
Andrew Gordon (University of Southern California Institute for Creative
Technologies)
Kyle McCullough (University of Southern California Institute for Creative
Technologies)
Volkan Ustun (University of Southern California Institute for Creative
Technologies)
Sharon Mozgai (University of Southern California Institute for Creative
Technologies)
SIERRA:
A Modular Framework for Research Automation (Page
1905)
John Harwell (University of Minnesota)
London Lowmanstone (University of Minnesota)
Maria Gini (University of Minnesota) |
(Return
to Top) |
Cellulan
World: Interactive Platform to Learn Swarm Behaviors (Page
1908)
Hala Khodr (Swiss Federal Institute of Technology (EPFL))
Barbara Bruno (Swiss Federal Institute of Technology (EPFL))
Aditi Kothiyal (Swiss Federal Institute of Technology (EPFL))
Pierre Dillenbourg (Swiss Federal Institute of Technology (EPFL))
Ev-IDID:
Enhancing Solutions to Interactive Dynamic Influence Diagrams through
Evolutionary Algorithms (Page
1911)
Biyang Ma (Minnan Normal University)
Yinghui Pan (Shenzhen University)
Yifeng Zeng (Northumbria University)
Zhong Ming (Shenzhen University)
LBfT:
Learning Bayesian Network Structures from Text in Autonomous Typhoon Response
Systems (Page 1914)
Yinghui Pan (Shenzhen University)
Junhan Chen (Xiamen University)
Yifeng Zeng (Northumbria University)
Zhangrui Yao (Xiamen University)
Qianwen Li (Shenzhen University)
Biyang Ma (Northumbria University)
Yi Ji (Shenzhen University)
Zhong Ming (Shenzhen University)
JEDAI:
A System for Skill-Aligned Explainable Robot Planning (Page
1917)
Naman Shah (Arizona State University)
Pulkit Verma (Arizona State University)
Trevor Angle (Arizona State University)
Siddharth Srivastava (Arizona State University)
JAAMAS
Track
Reaching
Consensus Under a Deadline (Page
1920)
Marina Bánnikova (Universidad Autónoma de Barcelona)
Lihi Dery (Ariel University)
Svetlana Obraztsova (Nanyang Technological University)
Zinovi Rabinovich (Nanyang Technological University)
Jeffrey S. Rosenschein (The Hebrew University of Jerusalem)
Goal-Driven
Active Learning (Page
1923)
Nicolas Bougie (The Graduate University for Advanced Studies (Sokendai)
& National Institute of Informatics)
Ryutaro Ichise (National Institute of Informatics & The Graduate University
for Advanced Studies (Sokendai))
Combining
Quantitative and Qualitative Reasoning in Concurrent Multi-player Games (Page
1926)
Nils Bulling (Clausthal University of Technology)
Valentin Goranko (Stockholm University) |
(Return
to Top) |
Voting
with Random Classifiers (VORACE): Theoretical and Experimental Analysis (Page
1929)
Cristina Cornelio (Samsung AI)
Michele Donini (Amazon)
Andrea Loreggia (University of Brescia)
Maria Silvia Pini (University of Padova)
Francesca Rossi (IBM Research)
Enabling
BDI Group Plans with Coordination Middleware: Semantics and Implementation (Page
1932)
Stephen Cranefield (University of Otago)
GDL
as a Unifying Domain Description Language for Declarative Automated Negotiation (Page
1935)
Dave de Jonge (IIIA-CSIC)
Dongmo Zhang (Western Sydney University)
Designing
Efficient and Fair Mechanisms for Multi-Type Resource Allocation (Page
1938)
Xiaoxi Guo (Peking University)
Sujoy Sikdar (Binghamton University)
Haibin Wang (Peking University)
Lirong Xia (Rensselaer Polytechnic Institute)
Yongzhi Cao (Peking University)
Hanpin Wang (Guangzhou University & Peking University)
Automatic
Calibration Framework of Agent-based Models for Dynamic and Heterogeneous
Parameters (Page
1941)
Dongjun Kim (Korea Advanced Institute of Science and Technology)
Tae-Sub Yun (Korea Advanced Institute of Science and Technology)
Il-Chul Moon (Korea Advanced Institute of Science and Technology)
Jang Won Bae (Korea University of Technology and Education)
Trust
Repair in Human-Agent Teams: The Effectiveness of Explanations and Expressing
Regret (Page 1944)
E.S. Kox (TNO)
J.H. Kerstholt (TNO)
T.F. Hueting (TNO)
P.W. de Vries (University of Twente) |
(Return
to Top) |
Concurrent
Negotiations with Global Utility Functions (Page
1947)
Yasser Mohammad (NEC Corporation & National Institute of Advanced Industrial
Science and Technology)
Shinji Nakadai (NEC Corporation & National Institute of Advanced Industrial
Science and Technology)
Towards
Addressing Dynamic Multi-agent Task Allocation in Law Enforcement (Page
1950)
Itshak Tkach (London University)
Sofia Amador Nelke (Holon Institute of Technology) |
|
|