|
|
Table
of Contents
AAMAS'24
Chairs Welcome
Conference
Organization
Area
Chairs
Senior
Programme Committee
Programme
Committee
Auxiliary
Reviewers
Special
Track Reviewers
Awards
Sponsors
& Supporters
|
|
|
(Return
to Top) |
Keynote
Talks
Trustworthy
Reinforcement Learning: Opportunities and Challenges (Page
1)
Ann Nowé (Vrije Universiteit Brussel)
Agents
and Humans: Trajectories and Perspectives (Page
2)
Liz Sonenberg (The University of Melbourne)
30
Years of Engineering Multi-Agent Systems: What and Why? (Page
3)
Michael Winikoff (Victoria University of Wellington) |
(Return
to Top) |
Full
Research Papers
Team
Performance and User Satisfaction in Mixed Human-Agent Teams (Page
4)
Sami Abuhaimed (The University of Tulsa)
Sandip Sen (The University of Tulsa)
Value-based
Resource Matching with Fairness Criteria: Application to Agricultural
Water Trading (Page
13)
Abhijin Adiga (University of Virginia)
Yohai Trabelsi (Bar-Ilan University)
Tanvir Ferdousi (University of Virginia)
Madhav Marathe (University of Virginia)
S. S. Ravi (University of Virginia)
Samarth Swarup (University of Virginia)
Anil Kumar Vullikanti (University of Virginia)
Mandy L. Wilson (University of Virginia)
Sarit Kraus (Bar Ilan University)
Reetwika Basu (Washington State University)
Supriya Savalkar (Washington State University)
Matthew Yourek (Washington State University)
Michael Brady (Washington State University)
Kirti Rajagopalan (Washington State University)
Jonathan Yoder (Washington State University)
Can
Poverty Be Reduced by Acting on Discrimination? An Agent-based Model for
Policy Making (Page
22)
Alba Aguilera (Artificial Intelligence Research Institute, IIIA-CSIC)
Nieves Montes (Artificial Intelligence Research Institute, IIIA-CSIC)
Georgina Curto (University of Notre Dame)
Carles Sierra (Artificial Intelligence Research Institute, IIIA-CSIC)
Nardine Osman (Artificial Intelligence Research Institute, IIIA-CSIC)
Provably
Learning Nash Policies in Constrained Markov Potential Games (Page
31)
Pragnya Alatur (Department of Computer Science, ETH AI Center)
Giorgia Ramponi (Department of Computer Science, University of Zurich)
Niao He (Department of Computer Science, ETH Zurich)
Andreas Krause (Department of Computer Science, ETH Zurich)
Beliefs,
Shocks, and the Emergence of Roles in Asset Markets: An Agent-Based Modeling
Approach (Page 40)
Evan Albers (Khoury College of Computer Sciences, Northeastern University)
Mohammad T. Irfan (Department of Computer Science, Bowdoin College)
Matthew J. Botsch (Department of Economics, Bowdoin College)
On
the Potential and Limitations of Proxy Voting: Delegation with Incomplete
Votes (Page 49)
Georgios Amanatidis (University of Essex)
Aris Filos-Ratsikas (University of Edinburgh)
Philip Lazos (Input Output Global (IOG))
Evangelos Markakis (Athens University of Economics and Business & Input
Output Global (IOG))
Georgios Papasotiropoulos (Athens University of Economics and Business,
Input Output Global (IOG), & University of Warsaw)
|
(Return
to Top) |
Offline
Risk-sensitive RL with Partial Observability to Enhance Performance in
Human-Robot Teaming (Page
58)
Giorgio Angelotti (ANITI, Fédération ENAC ISAE-SUPAERO ONERA, Université
de Toulouse)
Caroline P. C. Chanel (ANITI, Fédération ENAC ISAE-SUPAERO ONERA, Université
de Toulouse)
Adam Henrique Moreira Pinto (Fédération ENAC ISAE-SUPAERO ONERA, Université
de Toulouse)
Christophe Lounis (Fédération ENAC ISAE-SUPAERO ONERA, Université de
Toulouse)
Corentin Chauffaut (Fédération ENAC ISAE-SUPAERO ONERA, Université
de Toulouse)
Nicolas Drougard (ANITI, Fédération ENAC ISAE-SUPAERO ONERA, Université
de Toulouse)
Collective
Robustness of Heterogeneous Decision-Makers Against Stubborn Individuals (Page
68)
Nemanja Antonic (IRIDIA, Université Libre de Bruxelles, DEIB, Politecnico
di Milano, & naXys, Université de Namur)
Raina Zakir (IRIDIA, Université Libre de Bruxelles)
Marco Dorigo (IRIDIA, Université Libre de Bruxelles)
Andreagiovanni Reina (IRIDIA, Université Libre de Bruxelles & CASCB,
Max Planck Institute of Animal Behavior, University of Konstanz)
Willy
Wonka Mechanisms (Page
78)
Thomas Archbold (King's College London)
Bart de Keijzer (King's College London)
Carmine Ventre (King's College London)
Extended
Ranking Mechanisms for the m-Capacitated Facility Location Problem
in Bayesian Mechanism Design (Page
87)
Gennaro Auricchio (University of Bath)
Jie Zhang (University of Bath)
Mengxiao Zhang (Durham University)
Stability
of Weighted Majority Voting under Estimated Weights (Page
96)
Shaojie Bai (Zhejiang University)
Dongxia Wang (Zhejiang University & ZJU-Hangzhou Global Scientific
and Technological Innovation Center)
Tim Muller (University of Nottingham)
Peng Cheng (Zhejiang University)
Jiming Chen (Zhejiang University)
Impact
of Tie-Breaking on the Manipulability of Elections (Page
105)
James P. Bailey (Rensselaer Polytechnic Institute)
Craig A. Tovey (Georgia Institute of Technology)
|
(Return
to Top) |
Minimax
Exploiter: A Data Efficient Approach for Competitive Self-Play (Page
114)
Daniel Bairamian (Mila - Quebec AI Institute, McGill University)
Philippe Marcotte (Ubisoft Montreal)
Joshua Romoff (Ubisoft Montreal)
Gabriel Robert (Ubisoft Montreal)
Derek Nowrouzezahrai (Mila - Quebec AI Institute, McGill University)
Strategic
Reasoning under Capacity-constrained Agents (Page
123)
Gabriel Ballot (SEIDO Lab, EDF R&D and Télécom Paris, Institut Polytechnique
de Paris)
Vadim Malvone (LTCI, Télécom Paris, Institut Polytechnique de Paris)
Jean Leneutre (LTCI, Télécom Paris, Institut Polytechnique de Paris)
Youssef Laarouchi (SEIDO Lab, EDF R&D)
Trust
in Shapley: A Cooperative Quest for Global Trust in P2P Network (Page
132)
Arti Bandhana (Artificial Intelligence Center, Department of Computer
Science, Faculty of Electrical Engineering, Czech Technical University
in Prague)
Tomá Kroupa (Artificial Intelligence Center, Department of Computer
Science, Faculty of Electrical Engineering, Czech Technical University
in Prague)
Sebastián Garcia (Artificial Intelligence Center, Department of Computer
Science, Faculty of Electrical Engineering, Czech Technical University
in Prague)
A
Model-Based Solution to the Offline Multi-Agent Reinforcement Learning
Coordination Problem (Page
141)
Paul Barde (Mila - Quebec AI Institute, McGill University)
Jakob Foerster (University of Oxford)
Derek Nowrouzezahrai (Mila - Quebec AI Institute, McGill University)
Amy Zhang (University of Texas at Austin & Meta AI - FAIR)
Parameterized
Guarantees for Almost Envy-Free Allocations (Page
151)
Siddharth Barman (Indian Institute of Science)
Debajyoti Kar (Indian Institute of Science)
Shraddha Pathak (Penn State University)
|
(Return
to Top) |
Verification
of Stochastic Multi-Agent Systems with Forgetful Strategies (Page
160)
Francesco Belardinelli (Imperial College London)
Wojtek Jamroga (SnT, University of Luxembourg & ICS, Polish Academy
of Sciences)
Munyque Mittelmann (University of Naples Federico II)
Aniello Murano (University of Naples Federico II)
Combining
Voting and Abstract Argumentation to Understand Online Discussions (Page
170)
Michael Bernreiter (DBAI, TU Wien)
Jan Maly (DBAI, TU Wien)
Oliviero Nardi (DBAI, TU Wien)
Stefan Woltran (DBAI, TU Wien)
Monitoring
Second-Order Hyperproperties (Page
180)
Raven Beutner (CISPA Helmholtz Center for Information Security)
Bernd Finkbeiner (CISPA Helmholtz Center for Information Security)
Hadar Frenkel (CISPA Helmholtz Center for Information Security)
Niklas Metzger (CISPA Helmholtz Center for Information Security)
Hyper
Strategy Logic (Page
189)
Raven Beutner (CISPA Helmholtz Center for Information Security)
Bernd Finkbeiner (CISPA Helmholtz Center for Information Security)
Optimal
Referral Auction Design (Page
198)
Rangeet Bhattacharyya (IIT Kanpur)
Parvik Dave (IIT Bombay)
Palash Dey (IIT Kharagpur)
Swaprava Nath (IIT Bombay)
On
Green Sustainability of Resource Selection Games with Equitable Cost-Sharing (Page
207)
Vittorio Bilň (University of Salento)
Michele Flammini (Gran Sasso Science Institute & University of Calabria)
Gianpiero Monaco (University of Chieti-Pescara)
Luca Moscardelli (University of Chieti-Pescara)
Cosimo Vinci (University of Salento)
|
(Return
to Top) |
An
Online Learning Theory of Brokerage (Page
216)
Nataša Bolić (University of Ottawa)
Tommaso Cesari (University of Ottawa)
Roberto Colomboni (University of Milan & IIT)
Robust
Popular Matchings (Page
225)
Martin Bullinger (University of Oxford)
Rohith Reddy Gangam (University of California, Irvine)
Parnian Shahkar (University of California, Irvine)
HELP!
Providing Proactive Support in the Presence of Knowledge Asymmetry (Page
234)
Turgay Caglar (Colorado State University)
Sarath Sreedharan (Colorado State University)
On
the Complexity of Pareto-Optimal and Envy-Free Lotteries (Page
244)
Ioannis Caragiannis (Aarhus University)
Kristoffer Arnsfelt Hansen (Aarhus University)
Nidhi Rathi (Max Planck Institute for Informatics, University of Saarland)
A
Distributed Approach for Fault Detection in Swarms of Robots (Page
253)
Alessandro Carminati (Politecnico di Milano)
Davide Azzalini (Politecnico di Milano)
Simone Vantini (Politecnico di Milano)
Francesco Amigoni (Politecnico di Milano)
Finding
Effective Ad Allocations: How to Exploit User History (Page
262)
Matteo Castiglioni (Politecnico di Milano)
Alberto Latino (Politecnico di Milano)
Alberto Marchesi (Politecnico di Milano)
Giulia Romano (Politecnico di Milano)
Nicola Gatti (Politecnico di Milano)
Chokha Palayamkottai (Locify)
|
(Return
to Top) |
Obstruction
Alternating-time Temporal Logic: A Strategic Logic to Reason about Dynamic
Models (Page 271)
Davide Catta (Universitŕ di Napoli, Federico II)
Jean Leneutre (Télécom Paris)
Vadim Malvone (Télécom Paris)
Aniello Murano (Universitŕ di Napoli, Federico II)
Aligning
Credit for Multi-Agent Cooperation via Model-based Counterfactual Imagination (Page
281)
Jiajun Chai (Institute of Automation, Chinese Academy of Sciences &
School of Artificial Intelligence, University of Chinese Academy of Sciences)
Yuqian Fu (Institute of Automation, Chinese Academy of Sciences & School
of Artificial Intelligence, University of Chinese Academy of Sciences)
Dongbin Zhao (Institute of Automation, Chinese Academy of Sciences
& School of Artificial Intelligence, University of Chinese Academy of
Sciences)
Yuanheng Zhu (Institute of Automation, Chinese Academy of Sciences
& School of Artificial Intelligence, University of Chinese Academy of
Sciences)
Cooperative
Electric Vehicles Planning (Page
290)
Jaël Champagne Gareau (Université du Québec ŕ Montréal)
Marc-André Lavoie (Cégep du Vieux Montréal)
Guillaume Gosset (Université du Québec ŕ Montréal)
Éric Beaudry (Université du Québec ŕ Montréal)
Think
Global, Act Local - Agent-Based Inline Recovery for Airline Operations (Page
299)
Yashovardhan S. Chati (Tata Consultancy Services Research)
Ramasubramanian Suriyanarayanan (Tata Consultancy Services Research)
Arunchandar Vasan (Tata Consultancy Services Research)
Deep
Anomaly Detection via Active Anomaly Search (Page
308)
Chao Chen (National Key Laboratory for Novel Software Technology, School
of Artificial Intelligence, Nanjing University)
Dawei Wang (Alibaba Group)
Feng Mao (Alibaba Group)
Jiacheng Xu (National Key Laboratory for Novel Software Technology,
School of Artificial Intelligence, Nanjing University)
Zongzhang Zhang (National Key Laboratory for Novel Software Technology,
School of Artificial Intelligence, Nanjing University)
Yang Yu (National Key Laboratory for Novel Software Technology, School
of Artificial Intelligence, Nanjing University)
Foresight
Distribution Adjustment for Off-policy Reinforcement Learning (Page
317)
Ruifeng Chen (National Key Laboratory for Novel Software Technology,
School of Artificial Intelligence, Nanjing University)
Xu-Hui Liu (National Key Laboratory for Novel Software Technology,
School of Artificial Intelligence, Nanjing University & Polixir Technologies)
Tian-Shuo Liu (National Key Laboratory for Novel Software Technology,
School of Artificial Intelligence, Nanjing University & Polixir Technologies)
Shengyi Jiang (The University of Hong Kong)
Feng Xu (National Key Laboratory for Novel Software Technology, School
of Artificial Intelligence, Nanjing University)
Yang Yu (National Key Laboratory for Novel Software Technology, School
of Artificial Intelligence, Nanjing University & Polixir Technologies)
|
(Return
to Top) |
Adaptive
Primal-Dual Method for Safe Reinforcement Learning (Page
326)
Weiqin Chen (Rensselaer Polytechnic Institute)
James Onyejizu (Rensselaer Polytechnic Institute)
Long Vu (IBM T.J. Watson Research Center)
Lan Hoang (IBM Research)
Dharmashankar Subramanian (IBM T.J. Watson Research Center)
Koushik Kar (Rensselaer Polytechnic Institute)
Sandipan Mishra (Rensselaer Polytechnic Institute)
Santiago Paternain (Rensselaer Polytechnic Institute)
Boosting
Continuous Control with Consistency Policy (Page
335)
Yuhui Chen (Institute of Automation, Chinese Academy of Sciences &
School of Artificial Intelligence, University of Chinese Academy of Sciences)
Haoran Li (Institute of Automation, Chinese Academy of Sciences & School
of Artificial Intelligence, University of Chinese Academy of Sciences)
Dongbin Zhao (Institute of Automation, Chinese Academy of Sciences
& School of Artificial Intelligence, University of Chinese Academy of
Sciences)
ODEs
Learn to Walk: ODE-Net based Data-Driven Modeling for Crowd Dynamics (Page
345)
Chen Cheng (Shanghai Jiao Tong University)
Jinglai Li (University of Birmingham)
Fast
and Slow Goal Recognition (Page
354)
Mattia Chiari (University of Brescia)
Alfonso Emilio Gerevini (University of Brescia)
Andrea Loreggia (University of Brescia)
Luca Putelli (University of Brescia)
Ivan Serina (University of Brescia)
Learning
a Social Network by Influencing Opinions (Page
363)
Dmitry Chistikov (University of Warwick)
Luisa Estrada (University of Warwick)
Mike Paterson (University of Warwick)
Paolo Turrini (University of Warwick)
Fairness
and Efficiency Trade-off in Two-sided Matching (Page
372)
Sung-Ho Cho (Kyushu University)
Kei Kimura (Kyushu University)
Kiki Liu (Kyushu University)
Kwei-guu Liu (Kyushu University)
Zhengjie Liu (Kyushu University)
Zhaohong Sun (Kyushu University)
Kentaro Yahiro (Kyushu University)
Makoto Yokoo (Kyushu University)
|
(Return
to Top) |
Private
Agent-Based Modeling (Page
381)
Ayush Chopra (Massachusetts Institute of Technology)
Arnau Quera-Bofarull (University of Oxford)
Nurullah Giray-Kuru (Massachusetts Institute of Technology)
Michael Wooldridge (University of Oxford)
Ramesh Raskar (Massachusetts Institute of Technology)
flame:
A Framework for Learning in Agent-based ModEls (Page
391)
Ayush Chopra (Massachusetts Institute of Technology)
Jayakumar Subramanian (Adobe)
Balaji Krishnamurthy (Adobe)
Ramesh Raskar (Massachusetts Institute of Technology)
Multi-Robot
Allocation of Assistance from a Shared Uncertain Operator (Page
400)
Clarissa Costen (University of Oxford)
Anna Gautier (KTH Royal Institute of Technology)
Nick Hawes (University of Oxford)
Bruno Lacerda (University of Oxford)
A
Simple 1.5-approximation Algorithm for a Wide Range of Maximum Size Stable
Matching Problems (Page
409)
Gergely Csáji (HUN-REN Centre for Economic and Regional Studies & ELTE
Eötvös Loránd University)
Designing
Redistribution Mechanisms for Reducing Transaction Fees in Blockchains (Page
416)
Sankarshan Damle (IIIT, Hyderabad)
Manisha Padala (IISc, Bangalore)
Sujit Gujar (IIIT, Hyderabad)
|
(Return
to Top) |
The
Parameterized Complexity of Welfare Guarantees in Schelling Segregation (Page
425)
Argyrios Deligkas (Royal Holloway, University of London)
Eduard Eiben (Royal Holloway, University of London)
Tiger-Lily Goldsmith (Royal Holloway, University of London)
Toward
a Quality Model for Hybrid Intelligence Teams (Page
434)
Davide Dell'Anna (Utrecht University)
Pradeep K. Murukannaiah (Delft University of Technology)
Bernd Dudzik (Delft University of Technology)
Davide Grossi (University of Groningen)
Catholijn M. Jonker (Delft University of Technology)
Catharine Oertel (Delft University of Technology)
Pinar Yolum (Utrecht University)
Informativeness
of Reward Functions in Reinforcement Learning (Page
444)
Rati Devidze (MPI-SWS)
Parameswaran Kamalaruban (The Alan Turing Institute)
Adish Singla (MPI-SWS)
Continual
Optimistic Initialization for Value-Based Reinforcement Learning (Page
453)
Sheelabhadra Dey (Texas A&M University)
James Ault (Texas A&M University)
Guni Sharon (Texas A&M University)
Gerrymandering
Planar Graphs (Page
463)
Jack Dippel (McGill University)
Max Dupré la Tour (McGill University)
April Niu (McGill University)
Sanjukta Roy (Pennsylvania State University & University of Leeds)
Adrian Vetta (McGill Univeristy)
It
Is Among Us: Identifying Adversaries in Ad-hoc Domains using Q-valued
Bayesian Estimations (Page
472)
Matheus Aparecido Do Carmo Alves (Lancaster University)
Amokh Varma (Indian Institute of Technology)
Yehia Elkhatib (University of Glasgow)
Leandro Soriano Marcolino (Lancaster University)
|
(Return
to Top) |
Dynamic
Epistemic Logic of Resource Bounded Information Mining Agents (Page
481)
Vitaliy Dolgorukov (HSE University)
Rustam Galimullin (University of Bergen)
Maksim Gladyshev (Utrecht University)
Population
Synthesis as Scenario Generation for Simulation-based Planning under Uncertainty (Page
490)
Joel Dyer (University of Oxford)
Arnau Quera-Bofarull (University of Oxford)
Nicholas Bishop (University of Oxford)
J. Doyne Farmer (University of Oxford)
Anisoara Calinescu (University of Oxford)
Michael Wooldridge (University of Oxford)
Computational
Aspects of Distortion (Page
499)
Soroush Ebadian (University of Toronto)
Aris Filos-Ratsikas (University of Edinburgh)
Mohamad Latifian (University of Toronto)
Nisarg Shah (University of Toronto)
Multi-Agent
Reinforcement Learning for Assessing False-Data Injection Attacks on Transportation
Networks (Page 508)
Taha Eghtesad (Pennsylvania State University)
Sirui Li (Massachusetts Institute of Technology)
Yevgeniy Vorobeychik (Washington University in St. Louis)
Aron Laszka (Pennsylvania State University)
Reinforcement
Learning in the Wild with Maximum Likelihood-based Model Transfer (Page
516)
Hannes Eriksson (Zenseact & Chalmers University of Technology)
Tommy Tram (Zenseact & Chalmers University of Technology)
Debabrota Basu (Univ. Lille, Inria, CNRS & Centrale Lille, UMR 9189
- CRIStAL)
Mina Alibeigi (Zenseact)
Christos Dimitrakakis (University of Oslo & University of Neuchatel)
Holonic
Learning: A Flexible Agent-based Distributed Machine Learning Framework (Page
525)
Ahmad Esmaeili (Purdue University)
Zahra Ghorrati (Purdue University)
Eric T. Matson (Purdue University)
|
(Return
to Top) |
Learning
and Calibrating Heterogeneous Bounded Rational Market Behaviour with Multi-agent
Reinforcement Learning (Page
534)
Benjamin Patrick Evans (J.P. Morgan AI Research)
Sumitra Ganesh (J.P. Morgan AI Research)
High-Level,
Collaborative Task Planning Grammar and Execution for Heterogeneous Agents (Page
544)
Amy Fang (Cornell University)
Hadas Kress-Gazit (Cornell University)
Facility
Location Games with Fractional Preferences and Limited Resources (Page
553)
Jiazhu Fang (Ocean University of China)
Wenjing Liu (Ocean University of China)
Generalized
Strategy Synthesis of Infinite-state Impartial Combinatorial Games via
Exact Binary Classification (Page
562)
Liangda Fang (Jinan University & Pazhou Lab)
Meihong Yang (Jinan University)
Dingliang Cheng (Jinan University)
Yunlai Hao (Jinan University)
Quanlong Guan (Jinan University)
Liping Xiong (Wuyi University)
Probabilistic
Multi-agent Only-Believing (Page
571)
Qihui Feng (RWTH Aachen University)
Gerhard Lakemeyer (RWTH Aachen University)
Preventing
Deadlocks for Multi-Agent Pickup and Delivery in Dynamic Environments (Page
580)
Benedetta Flammini (Politecnico di Milano)
Davide Azzalini (Politecnico di Milano)
Francesco Amigoni (Politecnico di Milano)
|
(Return
to Top) |
Potential-Based
Reward Shaping for Intrinsic Motivation (Page
589)
Grant C. Forbes (North Carolina State University)
Nitish Gupta (North Carolina State University)
Leonardo Villalobos-Arias (North Carolina State University)
Colin M. Potts (North Carolina State University)
Arnav Jhala (North Carolina State University)
David L. Roberts (North Carolina State University)
Learning
Complex Teamwork Tasks using a Given Sub-task Decomposition (Page
598)
Elliot Fosong (University of Edinburgh)
Arrasy Rahman (University of Texas at Austin)
Ignacio Carlucho (Heriot-Watt University)
Stefano V. Albrecht (University of Edinburgh)
BrainSLAM:
SLAM on Neural Population Activity Data (Page
607)
Kipp Freud (University of Bristol)
Nathan Lepora (University of Bristol)
Matt W. Jones (University of Bristol)
Cian O'Donnell (Ulster University)
From
Market Saturation to Social Reinforcement: Understanding the Impact of
Non-Linearity in Information Diffusion Models (Page
614)
Tobias Friedrich (Hasso Plattner Institute, University of Potsdam)
Andreas Göbel (Hasso Plattner Institute, University of Potsdam)
Nicolas Klodt (Hasso Plattner Institute, University of Potsdam)
Martin S. Krejca (Laboratoire d'Informatique (LIX), CNRS, École Polytechnique,
Institut Polytechnique de Paris)
Marcus Pappik (Hasso Plattner Institute, University of Potsdam)
Analysing
the Sample Complexity of Opponent Shaping (Page
623)
Kitty Fung (University of Oxford)
Qizhen Zhang (University of Oxford)
Chris Lu (University of Oxford)
Jia Wan (Massachusetts Institute of Technology)
Timon Willi (University of Oxford)
Jakob Foerster (University of Oxford)
RACCER:
Towards Reachable and Certain Counterfactual Explanations for Reinforcement
Learning (Page 632)
Jasmina Gajcin (Trinity College Dublin)
Ivana Dusparic (Trinity College Dublin)
|
(Return
to Top) |
Surge
Routing: Event-informed Multiagent Reinforcement Learning for Autonomous
Rideshare (Page 641)
Daniel Garces (Harvard University)
Stephanie Gil (Harvard University)
Incentives
for Early Arrival in Cooperative Games (Page
651)
Yaoxin Ge (ShanghaiTech University)
Yao Zhang (ShanghaiTech University)
Dengji Zhao (ShanghaiTech University)
Zhihao Gavin Tang (Shanghai University of Finance and Economics)
Hu Fu (Shanghai University of Finance and Economics)
Pinyan Lu (Shanghai University of Finance and Economics)
Deep
Reinforcement Learning with Coalition Action Selection for Online Combinatorial
Resource Allocation with Arbitrary Action Space (Page
660)
Tesfay Zemuy Gebrekidan (University of Southampton)
Sebastian Stein (University of Southampton)
Timothy J. Norman (University of Southampton)
Approximating
the Core via Iterative Coalition Sampling (Page
669)
Ian Gemp (Google DeepMind)
Marc Lanctot (Google DeepMind)
Luke Marris (Google DeepMind)
Yiran Mao (Google DeepMind)
Edgar Duéńez-Guzmán (Google DeepMind)
Sarah Perrin (Google DeepMind)
Andras Gyorgy (Google DeepMind)
Romuald Elie (Google DeepMind)
Georgios Piliouras (Google DeepMind)
Michael Kaisers (Google DeepMind)
Daniel Hennes (Google DeepMind)
Kalesha Bullard (Google DeepMind)
Kate Larson (Google DeepMind)
Yoram Bachrach (Google DeepMind)
Modelling
the Rise and Fall of Two-sided Markets (Page
679)
Farnoud Ghasemi (Faculty of Mathematics and Computer Science, Jagiellonian
University)
Rafał Kucharski (Faculty of Mathematics and Computer Science, Jagiellonian
University)
NovelGym:
A Flexible Ecosystem for Hybrid Planning and Learning Agents Designed
for Open Worlds (Page
688)
Shivam Goel (Tufts University)
Yichen Wei (Brown University)
Panagiotis Lymperopoulos (Tufts University)
Klára Churá (Tufts University)
Matthias Scheutz (Tufts University)
Jivko Sinapov (Tufts University)
|
(Return
to Top) |
Capacity
Modification in the Stable Matching Problem (Page
697)
Salil Gokhale (Indian Institute of Technology Delhi)
Samarth Singla (Indian Institute of Technology Delhi)
Shivika Narang (Simons Laufer Mathematical Science Institute)
Rohit Vaish (Indian Institute of Technology Delhi)
Nash
Stability in Hedonic Skill Games (Page
706)
Laurent Gourves (Université Paris Dauphine-PSL, CNRS, LAMSADE)
Gianpiero Monaco (University of Chieti-Pescara)
Symbolic
Computation of Sequential Equilibria (Page
715)
Moritz Graf (University of Freiburg)
Thorsten Engesser (IRIT)
Bernhard Nebel (University of Freiburg)
Reinforcement
Learning with Ensemble Model Predictive Safety Certification (Page
724)
Sven Gronauer (Technical University of Munich (TUM))
Tom Haider (Fraunhofer IKS)
Felippe Schmoeller da Roza (Fraunhofer IKS)
Klaus Diepold (Technical University of Munich (TUM))
MaDi:
Learning to Mask Distractions for Generalization in Visual Deep Reinforcement
Learning (Page 733)
Bram Grooten (Eindhoven University of Technology)
Tristan Tomilin (Eindhoven University of Technology)
Gautham Vasan (University of Alberta)
Matthew E. Taylor (University of Alberta & Alberta Machine Intelligence
Institute (Amii))
A. Rupam Mahmood (University of Alberta & Alberta Machine Intelligence
Institute (Amii))
Meng Fang (University of Liverpool)
Mykola Pechenizkiy (Eindhoven University of Technology)
Decebal Constantin Mocanu (University of Luxembourg)
Cost-aware
Offline Safe Meta Reinforcement Learning with Robust In-Distribution Online
Task Adaptation (Page
743)
Cong Guan (Nanjing University)
Ruiqi Xue (Nanjing University)
Ziqian Zhang (Nanjing University)
Lihe Li (Nanjing University)
Yi-Chen Li (Nanjing University)
Lei Yuan (Nanjing University)
Yang Yu (Nanjing University)
|
(Return
to Top) |
Cooperation
and Coordination in Heterogeneous Populations with Interaction Diversity (Page
752)
Hao Guo (Tsinghua University)
Zhen Wang (Northwestern Polytechnical University)
Junliang Xing (Tsinghua University)
Pin Tao (Tsinghua University)
Yuanchun Shi (Tsinghua University)
First
100 days of Pandemic: An Interplay of Pharmaceutical, Behavioral and Digital
Interventions - A Study using Agent Based Modeling (Page
761)
Gauri Gupta (Massachusetts Institute of Technology)
Ritvik Kapila (University of California San Diego)
Ayush Chopra (Massachusetts Institute of Technology)
Ramesh Raskar (Massachusetts Institute of Technology)
Causal
Explanations for Sequential Decision-Making in Multi-Agent Systems (Page
771)
Balint Gyevnar (University of Edinburgh)
Cheng Wang (University of Edinburgh)
Christopher G. Lucas (University of Edinburgh)
Shay B. Cohen (University of Edinburgh)
Stefano V. Albrecht (University of Edinburgh)
Weighted
Proportional Allocations of Indivisible Goods and Chores: Insights via
Matchings (Page 780)
Vishwa Prakash H.V. (Chennai Mathematical Institute)
Prajakta Nimbhorkar (Chennai Mathematical Institute)
Sample
and Communication Efficient Fully Decentralized MARL Policy Evaluation
via a New Approach: Local TD Update (Page
789)
Hairi (University of Wisconsin-Whitewater)
Zifan Zhang (North Carolina State University)
Jia Liu (The Ohio State University)
Forecasting
and Mitigating Disruptions in Public Bus Transit Services (Page
798)
Chaeeun Han (Pennsylvania State University)
Jose Paolo Talusan (Vanderbilt University)
Dan Freudberg (WeGo Public Transit)
Ayan Mukhopadhyay (Vanderbilt University)
Abhishek Dubey (Vanderbilt University)
Aron Laszka (Pennsylvania State University)
|
(Return
to Top) |
Solving
Two-player Games with QBF Solvers in General Game Playing (Page
807)
Yifan He (UNSW Sydney)
Abdallah Saffidine (UNSW Sydney)
Michael Thielscher (UNSW Sydney)
Facility
Location Games with Scaling Effects (Page
816)
Yu He (City University of Hong Kong)
Alexander Lam (City University of Hong Kong)
Minming Li (City University of Hong Kong)
Tight
Approximations for Graphical House Allocation (Page
825)
Hadi Hosseini (Penn State University)
Andrew McGregor (UMass Amherst)
Rik Sengupta (UMass Amherst)
Rohit Vaish (IIT Delhi)
Vignesh Viswanathan (UMass Amherst)
Measuring
Policy Distance for Multi-Agent Reinforcement Learning (Page
834)
Tianyi Hu (Institute of Automation, CAS & School of Artificial Intelligence,
UCAS)
Zhiqiang Pu (Institute of Automation, CAS & School of Artificial Intelligence,
UCAS)
Xiaolin Ai (Institute of Automation, CAS)
Tenghai Qiu (Institute of Automation, CAS)
Jianqiang Yi (Institute of Automation, CAS & School of Artificial Intelligence,
UCAS)
Applying
Opponent Modeling for Automatic Bidding in Online Repeated Auctions (Page
843)
Yudong Hu (University of Chinese Academy of Sciences)
Congying Han (University of Chinese Academy of Sciences)
Tiande Guo (University of Chinese Academy of Sciences)
Hao Xiao (Institute of Electrical Engineering, Chinese Academy of Sciences)
Keeping
the Harmony Between Neighbors: Local Fairness in Graph Fair Division (Page
852)
Halvard Hummel (Norwegian University of Science and Technology)
Ayumi Igarashi (University of Tokyo)
|
(Return
to Top) |
On
the Stability of Learning in Network Games with Many Players (Page
861)
Aamal Hussain (Imperial College London)
Dan Leonte (Imperial College London)
Francesco Belardinelli (Imperial College London)
Georgios Piliouras (Singapore University of Technology and Design)
Rational
Verification with Quantitative Probabilistic Goals (Page
871)
David Hyland (University of Oxford)
Julian Gutierrez (Monash University)
Krishna Shankaranarayanan (IIT Bombay)
Michael Wooldridge (University of Oxford)
BDI
Agents in Natural Language Environments (Page
880)
Alexandre Yukio Ichida (Pontifical Catholic University of Rio)
Felipe Meneguzzi (University of Aberdeen & Pontifical Catholic University
of Rio)
Rafael C. Cardoso (University of Aberdeen)
A
Cloud-Based Microservices Solution for Multi-Agent Traffic Control Systems (Page
889)
Chikadibia Ihejimba (The University of Texas at Dallas)
Rym Z. Wenkstern (The University of Texas at Dallas)
Is
Limited Information Enough? An Approximate Multi-agent Coverage Control
in Non-Convex Discrete Environments (Page
898)
Tatsuya Iwase (Toyota Motor Europe NV/SA)
Aurélie Beynier (Sorbonne Université, CNRS LIP6)
Nicolas Bredeche (Sorbonne Université, CNRS ISIR)
Nicolas Maudet (Sorbonne Université, CNRS LIP6)
Jason R. Marden (University of California, Santa Barbara)
Towards
a Principle-based Framework for Repair Selection in Inconsistent Knowledge
Bases (Page 907)
Said Jabbour (CRIL, CNRS UMR 8188, Université d'Artois)
Yue Ma (LISN, CNRS UMR 9015, Université Paris-Saclay)
Badran Raddaoui (SAMOVAR, Télécom SudParis, Institut Polytechnique
de Paris)
|
(Return
to Top) |
Unraveling
the Tapestry of Deception and Personality: A Deep Dive into Multi-Issue
Human-Agent Negotiation Dynamics (Page
916)
Nusrath Jahan (University of Central Florida)
Johnathan Mell (University of Central Florida)
Playing
Quantitative Games Against an Authority: On the Module Checking Problem (Page
926)
Wojciech Jamroga (SnT, University of Luxembourg & ICS, Polish Academy
of Sciences)
Munyque Mittelmann (University of Naples Federico II)
Aniello Murano (University of Naples Federico II)
Giuseppe Perelli (Sapienza University of Rome)
Discovering
Consistent Subelections (Page
935)
Łukasz Janeczko (AGH University)
Jérôme Lang (CNRS)
Grzegorz Lisowski (AGH University)
Stanisław Szufa (AGH University)
Disentangling
Policy from Offline Task Representation Learning via Adversarial Data
Augmentation (Page
944)
Chengxing Jia (National Key Laboratory for Novel Software Technology,
School of Artificial Intelligence, Nanjing University & Polixir Technologies)
Fuxiang Zhang (National Key Laboratory for Novel Software Technology,
School of Artificial Intelligence, Nanjing University)
Yi-Chen Li (National Key Laboratory for Novel Software Technology,
School of Artificial Intelligence, Nanjing University & Polixir Technologies)
Chen-Xiao Gao (National Key Laboratory for Novel Software Technology,
School of Artificial Intelligence, Nanjing University)
Xu-Hui Liu (National Key Laboratory for Novel Software Technology,
School of Artificial Intelligence, Nanjing University & Polixir Technologies)
Lei Yuan (National Key Laboratory for Novel Software Technology, School
of Artificial Intelligence, Nanjing University & Polixir Technologies)
Zongzhang Zhang (National Key Laboratory for Novel Software Technology,
School of Artificial Intelligence, Nanjing University)
Yang Yu (National Key Laboratory for Novel Software Technology, School
of Artificial Intelligence, Nanjing University & Polixir Technologies)
Recourse
under Model Multiplicity via Argumentative Ensembling (Page
954)
Junqi Jiang (Imperial College London)
Francesco Leofante (Imperial College London)
Antonio Rago (Imperial College London)
Francesca Toni (Imperial College London)
Decentralized
Federated Policy Gradient with Byzantine Fault-Tolerance and Provably
Fast Convergence (Page
964)
Philip Jordan (ETH Zürich)
Florian Grötschla (ETH Zürich)
Flint Xiaofeng Fan (National University of Singapore)
Roger Wattenhofer (ETH Zürich)
|
(Return
to Top) |
Safe
Model-Based Multi-Agent Mean-Field Reinforcement Learning (Page
973)
Matej Jusup (ETH Zurich)
Barna Pásztor (ETH Zurich)
Tadeusz Janik (ETH Zurich)
Kenan Zhang (EPFL)
Francesco Corman (ETH Zurich)
Andreas Krause (ETH Zurich)
Ilija Bogunovic (University College London)
To
Lead or to be Led: A Generalized Condorcet Jury Theorem under Dependence (Page
983)
Jonas Karge (TU Dresden)
Juliette-Michelle Burkhardt (TU Dresden)
Sebastian Rudolph (TU Dresden)
Dominik Rusovac (TU Dresden)
Efficient
Method for Finding Optimal Strategies in Chopstick Auctions with Uniform
Objects Values (Page
992)
Stanisław Kaźmierowski (University of Warsaw)
Marcin Dziubiński (University of Warsaw)
Scaling
Opponent Shaping to High Dimensional Games (Page
1001)
Akbir Khan (University College London)
Timon Willi (University Of Oxford)
Newton Kwan (University College London)
Andrea Tacchetti (Deepmind)
Chris Lu (University Of Oxford)
Edward Grefenstette (University College London)
Tim Rocktäschel (University College London)
Jakob Foerster (University of Oxford)
Catfished!
Impacts of Strategic Misrepresentation in Online Dating (Page
1011)
Oz Kilic (Carleton University)
Alan Tsang (Carleton University)
Veto
Core Consistent Preference Aggregation (Page
1020)
Aleksei Y. Kondratev (HSE University)
Egor Ianovski (Unaffiliated)
|
(Return
to Top) |
Fine-Grained
Liquid Democracy for Cumulative Ballots (Page
1029)
Matthias Köppe (University of California)
Martin Koutecký (Charles University)
Krzysztof Sornat (University of Wrocław)
Nimrod Talmon (Ben-Gurion University of the Negev)
Minimizing
State Exploration While Searching Graphs with Unknown Obstacles (Page
1038)
Daniel Koyfman (Ben-Gurion University)
Shahaf S. Shperberg (Ben-Gurion University)
Dor Atzmon (Bar-Ilan University)
Ariel Felner (Ben-Gurion University)
Continuous
Monte Carlo Graph Search (Page
1047)
Kalle Kujanpää (Aalto University)
Amin Babadi (Bugbear Entertainment & Aalto University)
Yi Zhao (Aalto University)
Juho Kannala (Aalto University)
Alexander Ilin (Aalto University, System 2 AI)
Joni Pajarinen (Aalto University)
Approximating
APS Under Submodular and XOS Valuations with Binary Marginals (Page
1057)
Pooja Kulkarni (University of Illinois at Urbana-Champaign)
Rucha Kulkarni (University of Illinois at Urbana-Champaign)
Ruta Mehta (University of Illinois at Urbana-Champaign)
Higher
Order Reasoning under Intent Uncertainty Reinforces the Hobbesian Trap (Page
1066)
Otto Kuusela (University of Amsterdam)
Debraj Roy (University of Amsterdam)
Proportional
Fairness in Obnoxious Facility Location (Page
1075)
Alexander Lam (City University of Hong Kong)
Haris Aziz (UNSW Sydney)
Bo Li (Hong Kong Polytechnic University)
Fahimeh Ramezani (UNSW Sydney)
Toby Walsh (UNSW Sydney)
|
(Return
to Top) |
Beyond
Surprise: Improving Exploration Through Surprise Novelty (Page
1084)
Hung Le (Deakin University)
Kien Do (Deakin University)
Dung Nguyen (Deakin University)
Svetha Venkatesh (Deakin University)
Policy
Learning for Off-Dynamics RL with Deficient Support (Page
1093)
Linh Le Pham Van (Applied Artificial Intelligence Institute, Deakin
University)
Hung The Tran (Applied Artificial Intelligence Institute, Deakin University)
Sunil Gupta (Applied Artificial Intelligence Institute, Deakin University)
The
Stochastic Evolutionary Dynamics of Softmax Policy Gradient in Games (Page
1101)
Chin-wing Leung (University of Warwick)
Shuyue Hu (Shanghai Artificial Intelligence Laboratory)
Ho-fung Leung (Independent Researcher)
Learning
Partner Selection Rules that Sustain Cooperation in Social Dilemmas with
the Option of Opting Out (Page
1110)
Chin-wing Leung (University of Warwick)
Paolo Turrini (University of Warwick)
Coalition
Formation with Bounded Coalition Size (Page
1119)
Chaya Levinger (Ariel University)
Noam Hazon (Ariel University)
Sofia Simola (Technische Universität Wien)
Amos Azaria (Ariel University)
Bounding
the Incentive Ratio of the Probabilistic Serial Rule (Page
1128)
Bo Li (Hong Kong Polytechnic University)
Ankang Sun (Hong Kong Polytechnic University)
Shiji Xing (Hong Kong Polytechnic University)
|
(Return
to Top) |
Normalization
Enhances Generalization in Visual Reinforcement Learning (Page
1137)
Lu Li (Tsinghua University)
Jiafei Lyu (Tsinghua University)
Guozheng Ma (Tsinghua University)
Zilin Wang (Tsinghua University)
Zhenjie Yang (Shanghai Jiao Tong University)
Xiu Li (Tsinghua University)
Zhiheng Li (Tsinghua University)
Grasper:
A Generalist Pursuer for Pursuit-Evasion Problems (Page
1147)
Pengdeng Li (Nanyang Technological University)
Shuxin Li (Nanyang Technological University)
Xinrun Wang (Nanyang Technological University)
Jakub Černý (Columbia University)
Youzhi Zhang (CAIR, HKISI, CAS)
Stephen McAleer (Carnegie Mellon University)
Hau Chan (University of Nebraska-Lincoln)
Bo An (Nanyang Technological University)
Context-aware
Communication for Multi-agent Reinforcement Learning (Page
1156)
Xinran Li (The Hong Kong University of Science and Technology)
Jun Zhang (The Hong Kong University of Science and Technology)
Factor
Graph Neural Network Meets Max-Sum: A Real-Time Route Planning Algorithm
for Massive-Scale Trips (Page
1165)
Yixuan Li (Southeast University)
Wanyuan Wang (Southeast University)
Weiyi Xu (Southeast University)
Yanchen Deng (Nanyang Technological University)
Weiwei Wu (Southeast University)
Developing
a Multi-agent and Self-adaptive Framework with Deep Reinforcement Learning
for Dynamic Portfolio Risk Management (Page
1174)
Zhenglong Li (The University of Hong Kong)
Vincent Tam (The University of Hong Kong)
Kwan L. Yeung (The University of Hong Kong)
A
Complete Landscape for the Price of Envy-Freeness (Page
1183)
Zihao Li (Nanyang Technological University)
Shengxin Liu (Harbin Institute of Technology, Shenzhen)
Xinhang Lu (UNSW Sydney)
Biaoshuai Tao (Shanghai Jiao Tong University)
Yichen Tao (University of Michigan)
|
(Return
to Top) |
Episodic
Reinforcement Learning with Expanded State-reward Space (Page
1192)
Dayang Liang (Xiamen University)
Yaru Zhang (Xiamen University)
Yunlong Liu (Xiamen University)
Policy-regularized
Offline Multi-objective Reinforcement Learning (Page
1201)
Qian Lin (Sun Yat-sen University)
Chao Yu (Sun Yat-sen University)
Zongkai Liu (Sun Yat-sen University)
Zifan Wu (Sun Yat-sen University)
Progression
with Probabilities in the Situation Calculus: Representation and Succinctness (Page
1210)
Daxin Liu (The University of Edinburgh)
Vaishak Belle (The University of Edinburgh)
LLM-Powered
Hierarchical Language Agent for Real-time Human-AI Coordination (Page
1219)
Jijia Liu (Tsinghua University & SIGS)
Chao Yu (Tsinghua University)
Jiaxuan Gao (Tsinghua University)
Yuqing Xie (Tsinghua University)
Qingmin Liao (Tsinghua University & SIGS)
Yi Wu (Tsinghua University & Shanghai Qi Zhi Institute)
Yu Wang (Tsinghua University)
A
Trajectory Perspective on the Role of Data Sampling Techniques in Offline
Reinforcement Learning (Page
1229)
Jinyi Liu (College of Intelligence and Computing, Tianjin University)
Yi Ma (College of Intelligence and Computing, Tianjin University)
Jianye Hao (College of Intelligence and Computing, Tianjin University)
Yujing Hu (NetEase Fuxi AI Lab)
Yan Zheng (College of Intelligence and Computing, Tianjin University)
Tangjie Lv (NetEase Fuxi AI Lab)
Changjie Fan (NetEase Fuxi AI Lab)
2D-Ptr:
2D Array Pointer Network for Solving the Heterogeneous Capacitated Vehicle
Routing Problem (Page
1238)
Qidong Liu (Zhengzhou University & National Supercomputing Center in
Zhengzhou)
Chaoyue Liu (Zhengzhou University)
Shaoyao Niu (Zhengzhou University)
Cheng Long (Nanyang Technological University)
Jie Zhang (Nanyang Technological University)
Mingliang Xu (Zhengzhou University & National Supercomputing Center
in Zhengzhou)
|
(Return
to Top) |
Neural
Population Learning beyond Symmetric Zero-Sum Games (Page
1247)
Siqi Liu (Google DeepMind & University College London)
Luke Marris (Google DeepMind & University College London)
Marc Lanctot (Google DeepMind)
Georgios Piliouras (Google DeepMind)
Joel Z. Leibo (Google DeepMind)
Nicolas Heess (Google DeepMind)
GraphSAID:
Graph Sampling via Attention based Integer Programming
Method (Page
1256)
Ziqi Liu (New York University Center for Data Science)
Laurence Liu (FCC Analytics)
Uncoupled
Learning of Differential Stackelberg Equilibria with Commitments (Page
1265)
Robert Loftin (The University of Sheffield)
Mustafa Mert Çelikok (Delft University of Technology)
Herke van Hoof (University of Amsterdam)
Samuel Kaski (Aalto University & The University of Manchester)
Frans A. Oliehoek (Delft University of Technology)
Safe
Reinforcement Learning with Free-form Natural Language Constraints and
Pre-Trained Language Models (Page
1274)
Xingzhou Lou (School of Artificial Intelligence, UCAS & Institute of
Automation, CAS)
Junge Zhang (School of Artificial Intelligence, UCAS & Institute of
Automation, CAS)
Ziyan Wang (King's College London)
Kaiqi Huang (School of Artificial Intelligence, UCAS & Institute of
Automation, CAS)
Yali Du (King's College London)
DuaLight:
Enhancing Traffic Signal Control by Leveraging Scenario-Specific and Scenario-Shared
Knowledge (Page 1283)
Jiaming Lu (SenseTime Research & ISTBI of Fudan University)
Jingqing Ruan (SenseTime Research)
Haoyuan Jiang (SenseTime Research & Baidu Inc.)
Ziyue Li (University of Cologne & EWI gGmbH)
Hangyu Mao (SenseTime Research)
Rui Zhao (SenseTime Research & Qing Yuan Research Institute of Shanghai
Jiao Tong University)
A
Task-Driven Multi-UAV Coalition Formation Mechanism (Page
1292)
Xinpeng Lu (Yangzhou University)
Heng Song (Nanjing University of Information Science and Technology)
Huailing Ma (Yangzhou University)
Junwu Zhu (Yangzhou University)
|
(Return
to Top) |
Act
as You Learn: Adaptive Decision-Making in Non-Stationary Markov Decision
Processes (Page 1301)
Baiting Luo (Vanderbilt University)
Yunuo Zhang (Vanderbilt University)
Abhishek Dubey (Vanderbilt University)
Ayan Mukhopadhyay (Vanderbilt University)
Oh,
Now I See What You Want: Learning Agent Models with Internal States from
Observations (Page
1310)
Panagiotis Lymperopoulos (Tufts University)
Matthias Scheutz (Tufts University)
Covert
Planning aganist Imperfect Observers (Page
1319)
Haoxiang Ma (University of Florida)
Chongyang Shi (University of Florida)
Shuo Han (University of Illinois Chicago)
Michael R. Dorothy (DEVCOM Army Research Laboratory)
Jie Fu (University of Florida)
Mixed-Initiative
Bayesian Sub-Goal Optimization in Hierarchical Reinforcement Learning (Page
1328)
Haozhe Ma (National University of Singapore)
Thanh Vinh Vo (National University of Singapore)
Tze-Yun Leong (National University of Singapore)
Attacking
Multi-Player Bandits and How to Robustify Them (Page
1337)
Shivakumar Mahesh (University of Oxford)
Anshuka Rangi (Amazon)
Haifeng Xu (University of Chicago)
Long Tran-Thanh (University of Warwick)
Explaining
the Behavior of POMDP-based Agents Through the Impact of Counterfactual
Information (Page
1346)
Saaduddin Mahmud (University of Massachusetts Amherst)
Marcell Vazquez-Chanlatte (Nissan Advanced Technology Center Silicon
Valley)
Stefan Witwicki (Nissan Advanced Technology Center Silicon Valley)
Shlomo Zilberstein (University of Massachusetts Amherst)
|
(Return
to Top) |
Bayesian
Behavioural Model Estimation for Live Crowd Simulation (Page
1355)
Fumiyasu Makinoshima (Fujitsu Limited)
Tetsuro Takahashi (Fujitsu Limited)
Yusuke Oishi (Kyushu University)
PDiT:
Interleaving Perception and Decision-making Transformers for Deep Reinforcement
Learning (Page 1363)
Hangyu Mao (SenseTime Research)
Rui Zhao (SenseTime Research)
Ziyue Li (University of Cologne)
Zhiwei Xu (Institute of Automation, Chinese Academy of Sciences)
Hao Chen (University of Chinese Academy of Sciences)
Yiqun Chen (Gaoling School of AI, Renmin University of China)
Bin Zhang (Institute of Automation, Chinese Academy of Sciences)
Zhen Xiao (Peking University)
Junge Zhang (Institute of Automation, Chinese Academy of Sciences)
Jiangjin Yin (Huazhong Agricultural University)
Network
Agency: An Agent-based Model of Forced Migration from Ukraine (Page
1372)
Zakaria Mehrab (University of Virginia)
Logan Stundal (University of Virginia)
Samarth Swarup (University of Virginia)
Srinivasan Venaktramanan (University of Virginia)
Bryan Lewis (University of Virginia)
Henning Mortveit (University of Virginia)
Christopher Barrett (University of Virginia)
Abhishek Pandey (Yale School of Public Health)
Chad Wells (Yale School of Public Health)
Alison Galvani (Yale School of Public Health)
Burton Singer (University of Florida)
David Leblang (University of Virginia)
Rita Colwell (University of Maryland)
Madhav Marathe (University of Virginia)
Containing
the Spread of a Contagion on a Tree (Page
1381)
Michela Meister (Cornell University)
Jon Kleinberg (Cornell University)
TaxAI:
A Dynamic Economic Simulator and Benchmark for Multi-agent Reinforcement
Learning (Page 1390)
Qirui Mi (Institute of Automation, CAS & School of Artificial Intelligence,
UCAS)
Siyu Xia (Institute of Automation, CAS & School of Artificial Intelligence,
UCAS)
Yan Song (Institute of Automation, CAS)
Haifeng Zhang (Institute of Automation, CAS, School of Artificial Intelligence,
UCAS, & Nanjing Artificial Intelligence Research of IA)
Shenghao Zhu (University of International Business and Economics)
Jun Wang (University College London)
Evaluating
District-based Election Surveys with Synthetic Dirichlet Likelihood (Page
1400)
Adway Mitra (Indian Institute of Technology)
Palash Dey (Indian Institute of Technology)
|
(Return
to Top) |
Observer-Aware
Planning with Implicit and Explicit Communication (Page
1409)
Shuwa Miura (University of Massachusetts Amherst)
Shlomo Zilberstein (University of Massachusetts Amherst)
PI-NeuGODE:
Physics-Informed Graph Neural Ordinary Differential Equations for Spatiotemporal
Trajectory Prediction (Page
1418)
Zhaobin Mo (Department of Civil Engineering and Engineering Mechanics,
Columbia University)
Yongjie Fu (Department of Civil Engineering and Engineering Mechanics,
Columbia University)
Xuan Di (Department of Civil Engineering and Engineering Mechanics,
Columbia University)
Policy
Optimization using Horizon Regularized Advantage to Improve Generalization
in Reinforcement Learning (Page
1427)
Nasik Muhammad Nafi (Kansas State University)
Raja Farrukh Ali (Kansas State University)
William Hsu (Kansas State University)
Kevin Duong (Kansas State University)
Mason Vick (Kansas State University)
Linking
Vision and Multi-Agent Communication through Visible Light Communication
using Event Cameras (Page
1436)
Haruyuki Nakagawa (Tokyo Institute of Technology & Sony Semiconductor
Solutions Corporation)
Yoshitaka Miyatani (Sony Semiconductor Solutions Corporation)
Asako Kanezaki (Tokyo Institute of Technology)
Rethinking
Out-of-Distribution Detection for Reinforcement Learning: Advancing Methods
for Evaluation and Detection (Page
1445)
Linas Nasvytis (Harvard University)
Kai Sandbrink (University of Oxford)
Jakob Foerster (University of Oxford)
Tim Franzmeyer (University of Oxford)
Christian Schroeder de Witt (University of Oxford)
Mixed-Initiative
Human-Robot Teaming under Suboptimality with Online Bayesian Adaptation (Page
1454)
Manisha Natarajan (Georgia Institute of Technology)
Chunyue Xue (Georgia Institute of Technology)
Sanne van Waveren (Georgia Institute of Technology)
Karen Feigh (Georgia Institute of Technology)
Matthew Gombolay (Georgia Institute of Technology)
|
(Return
to Top) |
Bootstrapping
Linear Models for Fast Online Adaptation in Human-Agent Collaboration (Page
1463)
Benjamin A. Newman (Carnegie Mellon University & Meta)
Chris Paxton (Meta)
Kris Kitani (Carnegie Mellon University & Meta)
Henny Admoni (Carnegie Mellon University)
Solution-oriented
Agent-based Models Generation with Verifier-assisted Iterative In-context
Learning (Page 1473)
Tong Niu (Center for Brain-Inspired Computing Research, Tsinghua University)
Weihao Zhang (Lynxi Technologies Co., Ltd.)
Rong Zhao (Center for Brain-Inspired Computing Research, Tsinghua University)
Reinforcement
Learning Interventions on Boundedly Rational Human Agents in Frictionful
Tasks (Page 1482)
Eura Nofshin (Harvard University)
Siddharth Swaroop (Harvard University)
Weiwei Pan (Harvard University)
Susan Murphy (Harvard University)
Finale Doshi-Velez (Harvard University)
RAISE
the Bar: Restriction of Action Spaces for Improved
Social Welfare and Equity in Traffic Management (Page
1492)
Michael Oesterle (University of Mannheim)
Tim Grams (University of Mannheim)
Christian Bartelt (University of Mannheim)
Heiner Stuckenschmidt (University of Mannheim)
Engineering
LaCAM*: Towards Real-time, Large-scale, and Near-optimal Multi-agent Pathfinding (Page
1501)
Keisuke Okumura (University of Cambridge & National Institute of Advanced
Industrial Science and Technology (AIST))
Learning
and Sustaining Shared Normative Systems via Bayesian Rule Induction in
Markov Games (Page
1510)
Ninell Oldenburg (University of Copenhagen)
Tan Zhi-Xuan (Massachusetts Institute of Technology)
|
(Return
to Top) |
Emergent
Cooperation under Uncertain Incentive Alignment (Page
1521)
Nicole Orzan (University of Groningen)
Erman Acar (ILLC & IvI, University of Amsterdam)
Davide Grossi (University of Groningen)
Roxana Rădulescu (Vrije Universiteit Brussel & Utrecht University)
A
Computational Framework of Human Values (Page
1531)
Nardine Osman (Artificial Intelligence Research Institute (IIIA-CSIC))
Mark d'Inverno (Goldsmiths, University of London & Artificial Intelligence
Research Institute (IIIA-CSIC))
Improving
Mobile Maternal and Child Health Care Programs: Collaborative Bandits
for Time Slot Selection (Page
1540)
Soumyabrata Pal (Google Research India & Adobe)
Milind Tambe (Google Research)
Arun Suggala (Google Research India)
Karthikeyan Shanmugam (Google Research India)
Aparna Taneja (Google Research India)
Monitored
Markov Decision Processes (Page
1549)
Simone Parisi (University of Alberta & Ami)
Montaser Mohammedalamen (University of Alberta)
Alireza Kazemipour (University of Alberta)
Matthew E. Taylor (University of Alberta & Ami)
Michael Bowling (University of Alberta & Ami)
Confidence-Based
Curriculum Learning for Multi-Agent Path Finding (Page
1558)
Thomy Phan (University of Southern California)
Joseph Driscoll (Georgia Institute of Technology)
Justin Romberg (Georgia Institute of Technology)
Sven Koenig (University of Southern California)
Single-Winner
Voting with Alliances: Avoiding the Spoiler Effect (Page
1567)
Grzegorz Pierczyński (AGH University)
Stanisław Szufa (AGH University & CNRS, LAMSADE, Université Paris Dauphine-PSL)
|
(Return
to Top) |
Simultaneously
Achieving Group Exposure Fairness and Within-Group Meritocracy in Stochastic
Bandits (Page 1576)
Subham Pokhriyal (Indian Institute of Technology Ropar)
Shweta Jain (Indian Institute of Technology Ropar)
Ganesh Ghalme (Indian Institute of Technology Hyderabad)
Swapnil Dhamal (Indian Institute of Technology Ropar)
Sujit Gujar (International Institute of Information Technology, Hyderabad)
Atlas-X
Equity Financing: Unlocking New Methods to Securely Obfuscate Axe Inventory
Data Based on Differential Privacy (Page
1585)
Antigoni Polychroniadou (J.P. Morgan AI Research & AlgoCRYPT CoE)
Gabriele Ciprianni (J.P. Morgan Quantitative Research)
Richard Hua (J.P. Morgan Quantitative Research)
Tucker Balch (J.P. Morgan AI Research)
Robust
Knowledge Extraction from Large Language Models using Social Choice Theory (Page
1593)
Nico Potyka (Cardiff University)
Yuqicheng Zhu (Bosch Center of AI & Univ. of Stuttgart)
Yunjie He (Bosch Center of AI & Univ. of Stuttgart)
Evgeny Kharlamov (Bosch Center of AI & Univ. of Oslo)
Steffen Staab (Univ. of Stuttgart & Univ. of Southampton)
Online
Decentralised Mechanisms for Dynamic Ridesharing (Page
1602)
Nicos Protopapas (Archimedes/RC Athena)
Vahid Yazdanpanah (University of Southampton)
Enrico H. Gerding (University of Southampton)
Sebastian Stein (University of Southampton)
Interactively
Learning the User's Utility for Best-Arm Identification in Multi-Objective
Multi-Armed Bandits (Page
1611)
Mathieu Reymond (Vrije Universiteit Brussel)
Eugenio Bargiacchi (Vrije Universiteit Brussel)
Diederik M. Roijers (Vrije Universiteit Brussel)
Ann Nowé (Vrije Universiteit Brussel)
Design
Patterns for Explainable Agents (XAg) (Page
1621)
Sebastian Rodriguez (RMIT University)
John Thangarajah (RMIT University)
Andrew Davey (RMIT University)
|
(Return
to Top) |
Multi-Agent
Diagnostics for Robustness via Illuminated Diversity (Page
1630)
Mikayel Samvelyan (UCL & Meta AI)
Davide Paglieri (UCL)
Minqi Jiang (UCL & Meta AI)
Jack Parker-Holder (UCL)
Tim Rocktäschel (UCL)
The
Triangles of Dishonesty: Modelling the Evolution of Lies, Bullshit, and
Deception in Agent Societies (Page
1645)
Stefan Sarkadi (King's College London)
Peter R. Lewis (Ontario Tech University)
Computing
Optimal Commitments to Strategies and Outcome-Conditional Utility Transfers (Page
1654)
Nathaniel Sauerberg (University of Texas - Austin)
Caspar Oesterheld (Carnegie Mellon University)
CORE:
Towards Scalable and Efficient Causal Discovery with Reinforcement Learning (Page
1664)
Andreas Sauter (Vrije Universiteit Amsterdam)
Nicolň Botteghi (University of Twente)
Erman Acar (IvI and ILLC, University of Amsterdam)
Aske Plaat (LIACS, Universiteit Leiden)
IDIL:
Imitation Learning of Intent-Driven Expert Behavior (Page
1673)
Sangwon Seo (Rice University)
Vaibhav Unhelkar (Rice University)
Multi-user
Norm Consensus (Page
1683)
Marc Serramia (King's College London & City, University of London)
Natalia Criado (Universitat Politčcnica de Valčncia)
Michael Luck (University of Sussex)
|
(Return
to Top) |
Value
Alignment in Participatory Budgeting (Page
1692)
Marc Serramia (City, University of London)
Maite Lopez-Sanchez (University of Barcelona)
Juan A. Rodriguez-Aguilar (Artificial Intelligence Research Institute
(IIIA-CSIC))
Stefano Moretti (LAMSADE, Universtité Paris-Dauphine)
Efficient
Public Health Intervention Planning Using Decomposition-Based Decision-focused
Learning (Page 1701)
Sanket Shah (Harvard University)
Arun Suggala (Google Research India)
Milind Tambe (Google Research)
Aparna Taneja (Google Research India)
Battlefield
Transfers in Coalitional Blotto Games (Page
1710)
Vade Shah (University of California, Santa Barbara)
Jason R. Marden (University of California, Santa Barbara)
Modeling
Cognitive Biases in Decision-theoretic Planning for Active Cyber Deception (Page
1718)
Aditya Shinde (The University of Georgia)
Prashant Doshi (The University of Georgia)
Relaxed
Exploration Constrained Reinforcement Learning (Page
1727)
Shahaf S. Shperberg (Ben-Gurion University)
Bo Liu (The University of Texas at Austin)
Peter Stone (The University of Texas at Austin & Sony AI)
LgTS:
Dynamic Task Sampling using LLM-generated Sub-Goals for Reinforcement
Learning Agents (Page
1736)
Yash Shukla (Tufts University)
Wenchang Gao (Tufts University)
Vasanth Sarathy (Tufts University)
Alvaro Velasquez (University of Colorado Boulder)
Robert Wright (Georgia Tech Research Institute)
Jivko Sinapov (Tufts University)
|
(Return
to Top) |
PAS:
Probably Approximate Safety Verification of Reinforcement Learning Policy
Using Scenario Optimization (Page
1745)
Arambam James Singh (Nanyang Technological University)
Arvind Easwaran (Nanyang Technological University)
Frugal
Actor-Critic: Sample Efficient Off-Policy Deep Reinforcement Learning
Using Unique Experiences (Page
1754)
Nikhil Kumar Singh (Indian Institute of Technology Kanpur)
Indranil Saha (Indian Institute of Technology Kanpur)
On
Dealing with False Beliefs and Maintaining KD45n Property (Page
1763)
Tran Cao Son (New Mexico State University)
Loc Pham (New Mexico State University)
Enrico Pontelli (New Mexico State University)
Boosting
Studies of Multi-Agent Reinforcement Learning on Google Research Football
Environment: The Past, Present, and Future (Page
1772)
Yan Song (Institution of Automation, Chinese Academy of Science)
He Jiang (Digital Brain Lab)
Haifeng Zhang (Institution of Automation, CAS, School of Artificial
Intelligence, UCAS, & Nanjing Artificial Intelligence Research of IA)
Zheng Tian (ShanghaiTech University)
Weinan Zhang (Shanghai Jiao Tong University)
Jun Wang (University College London)
Algorithmic
Filtering, Out-Group Stereotype, and Polarization on Social Media (Page
1782)
Jean Springsteen (Washington University in St. Louis)
William Yeoh (Washington University in St. Louis)
Dino Christenson (Washington University in St. Louis)
Multi-Agent
Alternate Q-Learning (Page
1791)
Kefan Su (Peking University)
Siyuan Zhou (HKUST)
Jiechuan Jiang (Peking University)
Chuang Gan (MIT-IBM Watson AI Lab)
Xiangjun Wang (inspir.ai)
Zongqing Lu (Peking University)
|
(Return
to Top) |
Allocating
Contiguous Blocks of Indivisible Chores Fairly: Revisited (Page
1800)
Ankang Sun (Hong Kong Polytechnic University)
Bo Li (Hong Kong Polytechnic University)
On
the Transit Obfuscation Problem (Page
1809)
Hideaki Takahashi (The University of Tokyo)
Alex Fukunaga (The University of Tokyo)
Towards
Efficient Auction Design with ROI Constraints (Page
1818)
Xinyu Tang (Shandong University)
Hongtao Lv (Shandong University)
Yingjie Gao (Shandong University)
Fan Wu (Shanghai Jiao Tong University)
Lei Liu (Shandong University & Shandong Research Institute of Industrial
Technology)
Lizhen Cui (Shandong University)
Assessing
Fairness of Residential Dynamic Pricing for Electricity using Active Learning
with Agent-based Simulation (Page
1827)
Swapna Thorve (University of Virginia)
Henning Mortveit (University of Virginia)
Anil Vullikanti (University of Virginia)
Madhav Marathe (University of Virginia)
Samarth Swarup (University of Virginia)
Norm
Enforcement with a Soft Touch: Faster Emergence, Happier Agents (Page
1837)
Sz-Ting Tzeng (North Carolina State University)
Nirav Ajmeri (University of Bristol)
Munindar P. Singh (North Carolina State University)
Reducing
Optimism Bias in Incomplete Cooperative Games (Page
1847)
Filip Úradník (Charles University)
David Sychrovský (Charles University)
Jakub Černý (Columbia University)
Martin Černý (Charles University)
|
(Return
to Top) |
Enabling
BDI Agents to Reason on a Dynamic Action Repertoire in Hypermedia Environments (Page
1856)
Danai Vachtsevanou (University of St. Gallen)
Bruno de Lima (Federal University of Santa Catarina)
Andrei Ciortea (University of St. Gallen)
Jomi Fred Hübner (Federal University of Santa Catarina)
Simon Mayer (University of St.Gallen)
Jérémy Lemée (University of St.Gallen)
MABL:
Bi-Level Latent-Variable World Model for Sample-Efficient Multi-Agent
Reinforcement Learning (Page
1865)
Aravind Venugopal (Carnegie Mellon University)
Stephanie Milani (Carnegie Mellon University)
Fei Fang (Carnegie Mellon University)
Balaraman Ravindran (Indian Institute of Technology, Madras)
Optimal
Flash Loan Fee Function with Respect to Leverage Strategies (Page
1874)
Chenmin Wang (The University of Aizu)
Peng Li (The University of Aizu)
Yulong Zeng (YeeZTech)
Xuepeng Fan (YeeZTech)
Positive
Intra-Group Externalities in Facility Location (Page
1883)
Ying Wang (City University of Hong Kong)
Houyu Zhou (City University of Hong Kong)
Minming Li (City University of Hong Kong)
Generalized
Response Objectives for Strategy Exploration in Empirical Game-Theoretic
Analysis (Page 1892)
Yongzhao Wang (The Alan Turing Institute)
Michael P. Wellman (University of Michigan)
The
Reasons that Agents Act: Intention and Instrumental Goals (Page
1901)
Francis Rhys Ward (Imperial College London)
Matt MacDermott (Imperial College London)
Francesco Belardinelli (Imperial College London)
Francesca Toni (Imperial College London)
Tom Everitt (Google DeepMind)
|
(Return
to Top) |
Distributed
Online Rollout for Multivehicle Routing in Unmapped Environments (Page
1910)
Jamison W. Weber (Arizona State University)
Dhanush R. Giriyan (Arizona State University)
Devendra R. Parkar (Arizona State University)
Dimitri P. Bertsekas (Arizona State University)
Andréa W. Richa (Arizona State University)
Towards
Generalizability of Multi-Agent Reinforcement Learning in Graphs with
Recurrent Message Passing (Page
1919)
Jannis Weil (Technical University of Darmstadt)
Zhenghua Bao (Technical University of Darmstadt)
Osama Abboud (Huawei Technologies)
Tobias Meuser (Technical University of Darmstadt)
Multi-Robot
Motion and Task Planning in Automotive Production Using Controller-based
Safe Reinforcement Learning (Page
1928)
Eric Wete (Leibniz University Hannover)
Joel Greenyer (FHDW Hannover)
Daniel Kudenko (Leibniz University Hannover)
Wolfgang Nejdl (Leibniz University Hannover)
New
Algorithms for Distributed Fair k-Center Clustering: Almost Accurate
as Sequential Algorithms (Page
1938)
Xiaoliang Wu (School of Computer Science and Engineering, Central South
University & Xiangjiang Laboratory)
Qilong Feng (School of Computer Science and Engineering, Central South
University & Xiangjiang Laboratory)
Ziyun Huang (Department of Computer Science and Software Engineering,
Penn State Erie, The Behrend College)
Jinhui Xu (Department of Computer Science and Engineering, State University
of New York at Buffalo)
Jianxin Wang (Hunan Provincial Key Lab on Bioinformatics, Central South
University & Xiangjiang Laboratory)
Adaptive
Evolutionary Reinforcement Learning Algorithm with Early Termination Strategy (Page
1947)
Xiaoqiang Wu (Shenzhen University)
Qingling Zhu (Shenzhen University)
Qiuzhen Lin (Shenzhen University)
Weineng Chen (South China University of Technology)
Jianqiang Li (Shenzhen University)
Collaborative
Deep Reinforcement Learning for Solving Multi-Objective Vehicle Routing
Problems (Page 1956)
Yaoxin Wu (Eindhoven University of Technology)
Mingfeng Fan (National University of Singapore)
Zhiguang Cao (Singapore Management University)
Ruobin Gao (Nanyang Technological University)
Yaqing Hou (Dalian University of Technology)
Guillaume Sartoretti (National University of Singapore)
|
(Return
to Top) |
Safeguard
Privacy for Minimal Data Collection with Trustworthy Autonomous Agents (Page
1966)
Mengwei Xu (Newcastle University)
Louise A. Dennis (The University of Manchester)
Mustafa A. Mustafa (The University of Manchester & COSIC, KU Leuven)
Learning
to Schedule Online Tasks with Bandit Feedback (Page
1975)
Yongxin Xu (ShanghaiTech University)
Shangshang Wang (ShanghaiTech University)
Hengquan Guo (ShanghaiTech University)
Xin Liu (ShanghaiTech University)
Ziyu Shao (ShanghaiTech University)
Successively
Pruned Q-Learning: Using Self Q-function to Reduce the Overestimation (Page
1984)
Zhaolin Xue (Fudan University)
Lihua Zhang (Fudan University)
Zhiyan Dong (Fudan University)
Attention-based
Priority Learning for Limited Time Multi-Agent Path Finding (Page
1993)
Yibin Yang (Tsinghua University)
Mingfeng Fan (Central South University)
Chengyang He (National University of Singapore)
Jianqiang Wang (Tsinghua University)
Heye Huang (Tsinghua University)
Guillaume Sartoretti (National University of Singapore)
Automatic
Curriculum for Unsupervised Reinforcement Learning (Page
2002)
Yucheng Yang (Department of Mathematics and Computer Science, Eindhoven
University of Technology)
Tianyi Zhou (Department of Mathematics and Computer Science, University
of Maryland, College Park)
Lei Han (Tencent AI Lab)
Meng Fang (Department of Mathematics and Computer Science, University
of Liverpool & Eindhoven University of Technology)
Mykola Pechenizkiy (Department of Mathematics and Computer Science,
Eindhoven University of Technology)
Multimodal
Pretrained Models for Verifiable Sequential Decision-Making: Planning,
Grounding, and Perception (Page
2011)
Yunhao Yang (University of Texas at Austin)
Cyrus Neary (University of Texas at Austin)
Ufuk Topcu (University of Texas at Austin)
|
(Return
to Top) |
Whom
to Trust? Elective Learning for Distributed Gaussian Process Regression (Page
2020)
Zewen Yang (Robert Koch Institute)
Xiaobing Dai (Technical University of Munich)
Akshat Dubey (Robert Koch Institute)
Sandra Hirche (Technical University of Munich)
Georges Hattab (Robert Koch Institute & Freie Universität Berlin)
Risk-Aware
Constrained Reinforcement Learning with Non-Stationary Policies (Page
2029)
Zhaoxing Yang (Shanghai Jiao Tong University)
Haiming Jin (Shanghai Jiao Tong University)
Yao Tang (Shanghai Jiao Tong University)
Guiyun Fan (Shanghai Jiao Tong University)
When
is Mean-Field Reinforcement Learning Tractable and Relevant? (Page
2038)
Batuhan Yardim (ETH Zürich)
Artur Goldman (HSE University)
Niao He (ETH Zürich)
Viral
Marketing in Social Networks with Competing Products (Page
2047)
Ahad N. Zehmakan (Australian National University)
Xiaotian Zhou (Fudan University)
Zhongzhi Zhang (Fudan University)
Majority-based
Preference Diffusion on Social Networks (Page
2057)
Ahad N. Zehmakan (Australian National University)
Human
Goal Recognition as Bayesian Inference: Investigating the Impact of Actions,
Timing, and Goal Solvability (Page
2066)
Chenyuan Zhang (The University of Melbourne)
Charles Kemp (The University of Melbourne)
Nir Lipovetzky (The University of Melbourne)
|
(Return
to Top) |
Memory-Based
Resilient Control Against Non-cooperation in Multi-agent Flocking (Page
2075)
Mingyue Zhang (Southwest University)
Nianyu Li (ZGC Lab)
Jialong Li (Waseda University)
Jiachun Liao (Nanhu Lab)
Jiamou Liu (University of Auckland)
MESA:
Cooperative Meta-Exploration in Multi-Agent Learning through Exploiting
State-Action Space Structure (Page
2085)
Zhicheng Zhang (Carnegie Mellon University)
Yancheng Liang (University of Washington)
Yi Wu (Tsinghua University)
Fei Fang (Carnegie Mellon University)
Pragmatic
Instruction Following and Goal Assistance via Cooperative Language-Guided
Inverse Planning (Page
2094)
Tan Zhi-Xuan (Massachusetts Institute of Technology)
Lance Ying (Harvard University)
Vikash Mansinghka (Massachusetts Institute of Technology)
Joshua B. Tenenbaum (Massachusetts Institute of Technology)
Maximising
the Influence of Temporary Participants in Opinion Formation (Page
2104)
Zhiqiang Zhuang (Tianjin University)
Kewen Wang (Griffith University)
Zhe Wang (Griffith University)
Junhu Wang (Griffith University)
Yinong Yang (Liaoning University) |
(Return
to Top) |
Extended
Abstract
Defining
Deception in Decision Making (Page
2111)
Marwa Abdulhai (UC Berkeley)
Micah Carroll (UC Berkeley)
Justin Svegliato (UC Berkeley)
Anca Dragan (UC Berkeley)
Sergey Levine (UC Berkeley)
Actual
Trust in Multiagent Systems (Page
2114)
Michael Akintunde (King's College London)
Vahid Yazdanpanah (University of Southampton)
Asieh Salehi Fathabadi (University of Southampton)
Corina Cirstea (University of Southampton)
Mehdi Dastani (Utrecht University)
Luc Moreau (King's College London)
|
(Return
to Top) |
On
General Epistemic Abstract Argumentation Frameworks (Page
2117)
Gianvincenzo Alfano (University of Calabria)
Sergio Greco (University of Calabria)
Francesco Parisi (University of Calabria)
Irina Trubitsyna (University of Calabria)
Approximately
Fair Allocation of Indivisible Items with Random Valuations (Page
2120)
Alessandro Aloisio (Universitŕ degli Studi Internazionali di Roma)
Vittorio Bilň (Universitŕ del Salento)
Antonio Mario Caruso (Universitŕ del Salento)
Michele Flammini (Gran Sasso Science Institute & University of Calabria)
Cosimo Vinci (Universitŕ del Salento)
Quantum
Circuit Design: A Reinforcement Learning Challenge (Page
2123)
Philipp Altmann (LMU Munich)
Adelina Bärligea (TU Munich)
Jonas Stein (LMU Munich)
Michael Kölle (LMU Munich)
Thomas Gabor (LMU Munich)
Thomy Phan (University of Southern California)
Claudia Linnhof-Popien (LMU Munich)
Charging
Electric Vehicles Fairly and Efficiently (Page
2126)
Ramsundar Anandanarayanan (IIT Bombay)
Swaprava Nath (IIT Bombay)
Rohit Vaish (IIT Delhi)
Bounding
Consideration Probabilities in Consider-Then-Choose Ranking Models (Page
2129)
Ben Aoki-Sherwood (Johns Hopkins Applied Physics Lab)
Catherine Bregou (Carleton College)
David Liben-Nowell (Carleton College)
Kiran Tomlinson (Cornell University)
Thomas Zeng (University of Wisconsin)
Abstracting
Assumptions in Structured Argumentation (Page
2132)
Iosif Apostolakis (Institute of Software Technology, TU Graz)
Zeynep G. Saribatur (Institute of Logic and Computation, TU Wien)
Johannes P. Wallner (Institute of Software Technology, TU Graz)
Liquid
Democracy for Low-Cost Ensemble Pruning (Page
2135)
Ben Armstrong (University of Waterloo)
Kate Larson (University of Waterloo)
MiKe:
Task Scheduling for UAV-based Parcel Delivery (Page
2138)
Viviana Arrigoni (Department of Computer Science, Sapienza University
of Rome)
Giulio Attenni (Department of Computer Science, Sapienza University
of Rome)
Novella Bartolini (Department of Computer Science, Sapienza University
of Rome)
Matteo Finelli (Department of Computer Science, Sapienza University
of Rome)
Gaia Maselli (Department of Computer Science, Sapienza University of
Rome)
Entropy
Seeking Constrained Multiagent Reinforcement Learning (Page
2141)
Ayhan Alp Aydeniz (Collaborative Robotics and Intelligent Systems Institute,
Oregon State University)
Enrico Marchesini (Laboratory for Information Decision Systems, Massachusetts
Institute of Technology)
Christopher Amato (Khoury College of Computer Sciences, Northeastern
University)
Kagan Tumer (Collaborative Robotics and Intelligent Systems Institute,
Oregon State University)
Metric
Distortion Under Public-Spirited Voting (Page
2144)
Amirreza Bagheridelouee (Sharif University of Technology)
Marzie Nilipour (Sharif University of Technology & Tehran Institute
for Advanced Studies (TeIAS))
Masoud Seddighin (Tehran Institute for Advanced Studies (TeIAS))
Maziar Shamsipour (Sharif University of Technology)
Concurrency
Model of BDI Programming Frameworks: Why Should We Control It? (Page
2147)
Martina Baiardi (University of Bologna)
Samuele Burattini (University of Bologna)
Giovanni Ciatto (University of Bologna)
Danilo Pianini (University of Bologna)
Andrea Omicini (University of Bologna)
Alessandro Ricci (University of Bologna)
|
(Return
to Top) |
Adaptive
Discounting of Training Time Attacks (Page
2150)
Ridhima Bector (Nanyang Technological University)
Abhay Aradhya (Nanyang Technological University)
Chai Quek (Nanyang Technological University)
Zinovi Rabinovich (Nanyang Technological University)
Computing
Balanced Solutions for Large International Kidney Exchange Schemes when
Cycle Length is Unbounded (Page
2153)
Márton Benedek (KRTK, Institute of Economics)
Péter Biró (KRTK, Institute of Economics)
Gergely Csáji (KRTK, Institute of Economics)
Matthew Johnson (Durham University)
Daniël Paulusma (Durham University)
Xin Ye (Durham University)
Decentralized
Control of Distributed Manipulators: An Information Diffusion Approach (Page
2156)
Nicolas Bessone (IT University of Copenhagen)
Payam Zahadat (IT University of Copenhagen)
Kasper Stoy (IT University of Copenhagen)
Gaze
Supervision for Mitigating Causal Confusion in Driving Agents (Page
2159)
Abhijat Biswas (Carnegie Mellon University)
Badal Arun Pardhi (Carnegie Mellon University)
Caleb Chuck (University of Texas at Austin)
Jarrett Holtz (Robert Bosch LLC & University of Texas at Austin)
Scott Niekum (University of Massachusetts Amherst)
Henny Admoni (Carnegie Mellon University)
Alessandro Allievi (Robert Bosch LLC & University of Texas at Austin)
Fair
Allocation of Conflicting Courses under Additive Utilities (Page
2162)
Arpita Biswas (Harvard University)
Yiduo Ke (Northwestern University)
Samir Khuller (Northwestern University)
Quanquan C. Liu (Yale University)
Factored
MDP based Moving Target Defense with Dynamic Threat Modeling (Page
2165)
Megha Bose (International Institute of Information Technology)
Praveen Paruchuri (International Institute of Information Technology)
Akshat Kumar (Singapore Management University)
|
(Return
to Top) |
Decentralised
Emergence of Robust and Adaptive Linguistic Conventions in Populations
of Autonomous Agents Grounded in Continuous Worlds (Page
2168)
Jérôme Botoko Ekila (Vrije Universiteit Brussel)
Jens Nevens (Vrije Universiteit Brussel)
Lara Verheyen (Vrije Universiteit Brussel)
Katrien Beuls (Université de Namur)
Paul Van Eecke (Vrije Universiteit Brussel)
Who
gets the Maximal Extractable Value? A Dynamic Sharing Blockchain Mechanism (Page
2171)
Pedro Braga (King's College London)
Georgios Chionas (University of Liverpool)
Piotr Krysta (Augusta University)
Stefanos Leonardos (King's College London)
Georgios Piliouras (Singapore University of Technology and Design)
Carmine Ventre (King's College London)
User-centric
Explanation Strategies for Interactive Recommenders (Page
2174)
Berk Buzcu (Özyeğin University)
Emre Kuru (Özyeğin University)
Reyhan Aydoğan (Özyeğin University & Delft University of Technology)
Non
Stationary Bandits with Periodic Variation (Page
2177)
Titas Chakraborty (Indian Institute of Technology Bombay)
Parth Shettiwar (Indian Institute of Technology Bombay)
Mechanism
Design for Reducing Agent Distances to Prelocated Facilities (Page
2180)
Hau Chan (University of Nebraska-Lincoln)
Xinliang Fu (City University of Hong Kong)
Minming Li (City University of Hong Kong)
Chenhao Wang (Beijing Normal University & BNU-HKBU United International
College)
Anytime
Multi-Agent Path Finding using Operation Parallelism in Large Neighborhood
Search (Page 2183)
Shao-Hung Chan (University of Southern California)
Zhe Chen (Monash University)
Dian-Lun Lin (University of Wisconsin-Madison)
Yue Zhang (Monash University)
Daniel Harabor (Monash University)
Sven Koenig (University of Southern California)
Tsung-Wei Huang (University of Wisconsin-Madison)
Thomy Phan (University of Southern California)
|
(Return
to Top) |
Agent-Based
Triangle Counting and Its Applications in Anonymous Graphs (Page
2186)
Prabhat Kumar Chand (Indian Statistical Institute)
Apurba Das (BITS Pilani)
Anisur Rahaman Molla (Indian Statistical Institute)
HLG:
Bridging Human Heuristic Knowledge and Deep Reinforcement Learning for
Optimal Agent Performance (Page
2189)
Bin Chen (University of South Australia)
Zehong Cao (University of South Australia)
Cutsets
and EF1 Fair Division of Graphs (Page
2192)
Jiehua Chen (TU Wien)
William S. Zwicker (Union College)
ANOTO:
Improving Automated Negotiation via Offline-to-Online Reinforcement Learning (Page
2195)
Siqi Chen (School of Information Science and Engineering, Chongqing
Jiaotong University)
Jianing Zhao (College of Intelligence and Computing, Tianjin University)
Kai Zhao (College of Intelligence and Computing, Tianjin University)
Gerhard Weiss (Department of Advanced Computing Sciences, Maastricht
University)
Fengyun Zhang (College of Intelligence and Computing, Tianjin University)
Ran Su (College of Intelligence and Computing, Tianjin University)
Yang Dong (School of Information Science and Engineering, Chongqing
Jiaotong University)
Daqian Li (School of Information Science and Engineering, Chongqing
Jiaotong University)
Kaiyou Lei (College of Computer and Information Science, Southwest
University)
Mastering
Robot Control through Point-based Reinforcement Learning with Pre-training (Page
2198)
Yihong Chen (Tsinghua University)
Cong Wang (Fuxi Robotics in Netease)
Tianpei Yang (University of Alberta)
Meng Wang (Fuxi Robotics in Netease)
Yingfeng Chen (Fuxi Robotics in Netease)
Jifei Zhou (Fuxi Robotics in Netease)
Chaoyi Zhao (Netease Fuxi AI Lab)
Xinfeng Zhang (Netease Fuxi AI Lab)
Zeng Zhao (Netease Fuxi AI Lab)
Changjie Fan (Fuxi Robotics in Netease)
Zhipeng Hu (Fuxi Robotics in Netease)
Rong Xiong (Zhejiang University)
Long Zeng (Tsinghua University)
Quantifying
Agent Interaction in Multi-agent Reinforcement Learning for Cost-efficient
Generalization (Page
2201)
Yuxin Chen (University of California, Berkeley)
Chen Tang (The University of Texas at Austin)
Ran Tian (University of California, Berkeley)
Chenran Li (University of California, Berkeley)
Jinning Li (University of California, Berkeley)
Masayoshi Tomizuka (University of California, Berkeley)
Wei Zhan (University of California, Berkeley)
|
(Return
to Top) |
Cognizing
and Imitating Robotic Skills via a Dual Cognition-Action Architecture (Page
2204)
Zixuan Chen (State Key Laboratory for Novel Software Technology, Nanjing
University)
Ze Ji (Cardiff University)
Shuyang Liu (State Key Laboratory for Novel Software Technology, Nanjing
University)
Jing Huo (State Key Laboratory for Novel Software Technology, Nanjing
University)
Yiyu Chen (State Key Laboratory for Novel Software Technology, Nanjing
University)
Yang Gao (State Key Laboratory for Novel Software Technology, Nanjing
University)
Modelling
the Dynamics of Subjective Identity in Allocation Games (Page
2207)
Janvi Chhabra (International Institute of Information Technology, Bangalore)
Jayati Deshmukh (International Institute of Information Technology,
Bangalore)
Srinath Srinivasa (International Institute of Information Technology,
Bangalore)
Optimal
Task Assignment and Path Planning using Conflict-Based Search with Precedence
and Temporal Constraints (Page
2210)
Yu Quan Chong (Carnegie Mellon University)
Jiaoyang Li (Carnegie Mellon University)
Katia Sycara (Carnegie Mellon University)
Minimizing
Negative Side Effects in Cooperative Multi-Agent Systems using Distributed
Coordination (Page
2213)
Moumita Choudhury (University of Massachusetts Amherst)
Sandhya Saisubramanian (Oregon State University)
Hao Zhang (University of Massachusetts Amherst)
Shlomo Zilberstein (University of Massachusetts Amherst)
A
Reinforcement Learning Framework for Studying Group and Individual Fairness (Page
2216)
Alexandra Cimpean (Vrije Universiteit Brussel)
Catholijn Jonker (Technische Universiteit Delft)
Pieter Libin (Vrije Universiteit Brussel)
Ann Nowé (Vrije Universiteit Brussel)
Near-Optimal
Online Resource Allocation in the Random-Order Model (Page
2219)
Saar Cohen (Bar-Ilan University)
Noa Agmon (Bar-Ilan university)
|
(Return
to Top) |
Inferring
Lewisian Common Knowledge using Theory of Mind Reasoning in a Forward-chaining
Rule Engine (Page
2222)
Stephen Cranefield (University of Otago)
Sriashalya Srivathsan (Eastern University)
Jeremy Pitt (Imperial College London)
Analyzing
Crowdfunding of Public Projects Under Dynamic Beliefs (Page
2225)
Sankarshan Damle (IIIT, Hyderabad)
Sujit Gujar (IIIT, Hyderabad)
No
Transaction Fees? No Problem! Achieving Fairness in Transaction Fee Mechanism
Design (Page 2228)
Sankarshan Damle (IIIT, Hyderabad)
Varul Srivastava (IIIT, Hyderabad)
Sujit Gujar (IIIT, Hyderabad)
Deep
Learning for Population-Dependent Controls in Mean Field Control Problems
with Common Noise (Page
2231)
Gökçe Dayanikli (Univ. of Illinois Urbana-Champaign)
Mathieu Lauričre (Shanghai New York Univ.)
Jiacheng Zhang (Univ. of California Berkeley)
Attila:
A Negotiating Agent for the Game of Diplomacy, Based on Purely Symbolic
A.I. (Page 2234)
Dave de Jonge (IIIA-CSIC)
Laura Rodriguez Cima (IIIA-CSIC)
Evaluation
of Robustness of Off-Road Autonomous Driving Segmentation against Adversarial
Attacks: A Dataset-Centric Study (Page
2237)
Pankaj Deoli (Robotics Research Lab, RPTU)
Rohit Kumar (Robotics Research Lab, RPTU)
Axel Vierling (Robotics Research Lab, RPTU)
Karsten Berns (Robotics Research Lab, RPTU)
|
(Return
to Top) |
A
Comparison of the Myerson Value and the Position Value (Page
2240)
Ayşe Mutlu Derya (TOBB University of Economics and Technology)
Pruning
Neural Networks Using Cooperative Game Theory (Page
2243)
Mauricio Diaz-Ortiz Jr. (Radboud University)
Benjamin Kempinski (Radboud University)
Daphne Cornelisse (New York University)
Yoram Bachrach (Google DeepMind)
Tal Kachman (Radboud University)
Verifying
Proportionality in Temporal Voting (Page
2246)
Edith Elkind (University of Oxford)
Svetlana Obraztsova (Carleton University)
Nicholas Teh (University of Oxford)
Computational
Theory of Mind with Abstractions for Effective Human-Agent Collaboration (Page
2249)
Emre Erdogan (Utrecht University)
Rineke Verbrugge (University of Groningen)
Pinar Yolum (Utrecht University)
Attention
Graph for Multi-Robot Social Navigation with Deep Reinforcement Learning (Page
2252)
Erwan Escudie (Univ. Groningen, Bernoulli Institute, CITI Lab., INRIA-INSA
Chroma, & Univ Lyon, UCBL, LIRIS)
Laetitia Matignon (Univ Lyon, UCBL, CNRS, INSA Lyon, LIRIS, UMR5205)
Jacques Saraydaryan (CPE Lyon, CITI Lab., INRIA-INSA Chroma Team)
Strategic
Cost Selection in Participatory Budgeting (Page
2255)
Piotr Faliszewski (AGH University)
Łukasz Janeczko (AGH University)
Andrzej Kaczmarczyk (AGH University)
Grzegorz Lisowski (AGH University)
Piotr Skowron (University of Warsaw)
Stanisław Szufa (AGH University)
|
(Return
to Top) |
Deceptive
Path Planning via Reinforcement Learning with Graph Neural Networks (Page
2258)
Michael Y. Fatemi (University of Virginia)
Wesley A. Suttle (U.S. Army Research Laboratory)
Brian M. Sadler (U.S. Army Research Laboratory)
Influence-Focused
Asymmetric Island Model (Page
2261)
Andrew Festa (Oregon State University)
Gaurav Dixit (Oregon State University)
Kagan Tumer (Oregon State University)
A
Negotiator's Backup Plan: Optimal Concessions with a Reservation Value (Page
2264)
Tamara C.P. Florijn (Centrum Wiskunde & Informatica and Utrecht University)
Pinar Yolum (Utrecht University)
Tim Baarslag (Centrum Wiskunde & Informatica and Utrecht University)
Aleatoric
Predicates: Reasoning about Marbles (Page
2267)
Tim French (The University of Western Australia)
Synthesizing
Social Laws with ATL Conditions (Page
2270)
Rustam Galimullin (University of Bergen)
Louwe B. Kuijer (University of Liverpool)
Combinatorial
Client-Master Multiagent Deep Reinforcement Learning for Task Offloading
in Mobile Edge Computing (Page
2273)
Tesfay Zemuy Gebrekidan (University of Southampton)
Sebastian Stein (University of Southampton)
Timothy J. Norman (University of Southampton)
|
(Return
to Top) |
Behaviour
Modelling of Social Animals via Causal Structure Discovery and Graph Neural
Networks (Page 2276)
Gaël Gendron (NAOInstitute, The University of Auckland)
Yang Chen (NAOInstitute, The University of Auckland)
Mitchell Rogers (NAOInstitute, The University of Auckland)
Yiping Liu (NAOInstitute, The University of Auckland)
Mihailo Azhar (NAOInstitute, The University of Auckland)
Shahrokh Heidari (NAOInstitute, The University of Auckland)
David Arturo Soriano Valdez (NAOInstitute, The University of Auckland)
Kobe Knowles (NAOInstitute, The University of Auckland)
Padriac O'Leary (NAOInstitute, The University of Auckland)
Simon Eyre (Wellington Zoo)
Michael Witbrock (NAOInstitute, The University of Auckland)
Gillian Dobbie (NAOInstitute, The University of Auckland)
Jiamou Liu (NAOInstitute, The University of Auckland)
Patrice Delmas (NAOInstitute, The University of Auckland)
Benchmarking
MARL on Long Horizon Sequential Multi-Objective Tasks (Page
2279)
Minghong Geng (Singapore Management University)
Shubham Pateria (Singapore Management University)
Budhitama Subagdja (Singapore Management University)
Ah-Hwee Tan (Singapore Management University)
Risk-Sensitive
Multi-Agent Reinforcement Learning in Network Aggregative Markov Games (Page
2282)
Hafez Ghaemi (School of ECE, University of Tehran)
Hamed Kebriaei (School of ECE, University of Tehran)
Alireza Ramezani Moghaddam (School of ECE, University of Tehran)
Majid Nili Ahmadabadi (School of ECE, University of Tehran)
Facility
Location Games with Task Allocation (Page
2285)
Zifan Gong (City University of Hong Kong)
Minming Li (City University of Hong Kong)
Houyu Zhou (City University of Hong Kong)
Indirect
Credit Assignment in a Multiagent System (Page
2288)
Everardo Gonzalez (Oregon State University)
Siddarth Viswanathan (Cal Poly State University)
Kagan Tumer (Oregon State University)
Leveraging
Approximate Model-based Shielding for Probabilistic Safety Guarantees
in Continuous Environments (Page
2291)
Alexander W. Goodall (Imperial College London)
Francesco Belardinelli (Imperial College London)
|
(Return
to Top) |
Reinforcement
Learning for Question Answering in Programming Domain using Public Community
Scoring as a Human Feedback (Page
2294)
Alexey Gorbatovski (ITMO University)
Sergey Kovalchuk (Huawei)
Towards
Socially-Acceptable Multi-Criteria Resolution of the 4D-Contracts Repair
Problem (Page 2297)
Youssef Hamadi (Tempero)
Gauthier Picard (ONERA/DTIS, Université de Toulouse)
Taking
Agent-Based Social Simulation to the Next Level Using Exascale Computing:
Potential Use-Cases, Capacity Requirements and Threats (Page
2300)
Matt Hare (The James Hutton Institute)
Doug Salt (The James Hutton Institute)
Ric Colasanti (University of Glasgow)
Richard Milton (University College London)
Mike Batty (University College London)
Alison Heppenstall (University of Glasgow)
Gary Polhill (The James Hutton Institute)
Addressing
Permutation Challenges in Multi-Agent Reinforcement Learning (Page
2303)
Somnath Hazra (IIT Kharagpur)
Pallab Dasgupta (Synopsys)
Soumyajit Dey (IIT Kharagpur)
Distribution
of Chores with Information Asymmetry (Page
2306)
Hadi Hosseini (Pennsylvania State University)
Joshua Kavner (Rensselaer Polytechnic Institute)
Tomas Wąs (CNRS, LAMSADE, Université Paris Dauphine-PSL)
Lirong Xia (Rensselaer Polytechnic Institute)
Computing
Nash Equilibria in Multidimensional Congestion Games (Page
2309)
Mohammad T. Irfan (Bowdoin College)
Hau Chan (University of Nebraska-Lincoln)
Jared Soundy (University of Nebraska-Lincoln)
|
(Return
to Top) |
Strategic
Routing and Scheduling for Evacuations (Page
2312)
Kazi Ashik Islam (Biocomplexity Institute, University of Virginia)
Da Qi Chen (Biocomplexity Institute, University of Virginia)
Madhav Marathe (Biocomplexity Institute, University of Virginia)
Henning Mortveit (Biocomplexity Institute, University of Virginia)
Samarth Swarup (Biocomplexity Institute, University of Virginia)
Anil Vullikanti (Biocomplexity Institute, University of Virginia)
Dual-Policy-Guided
Offline Reinforcement Learning with Optimal Stopping (Page
2315)
Weibo Jiang (Tsinghua Shenzhen International Graduate School, Tsinghua
University)
Shaohui Li (Tsinghua Shenzhen International Graduate School, Tsinghua
University)
Zhi Li (Tsinghua Shenzhen International Graduate School, Tsinghua University)
Yuxin Ke (Tsinghua Shenzhen International Graduate School, Tsinghua
University)
Zhizhuo Jiang (Tsinghua Shenzhen International Graduate School, Tsinghua
University)
Yaowen Li (Tsinghua Shenzhen International Graduate School, Tsinghua
University)
Yu Liu (Department of Electronics, Tsinghua University)
NPPP-Completeness
of Control by Adding Players to Change the Penrose-Banzhaf Power Index
in Weighted Voting Games (Page
2318)
Joanna Kaczmarek (Heinrich-Heine-Universität Düsseldorf)
Jörg Rothe (Heinrich-Heine-Universität Düsseldorf)
TIMAT:
Temporal Information Multi-Agent Transformer (Page
2321)
Qitong Kang (Nankai University)
Fuyong Wang (Nankai University)
Zhongxin Liu (Nankai University)
Zengqiang Chen (Nankai University)
On
the Computational Complexity of Quasi-Variational Inequalities and Multi-Leader-Follower
Games (Page 2324)
Bruce M. Kapron (University of Victoria)
Koosha Samieefar (University of Victoria)
Contiguous
Allocation of Binary Valued Indivisible Items on a Path (Page
2327)
Yasushi Kawase (The University of Tokyo)
Bodhayan Roy (Indian Institute of Technology Kharagpur)
Mohammad Azharuddin Sanpui (Indian Institute of Technology Kharagpur)
|
(Return
to Top) |
Decentralized
Safe Control for Multi-Robot Navigation in Dynamic Environments with Limited
Sensing (Page 2330)
Saad Khan (Indian Institute of Technology, Bombay)
Mayank Baranwal (Tata Consultancy Services Research)
Srikant Sukumar (Indian Institute of Technology, Bombay)
GLIDE-RL:
Grounded Language Instruction through DEmonstration in RL (Page
2333)
Chaitanya Kharyal (Microsoft)
Sai Krishna Gottipati (AI Redefined)
Tanmay Sinha (Microsoft Research)
Srijita Das (University of Michigan-Dearborn)
Matthew E. Taylor (AI Redefined & University of Alberta)
Electric
Vehicle Routing for Emergency Power Supply with Deep Reinforcement Learning (Page
2336)
Daisuke Kikuta (NTT Corporation)
Hiroki Ikeuchi (NTT Corporation)
Kengo Tajiri (NTT Corporation)
Yuta Toyama (NTT DOCOMO)
Masaki Nakamura (NTT DOCOMO)
Yuusuke Nakano (NTT Corporation)
Difference
of Convex Functions Programming for Policy Optimization in Reinforcement
Learning (Page 2339)
Akshat Kumar (Singapore Management University)
Deep
Hawkes Process for High-Frequency Market Making (Page
2342)
Pankaj Kumar (Jheronimus Academy of Data Science)
Fair
Scheduling of Indivisible Chores (Page
2345)
Yatharth Kumar (IIT Delhi)
Sarfaraz Equbal (IIT Bombay)
Rohit Gurjar (IIT Bombay)
Swaprava Nath (IIT Bombay)
Rohit Vaish (IIT Delhi)
|
(Return
to Top) |
Guided
Exploration in Reinforcement Learning via Monte Carlo Critic Optimization (Page
2348)
Igor Kuznetsov (Independent Researcher)
A
SAT-based Approach for Argumentation Dynamics (Page
2351)
Jean-Marie Lagniez (CRIL, Université d'Artois - CNRS)
Emmanuel Lonca (CRIL, Université d'Artois - CNRS)
Jean-Guy Mailly (Université Paris Cité, LIPADE)
Which
Games are Unaffected by Absolute Commitments? (Page
2354)
Daji Landis (Bocconi University)
Nikolaj I. Schwartzbach (Bocconi University)
ELA:
Exploited Level Augmentation for Offline Learning in Zero-Sum Games (Page
2357)
Shiqi Lei (Institute of Automation, Chinese Academy of Sciences)
Kanghoon Lee (KAIST)
Linjing Li (Institute of Automation, Chinese Academy of Sciences)
Jinkyoo Park (KAIST)
Jiachen Li (University of California, Riverside)
From
Explicit Communication to Tacit Cooperation: A Novel Paradigm for Cooperative
MARL (Page 2360)
Dapeng Li (Institute of Automation, Chinese Academy of Sciences & School
of Artificial Intelligence, University of Chinese Academy of Sciences)
Zhiwei Xu (Institute of Automation, Chinese Academy of Sciences & School
of Artificial Intelligence, University of Chinese Academy of Sciences)
Bin Zhang (Institute of Automation, Chinese Academy of Sciences & School
of Artificial Intelligence, University of Chinese Academy of Sciences)
Guangchong Zhou (Institute of Automation, Chinese Academy of Sciences
& School of Artificial Intelligence, University of Chinese Academy of
Sciences)
Zeren Zhang (Institute of Automation, Chinese Academy of Sciences &
School of Artificial Intelligence, University of Chinese Academy of Sciences)
Guoliang Fan (Institute of Automation, Chinese Academy of Sciences
& School of Artificial Intelligence, University of Chinese Academy of
Sciences)
Efficient
Collaboration with Unknown Agents: Ignoring Similar Agents without Checking
Similarity (Page
2363)
Yansong Li (University of Illinois Chicago)
Shuo Han (University of Illinois Chicago)
|
(Return
to Top) |
Simple
k-crashing Plan with a Good Approximation Ratio (Page
2366)
Ruixi Luo (Shenzhen Campus of Sun Yat-sen University)
Kai Jin (Shenzhen Campus of Sun Yat-sen University)
Zelin Ye (Shenzhen Campus of Sun Yat-sen University)
Towards
Understanding How to Reduce Generalization Gap in Visual Reinforcement
Learning (Page 2369)
Jiafei Lyu (Tsinghua University)
Le Wan (Tencent)
Xiu Li (Tsinghua University)
Zongqing Lu (Peking University)
Opinion
Diffusion on Society Graphs Based on Approval Ballots (Page
2372)
Jayakrishnan Madathil (University of Glasgow)
Neeldhara Misra (Indian Institute of Technology, Gandhinagar)
Yash More (Indian Institute of Technology, Gandhinagar)
Time-Constrained
Restless Multi-Armed Bandits with Applications to City Service Scheduling (Page
2375)
Yi Mao (The Ohio State University)
Andrew Perrault (The Ohio State University)
Multi-level
Aggregation with Delays and Stochastic Arrivals (Page
2378)
Mathieu Mari (LIRMM, University of Montpellier)
Michał Pawłowski (University of Warsaw, IDEAS NCBR, & Sapienza University
of Rome)
Runtian Ren (IDEAS NCBR)
Piotr Sankowski (University of Warsaw, IDEAS NCBR, & MIM Solutions)
Projection-Optimal
Monotonic Value Function Factorization in Multi-Agent Reinforcement Learning (Page
2381)
Yongsheng Mei (The George Washington University)
Hanhan Zhou (The George Washington University)
Tian Lan (The George Washington University)
|
(Return
to Top) |
Shield
Decentralization for Safe Reinforcement Learning in General Partially
Observable Multi-Agent Environments (Page
2384)
Daniel Melcer (Northeastern University)
Christopher Amato (Northeastern University)
Stavros Tripakis (Northeastern University)
Enhancing
Search and Rescue Capabilities in Hazardous Communication-Denied Environments
through Path-Based Sensors with Backtracking (Page
2387)
Alexander Mendelsohn (University of Maryland)
Donald Sofge (Naval Research Laboratory)
Michael Otte (University of Maryland)
Fairness
in Repeated House Allocation (Page
2390)
Karl Jochen Micheel (Heinrich-Heine-Universität Düsseldorf)
Anaëlle Wilczynski (MICS, CentraleSupélec, Université Paris-Saclay)
Continual
Depth-limited Responses for Computing Counter-strategies in Sequential
Games (Page 2393)
David Milec (AI Center, FEE, CTU in Prague)
Ondřej Kubíček (AI Center, FEE, CTU in Prague)
Viliam Lisý (AI Center, FEE, CTU in Prague)
Simulated
Robotic Soft Body Manipulation (Page
2396)
Glareh Mir (Institute for Artificial Intelligence, University of Bremen)
Michael Beetz (Institute for Artificial Intelligence, University of
Bremen)
Leveraging
Sub-Optimal Data for Human-in-the-Loop Reinforcement Learning (Page
2399)
Calarina Muslimani (University of Alberta)
Matthew E. Taylor (University of Alberta & Alberta Machine Intelligence
Institute (Amii))
MA-MIX:
Value Function Decomposition for Cooperative Multiagent Reinforcement
Learning Based on Multi-Head Attention Mechanism (Page
2402)
Yu Niu (Inner Mongolia University)
Hengxu Zhao (Inner Mongolia University)
Lei Yu (Inner Mongolia University)
Ontological
Modeling and Reasoning for Comparison and Contrastive Narration of Robot
Plans (Page 2405)
Alberto Olivares-Alarcos (Institut de Robňtica i Informŕtica Industrial,
CSIC-UPC)
Sergi Foix (Institut de Robňtica i Informŕtica Industrial, CSIC-UPC)
Júlia Borrŕs (Institut de Robňtica i Informŕtica Industrial, CSIC-UPC)
Gerard Canal (Department of Informatics, King's College London)
Guillem Alenyŕ (Institut de Robňtica i Informŕtica Industrial, CSIC-UPC)
Sentimental
Agents: Combining Sentiment Analysis and Non-Bayesian Updating for Cooperative
Decision-Making (Page
2408)
Daniele Orner (Brave Venture Labs)
Elizabeth Akinyi Ondula (University of Southern California)
Nick Mumero Mwangi (Brave Venture Labs)
Richa Goyal (University of Southern California)
DCT:
Dual Channel Training of Action Embeddings for Reinforcement Learning
with Large Discrete Action Spaces (Page
2411)
Pranavi Pathakota (TCS Research)
Hardik Meisheri (TCS Research)
Harshad Khadilkar (TCS Research & IIT Bombay)
Incentive-based
MARL Approach for Commons Dilemmas in Property-based Environments (Page
2414)
Lukasz Pelcner (Lancaster University)
Matheus Aparecido do Carmo Alves (Lancaster University)
Leandro Soriano Marcolino (Lancaster University)
Paula Harrison (UK Centre for Ecology & Hydrology)
Peter Atkinson (Lancaster University)
Decision
Making in Non-Stationary Environments with Policy-Augmented Search (Page
2417)
Ava Pettet (Vanderbilt University)
Yunuo Zhang (Vanderbilt University)
Baiting Luo (Vanderbilt University)
Kyle Wray (Stanford University)
Hendrik Baier (Eindhoven University of Technology)
Aron Laszka (Pennsylvania State University)
Abhishek Dubey (Vanderbilt University)
Ayan Mukhopadhyay (Vanderbilt University)
Optimal
Majority Rules and Quantitative Condorcet Properties of Setwise Kemeny
Voting Schemes (Page
2420)
Xuan Kien Phung (Université de Montréal)
Sylvie Hamel (Université de Montréal)
Fully
Independent Communication in Multi-Agent Reinforcement Learning (Page
2423)
Rafael Pina (Loughborough University London)
Varuna De Silva (Loughborough University London)
Corentin Artaud (Loughborough University London)
Xiaolan Liu (Loughborough University London)
Emergent
Dominance Hierarchies in Reinforcement Learning Agents (Page
2426)
Ram Rachum (Independent Researcher)
Yonatan Nakar (Tel Aviv University)
Bill Tomlinson (University of California, Irvine)
Nitay Alon (Max Planck Institute, Bio-Cybernetics & Hebrew University
of Jerusalem)
Reuth Mirsky (Bar-Ilan University)
GOV-REK:
Governed Reward Engineering Kernels for Designing Robust Multi-Agent Reinforcement
Learning Systems (Page
2429)
Ashish Rana (Institute for Enterprise Systems, University of Mannheim)
Michael Oesterle (Institute for Enterprise Systems, University of Mannheim)
Jannik Brinkmann (Institute for Enterprise Systems, University of Mannheim)
|
(Return
to Top) |
Banzhaf
Power in Hierarchical Games (Page
2432)
John Randolph (Brown University)
Amy Greenwald (Brown University)
Denizalp Goktas (Brown University)
BAR
Nash Equilibrium and Application to Blockchain Design (Page
2435)
Maxime Reynouard (Nomadic Labs & LAMSADE, Université - PSL)
Olga Gorelkina (Mohammed VI Polytechnic University & University of
Liverpool)
Rida Laraki (Mohammed VI Polytechnic University & CNRS (Dauphine -
PSL))
Psychophysiological
Models of Cognitive States Can Be Operator-Agnostic (Page
2438)
Erin E. Richardson (University of Colorado Boulder)
Savannah L. Buchner (University of Colorado Boulder)
Jacob R. Kintz (University of Colorado Boulder)
Torin K. Clark (University of Colorado Boulder)
Allison P. Anderson (University of Colorado Boulder)
The
Selfishness Level of Social Dilemmas (Page
2441)
Stefan Roesch (King's College London)
Stefanos Leonardos (King's College London)
Yali Du (King's College London)
JaxMARL:
Multi-Agent RL Environments and Algorithms in JAX (Page
2444)
Alexander Rutherford (University of Oxford)
Benjamin Ellis (University of Oxford)
Matteo Gallici (Universitat Politčcnica de Catalunya)
Jonathan Cook (University of Oxford)
Andrei Lupu (University of Oxford)
Garđar Ingvarsson (University College London)
Timon Willi (University of Oxford)
Akbir Khan (University College London)
Christian Schroeder de Witt (University of Oxford)
Alexandra Souly (University College London)
Saptarashmi Bandyopadhyay (University of Maryland)
Mikayel Samvelyan (University College London)
Minqi Jiang (University College London)
Robert Lange (Technical University Berlin)
Shimon Whiteson (University of Oxford)
Bruno Lacerda (University of Oxford)
Nick Hawes (University of Oxford)
Tim Rocktäschel (University College London)
Chris Lu (University of Oxford)
Jakob Foerster (University of Oxford)
Source
Detection in Networks using the Stationary Distribution of a Markov Chain (Page
2447)
Yael Sabato (Ariel Universiry)
Amos Azaria (Ariel Universiry)
Noam Hazon (Ariel Universiry)
|
(Return
to Top) |
Social
Identities and Responsible Agency (Page
2450)
Karthik Sama (International Institute of Information Technology, Bangalore)
Jayati Deshmukh (International Institute of Information Technology,
Bangalore)
Srinath Srinivasa (International Institute of Information Technology,
Bangalore)
Centralized
Training with Hybrid Execution in Multi-Agent Reinforcement Learning (Page
2453)
Pedro P. Santos (Instituto Superior Técnico & INESC-ID)
Diogo S. Carvalho (Instituto Superior Técnico & INESC-ID)
Miguel Vasco (KTH Royal Institute of Technology)
Alberto Sardinha (Pontifical Catholic University of Rio de Janeiro
& INESC-ID)
Pedro A. Santos (Instituto Superior Técnico & INESC-ID)
Ana Paiva (Instituto Superior Técnico & INESC-ID)
Francisco S. Melo (Instituto Superior Técnico & INESC-ID)
Geospatial
Active Search for Preventing Evictions (Page
2456)
Anindya Sarkar (Washington University in St. Louis)
Alex DiChristofano (Washington University in St. Louis)
Sanmay Das (George Mason University)
Patrick J. Fowler (Washington University in St. Louis)
Nathan Jacobs (Washington University in St. Louis)
Yevgeniy Vorobeychik (Washington University in St. Louis)
Balanced
and Incentivized Learning with Limited Shared Information in Multi-agent
Multi-armed Bandit (Page
2459)
Junning Shao (Tsinghua University & Shanghai Qi Zhi Institute)
Siwei Wang (Microsoft Research)
Zhixuan Fang (Tsinghua University & Shanghai Qi Zhi Institute)
|
(Return
to Top) |
Cournot
Queueing Games with Applications to Mobility Systems (Page
2462)
Matthew Sheldon (Imperial College London)
Dario Paccagnan (Imperial College London)
Giuliano Casale (Imperial College London)
OPEx:
A Large Language Model-Powered Framework for Embodied Instruction Following (Page
2465)
Haochen Shi (Université de Montréal & Mila)
Zhiyuan Sun (Université de Montréal & Mila)
Xingdi Yuan (Microsoft Research)
Marc-Alexandre Côté (Microsoft Research)
Bang Liu (Université de Montréal & Mila)
Fairness
and Cooperation between Independent Reinforcement Learners through Indirect
Reciprocity (Page
2468)
Jacobus Smit (University of Amsterdam)
Fernando P. Santos (University of Amsterdam)
Fairness
and Privacy Guarantees in Federated Contextual Bandits (Page
2471)
Sambhav Solanki (IIITH)
Sujit Gujar (IIITH)
Shweta Jain (IIT Ropar)
Fairness
of Exposure in Online Restless Multi-armed Bandits (Page
2474)
Archit Sood (Indian Institute of Technology Ropar)
Shweta Jain (Indian Institute of Technology Ropar)
Sujit Gujar (International Institute of Information Technology, Hyderabad)
Unlocking
the Potential of Machine Ethics with Explainability (Page
2477)
Timo Speith (University of Bayreuth)
|
(Return
to Top) |
Hybrid
Participatory Budgeting: Divisible, Indivisible, and Beyond (Page
2480)
Gogulapati Sreedurga (University of Edinburgh)
Decent-BRM:
Decentralization through Block Reward Mechanisms (Page
2483)
Varul Srivastava (IIIT, Hyderabad)
Sujit Gujar (IIIT, Hyderabad)
Ethical
Markov Decision Processes with Moral Worth as Rewards (Page
2486)
Mihail Stojanovski (Normandie Univ, UNICAEN, ENSICAEN, CNRS, GREYC)
Nadjet Bourdache (Normandie Univ, UNICAEN, ENSICAEN, CNRS, GREYC)
Grégory Bonnet (Normandie Univ, UNICAEN, ENSICAEN, CNRS, GREYC)
Abdel-Illah Mouaddib (Normandie Univ, UNICAEN, ENSICAEN, CNRS, GREYC)
A
Multiagent Path Search Algorithm for Large-Scale Coalition Structure Generation (Page
2489)
Redha Taguelmimt (Univ Lyon, UCBL, CNRS, INSA Lyon, Centrale Lyon,
Univ Lyon 2, LIRIS, UMR5205)
Samir Aknine (Univ Lyon, UCBL, CNRS, INSA Lyon, Centrale Lyon, Univ
Lyon 2, LIRIS, UMR5205)
Djamila Boukredera (Faculty of Exact Sciences, Laboratory of Applied
Mathematics, University of Bejaia)
Narayan Changder (TCG Centres for Research and Education in Science
and Technology)
Tuomas Sandholm (Carnegie Mellon University, Strategic Machine, Inc.,
Strategy Robot, Inc., & Optimized Markets, Inc.)
Efficient
Size-based Hybrid Algorithm for Optimal Coalition Structure Generation (Page
2492)
Redha Taguelmimt (Univ Lyon, UCBL, CNRS, INSA Lyon, Centrale Lyon,
Univ Lyon 2, LIRIS, UMR5205)
Samir Aknine (Univ Lyon, UCBL, CNRS, INSA Lyon, Centrale Lyon, Univ
Lyon 2, LIRIS, UMR5205)
Djamila Boukredera (Faculty of Exact Sciences, Laboratory of Applied
Mathematics, University of Bejaia)
Narayan Changder (TCG Centres for Research and Education in Science
and Technology)
Tuomas Sandholm (Carnegie Mellon University, Strategic Machine, Inc.,
Strategy Robot, Inc., & Optimized Markets, Inc.)
Pure
Nash Equilibria in Weighted Congestion Games with Complementarities and
Beyond (Page 2495)
Kenjiro Takazawa (Hosei University)
|
(Return
to Top) |
HiMAP:
Learning Heuristics-Informed Policies for Large-Scale Multi-Agent Pathfinding (Page
2498)
Huijie Tang (KAIST)
Federico Berto (KAIST)
Zihan Ma (KAIST)
Chuanbo Hua (KAIST)
Kyuree Ahn (Omelet)
Jinkyoo Park (KAIST, Omelet)
Fuzzy
Clustered Federated Learning Under Mixed Data Distributions (Page
2501)
Peng Tang (ShangHai Jiao Tong University)
Lifan Wang (ShangHai Jiao Tong University)
Weidong Qiu (ShangHai Jiao Tong University)
Zheng Huang (ShangHai Jiao Tong University)
Qiangmin Wang (ShangHai Jiao Tong University)
Neurological
Based Timing Mechanism for Reinforcement Learning (Page
2504)
Michael J. Tarlton (Oslo Metropolitan University)
Gustavo B. Mello (Oslo Metropolitan University)
Anis Yazidi (Oslo Metropolitan University)
Unifying
Regret and State-Action Space Coverage for Effective Unsupervised Environment
Design (Page 2507)
Jayden Teoh Jing Teoh (Singapore Management University)
Wenjun Li (Singapore Management University)
Pradeep Varakantham (Singapore Management University)
Persuasion
by Shaping Beliefs about Multidimensional Features of a Thing (Page
2510)
Kazunori Terada (Gifu University)
Yasuo Noma (Gifu University)
Masanori Hattori (Chubu Electric Power Co., Inc.)
Game
Transformations That Preserve Nash Equilibria or Best-Response Sets (Page
2513)
Emanuel Tewolde (Foundations of Cooperative AI Lab (FOCAL), Computer
Science Department, Carnegie Mellon University)
Vincent Conitzer (Foundations of Cooperative AI Lab (FOCAL), Computer
Science Department, Carnegie Mellon University)
|
(Return
to Top) |
Consensus
of Nonlinear Multi-Agent Systems with Semi-Markov Switching Under DoS
Attacks (Page 2516)
Sheng Tian (Faculty of Applied Sciences, Macao Polytechnic University)
Hong Shen (School of Engineering and Technology, Central Queensland
University & Faculty of Applied Sci., Macao Polytechnic University)
Yuan Tian (College of Intelligent Technology and Engineering, Chongqing
University of Science and Technology)
Hui Tian (School of Information and Communication Technology, Griffith
University)
Reducing
Systemic Risk in Financial Networks through Donations (Page
2519)
Jinyun Tong (King's College London)
Bart De Keijzer (King's College London)
Carmine Ventre (King's College London)
Joint
Intrinsic Motivation for Coordinated Exploration in Multi-Agent Deep Reinforcement
Learning (Page 2522)
Maxime Toquebiau (ECE Paris & Sorbonne Université, CNRS, ISIR)
Nicolas Bredeche (Sorbonne Université, CNRS, ISIR)
Faďz Benamar (Sorbonne Université, CNRS, ISIR)
Jae-Yun Jun (ECE Paris)
Embracing
Relational Reasoning in Multi-Agent Actor-Critic (Page
2525)
Sharlin Utke (University of Warwick)
Jeremie Houssineau (Nanyang Technological University)
Giovanni Montana (University of Warwick)
Bayesian
Ensembles for Exploration in Deep Q-Learning (Page
2528)
Pascal R. van der Vaart (Delft University of Technology)
Neil Yorke-Smith (Delft University of Technology)
Matthijs T. J. Spaan (Delft University of Technology)
Understanding
the Impact of Promotions on Consumer Behavior (Page
2531)
Jarod Vanderlynden (Univ. Lille, CNRS, Centrale Lille, UMR & fifty-five)
Philippe Mathieu (Univ. Lille, CNRS, Centrale Lille, UMR)
Romain Warlop (fifty-five)
|
(Return
to Top) |
On
the existence of EFX under picky or non-differentiative agents (Page
2534)
Maya Viswanathan (University of Illinois Laboratory High School)
Ruta Mehta (University of Illinois, Urbana-Champaign)
Explaining
Sequences of Actions in Multi-agent Deep Reinforcement Learning Models (Page
2537)
Khaing Phyo Wai (Singapore Management University)
Minghong Geng (Singapore Management University)
Shubham Pateria (Singapore Management University)
Budhitama Subagdja (Singapore Management University)
Ah-Hwee Tan (Singapore Management University)
Clique
Analysis and Bypassing in Continuous-Time Conflict-Based Search (Page
2540)
Thayne T. Walker (University of Denver & Lockheed Martin Corporation)
Nathan R. Sturtevant (Department of Computing Science, Alberta Machine
Intelligence Institute (Amii), University of Alberta)
Ariel Felner (Ben-Gurion University)
Detecting
Anomalous Agent Decision Sequences Based on Offline Imitation Learning (Page
2543)
Chen Wang (The University of Melbourne)
Sarah Erfani (The University of Melbourne)
Tansu Alpcan (The University of Melbourne)
Christopher Leckie (The University of Melbourne)
On
the Utility of External Agent Intention Predictor for Human-AI Coordination (Page
2546)
Chenxu Wang (Tsinghua University)
Zilong Chen (Tsinghua University)
Huaping Liu (Tsinghua University)
Decision
Market Based Learning for Multi-agent Contextual Bandit Problems (Page
2549)
Wenlong Wang (Massey University)
Thomas Pfeiffer (Massey University)
|
(Return
to Top) |
Reinforcement
Nash Equilibrium Solver (Page
2552)
Xinrun Wang (Nanyang Technological University)
Chang Yang (The Hong Kong Polytechnic University)
Shuxin Li (Nanyang Technological University)
Pengdeng Li (Nanyang Technological University)
Xiao Huang (The Hong Kong Polytechnic University)
Hau Chan (University of Nebraska-Lincoln)
Bo An (Nanyang Technological University)
Potential
Games on Cubic Splines for Multi-Agent Motion Planning of Autonomous Agents (Page
2555)
Sam Williams (University of Southern California)
Jyotirmoy Deshmukh (University of Southern California)
Competitive
Analysis of Online Facility Open Problem (Page
2558)
Binghan Wu (The University of Sydney)
Wei Bao (The University of Sydney)
Bing Zhou (The University of Sydney)
Population-aware
Online Mirror Descent for Mean-Field Games by Deep Reinforcement Learning (Page
2561)
Zida Wu (University of California, Los Angeles)
Mathieu Lauriere (New York University Shanghai)
Samuel Jia Cong Chua (University of California, Los Angeles)
Matthieu Geist (Cohere)
Olivier Pietquin (Cohere)
Ankur Mehta (University of California, Los Angeles)
Truthful
and Stable One-sided Matching on Networks (Page
2564)
Tianyi Yang (ShanghaiTech University)
Yuxiang Zhai (ShanghaiTech University)
Dengji Zhao (ShanghaiTech University)
Xinwei Song (ShanghaiTech University)
Miao Li (ShanghaiTech University)
On
the Complexity of Candidates-Embedded Multiwinner Voting under the Hausdorff
Function (Page 2567)
Yongjie Yang (Chair of Economic Theory, Saarland University)
|
(Return
to Top) |
Dual
Role AoI-based Incentive Mechanism for HD map Crowdsourcing (Page
2570)
Wentao Ye (Chinese University of Hong Kong, Shenzhen & Shenzhen Institute
of Artificial Intelligence and Robotics for Society)
Bo Liu (Shenzhen Institute of Artificial Intelligence and Robotics
for Society)
Yuan Luo (Chinese University of Hong Kong, Shenzhen & Shenzhen Institute
of Artificial Intelligence and Robotics for Society)
Jianwei Huang (Chinese University of Hong Kong, Shenzhen & Shenzhen
Institute of Artificial Intelligence and Robotics for Society)
Toward
Socially Friendly Autonomous Driving Using Multi-agent Deep Reinforcement
Learning (Page 2573)
Jhih-Ching Yeh (National Tsing Hua University)
Von-Wun Soo (Chang Gung University)
Solving
Offline 3D Bin Packing Problem with Large-sized Bin via Two-stage Deep
Reinforcement Learning (Page
2576)
Hao Yin (Southwest Jiaotong University)
Fan Chen (Southwest Jiaotong University)
Hongjie He (Southwest Jiaotong University)
Overview of t-DGR: A Trajectory-Based Deep Generative Replay Method for Continual Learning in Decision Making (Page
2579)
William Yue (The University of Texas at Austin)
Bo Liu (The University of Texas at Austin)
Peter Stone (The University of Texas at Austin)
MATLight:
Traffic Signal Coordinated Control Algorithm based on Heterogeneous-Agent
Mirror Learning with Transformer (Page
2582)
Haipeng Zhang (Guangxi University of Science and Technology)
Zhiwen Wang (Guangxi University of Science and Technology)
Na Li (Guangxi University of Science and Technology)
PADDLE:
Logic Program Guided Policy Reuse in Deep Reinforcement Learning (Page
2585)
Hao Zhang (College of Intelligence and Computing, Tianjin University)
Tianpei Yang (University of Alberta & Alberta Machine Intelligence
Institute)
Yan Zheng (College of Intelligence and Computing, Tianjin University)
Jianye Hao (College of Intelligence and Computing, Tianjin University)
Matthew E. Taylor (University of Alberta & Alberta Machine Intelligence
Institute)
|
(Return
to Top) |
Bellman
Momentum on Deep Reinforcement Learning (Page
2588)
Huihui Zhang (Dongsheng Intelligent Technolody Co., Ltd.)
Auto-Encoding
Adversarial Imitation Learning (Page
2591)
Kaifeng Zhang (Shanghai Qi Zhi Institute)
Rui Zhao (Tencent AI Lab)
Ziming Zhang (Worcester Polytechnic Institute)
Yang Gao (Tsinghua University, Shanghai Qi Zhi Institute, & Shanghai
Artificial Intelligence Laboratory)
Large
Language Model Assissted Multi-Agent Dialogue for Ontology Alignment (Page
2594)
Shiyao Zhang (Xi'an Jiaotong-Liverpool University)
Yuji Dong (Xi'an Jiaotong-Liverpool University)
Yichuan Zhang (Xi'an Jiaotong-Liverpool University)
Terry R. Payne (University of Liverpool)
Jie Zhang (Xi'an Jiaotong-Liverpool University)
Mutual
Information as Intrinsic Reward of Reinforcement Learning Agents for On-demand
Ride Pooling (Page
2597)
Xianjie Zhang (Dalian University of Technology)
Jiahao Sun (Dalian University of Technology)
Chen Gong (University of Virginia)
Kai Wang (Nanyang Technological University)
Yifei Cao (Dalian University of Technology)
Hao Chen (Institute of Automation, Chinese Academy of Sciencec)
Yu Liu (Dalian University of Technology)
Optimal
Diffusion Auctions (Page
2600)
Yao Zhang (ShanghaiTech University)
Shanshan Zheng (ShanghaiTech University)
Dengji Zhao (ShanghaiTech University)
Decentralized
Competing Bandits in Many-to-One Matching Markets (Page
2603)
Yirui Zhang (Tsinghua University & Shanghai Qi Zhi Institute)
Zhixuan Fang (Tsinghua University & Shanghai Qi Zhi Institute)
|
(Return
to Top) |
Distance-Aware
Attentive Framework for Multi-Agent Collaborative Perception in Presence
of Pose Error (Page
2606)
Binyu Zhao (Harbin Institute of Technology)
Wei Zhang (Harbin Institute of Technology)
Zhaonian Zou (Harbin Institute of Technology)
ENOTO:
Improving Offline-to-Online Reinforcement Learning with Q-Ensembles (Page
2609)
Kai Zhao (College of Intelligence and Computing, Tianjin University)
Jianye Hao (College of Intelligence and Computing, Tianjin University)
Yi Ma (College of Intelligence and Computing, Tianjin University)
Jinyi Liu (College of Intelligence and Computing, Tianjin University)
Yan Zheng (College of Intelligence and Computing, Tianjin University)
Zhaopeng Meng (College of Intelligence and Computing, Tianjin University)
JDRec:
Practical Actor-Critic Framework for Online Combinatorial Recommender
System (Page 2612)
Xin Zhao (Tsinghua University)
Jiaxin Li (Tsinghua University)
Zhiwei Fang (JD.com)
Yuchen Guo (Tsinghua University)
Jinyuan Zhao (JD.com)
Jie He (JD.com)
Wenlong Chen (JD.com)
Changping Peng (JD.com)
Guiguang Ding (Tsinghua University)
Bootstrapped
Policy Learning: Goal Shaping for Efficient Task-oriented Dialogue Policy
Learning (Page 2615)
Yangyang Zhao (Changsha University of Science and Technologu & Utrecht
University)
Mehdi Dastani (Utrecht University)
Shihan Wang (Utrecht University)
Towards
Zero Shot Learning in Restless Multi-armed Bandits (Page
2618)
Yunfan Zhao (Harvard University)
Nikhil Behari (Harvard University)
Edward Hughes (Google)
Edwin Zhang (Harvard University)
Dheeraj Nagaraj (Google)
Karl Tuyls (Google)
Aparna Taneja (Google)
Milind Tambe (Harvard University & Google)
vMFER:
von Mises-Fisher Experience Resampling Based on Uncertainty of Gradient
Directions for Policy Improvement of Actor-Critic Algorithms (Page
2621)
Yiwen Zhu (Zhejiang University)
Jinyi Liu (Tianjin University)
Wenya Wei (Zhejiang University)
Qianyi Fu (Zhejiang University)
Yujing Hu (NetEase Fuxi AI Lab)
Zhou Fang (Zhejiang University)
Bo An (Nanyang Technological University)
Jianye Hao (Tianjin University)
Tangjie Lv (NetEase Fuxi AI Lab)
Changjie Fan (NetEase Fuxi AI Lab) |
(Return
to Top) |
AAAI
Track
|
(Return
to Top) |
Controlling
Delegations in Liquid Democracy (Page
2624)
Shiri Alouf-Heffetz (Ben Gurion University)
Tanmay Inamdar (Indian Institute of Technology Jodhpur)
Pallavi Jain (Indian Institute of Technology Jodhpur)
Nimrod Talmon (Ben-Gurion University)
Yash More Hiren (Indian Institute of Technology Gandhinagar)
Regret-based
Defense in Adversarial Reinforcement Learning (Page
2633)
Roman Belaire (Singapore Management University)
Pradeep Varakantham (Singapore Management University)
Thanh Nguyen (University of Oregon)
David Lo (Singapore Management University)
Fair
and Efficient Division of a Discrete Cake with Switching Utility Loss (Page
2641)
Zheng Chen (College of Computer Science and Technology, Zhejiang University)
Bo Li (Department of Computing, The Hong Kong Polytechnic University)
Minming Li (Department of Computer Science, The Hong Kong Polytechnic
University)
Guochuan Zhang (College of Computer Science and Technology, Zhejiang
University)
MAGNets:
Micro-Architectured Group Neural Networks (Page
2650)
Sumanta Dey (IIT Kharagpur)
Briti Gangopadhyay (IIT Kharagpur)
Pallb Dasgupta (Synopsys Inc.)
Soumyajit Dey (IIT Kharagpur)
Budget-feasible
Egalitarian Allocation of Conflicting Jobs (Page
2659)
Sushmita Gupta (The Institute of Mathematical Science)
Pallavi Jain (Indian Institute of Technology Jodhpur)
A Mohanapriya (The Institute of Mathematical Sciences)
Vikash Tripathi (The Institute of Mathematical Sciences) |
(Return
to Top) |
Blue
Sky Ideas Track
Multi-deal
Negotiation (Page
2668)
Tim Baarslag (Centrum Wiskunde & Informatica)
|
(Return
to Top) |
Going
Beyond Mono-Mission Earth Observation: Using the Multi-Agent Paradigm
to Federate Multiple Missions (Page
2674)
Jean-Loup Farges (ONERA/DTIS, Université de Toulouse)
Filipo Perotto (ONERA/DTIS, Université de Toulouse)
Gauthier Picard (ONERA, Université de Toulouse)
Cédric Pralet (ONERA/DTIS, Université de Toulouse)
Cyrille de Lussy (Airbus Defence and Space)
Jonathan Guerra (Airbus Defence and Space)
Philippe Pavero (Airbus Defence and Space)
Fabrice Planchou (Airbus Defence and Space)
Empowering
BDI Agents with Generalised Decision-Making (Page
2679)
Ramon Fraga Pereira (University of Manchester)
Felipe Meneguzzi (University of Aberdeen)
Adaptive
Incentive Engineering in Citizen-Centric AI (Page
2684)
Behrad Koohy (University of Southampton)
Jan Buermann (University of Southampton)
Vahid Yazdanpanah (University of Southampton)
Pamela Briggs (Northumbria University)
Paul Pschierer-Barnfather (Zaptec)
Enrico Gerding (University of Southampton)
Sebastian Stein (University of Southampton)
Designing
Artificial Reasoners for Communication (Page
2690)
Emiliano Lorini (IRIT, CNRS, Toulouse University)
Towards
Sustainable Human-Agent Teams: A Framework for Understanding Human-Agent
Team Dynamics (Page
2696)
Rui Prada (INESC-ID & Instituto Superior Técnico, Universidade de Lisboa)
Astrid C. Homan (University of Amsterdam)
Gerben A. van Kleef (University of Amsterdam)
Selecting
Representative Bodies: An Axiomatic View (Page
2701)
Manon Revel (Harvard University)
Niclas Boehmer (Harvard University)
Rachael Colley (University of Glasgow)
Markus Brill (University of Warwick)
Piotr Faliszewski (AGH University)
Edith Elkind (University of Oxford & ATI)
|
(Return
to Top) |
The
Cognitive Hourglass: Agent Abstractions in the Large Models Era (Page
2706)
Alessandro Ricci (University of Bologna)
Stefano Mariani (University of Modena and Reggio Emilia)
Franco Zambonelli (University of Modena and Reggio Emilia)
Samuele Burattini (University of Bologna)
Cristiano Castelfranchi (Italian Research Council)
Explainable
Agents (XAg) by Design (Page
2712)
Sebastian Rodriguez (RMIT University)
John Thangarajah (RMIT University)
Utility-Based
Reinforcement Learning: Unifying Single-objective and Multi-objective
Reinforcement Learning (Page
2717)
Peter Vamplew (Federation University Australia)
Cameron Foale (Federation University Australia)
Conor F. Hayes (Lawrence Livermore National Laboratory)
Patrick Mannion (University of Galway)
Enda Howley (University of Galway)
Richard Dazeley (Deakin University)
Scott Johnson (Deakin University)
Johan Källström (Linköping University)
Gabriel Ramos (Universidade do Vale do Rio dos Sinos)
Roxana Rădulescu (Vrije Universiteit Brussel / Utrecht University)
Willem Röpke (Vrije Universiteit Belgium)
Diederik M. Roijers (Vrije Universiteit Brussel) |
(Return
to Top) |
Doctoral
Consortium
Abstraction
in Non-Monotonic Reasoning (Page
2722)
Iosif Apostolakis (Graz University of Technology)
Emergence
of Linguistic Conventions In Multi-Agent Systems Through Situated Communicative
Interactions (Page
2725)
Jérôme Botoko Ekila (Vrije Universiteit Brussel)
|
|
Communication
and Generalization in Multi-Agent Learning (Page
2728)
Jiaxun Cui (The University of Texas at Austin)
|
(Return
to Top) |
The
Multi-agent System based on LLM for Online Discussions (Page
2731)
Yihan Dong (Kyoto University)
Negotiation
Strategies for Combining Partials Deals in One-To-Many Negotiations (Page
2734)
Tamara C.P. Florijn (Centrum Wiskunde & Informatica and Utrecht University)
Scaling
up Cooperative Multi-agent Reinforcement Learning Systems (Page
2737)
Minghong Geng (Singapore Management University)
Toward
Explainable Agent Behaviour (Page
2740)
Victor Gimenez-Abalos (Barcelona Supercomputing Center & Universitat
Politecnica de Catalunya)
Towards
building Autonomous AI Agents and Robots for Open World Environments (Page
2743)
Shivam Goel (Tufts University)
Large
Learning Agents: Towards Continually Aligned Robots with Scale in RL (Page
2746)
Bram Grooten (Eindhoven University of Technology)
|
(Return
to Top) |
Efficient
Continuous Space BeliefMDP Solutions for Navigation and Active Sensing (Page
2749)
Himanshu Gupta (University of Colorado Boulder)
Building
Trustworthy Human-Centric Autonomous Systems Via Explanations (Page
2752)
Balint Gyevnar (University of Edinburgh)
Adaptive
Decision-Making in Non-Stationary Markov Decision Processes (Page
2755)
Baiting Luo (Vanderbilt University)
Interactive
Control and Decision-Making for Multi-Robots Systems (Page
2758)
Yiwei Lyu (Carnegie Mellon University)
Leveraging
Human Models to Personalize AI Interventions for Behavior Change (Page
2761)
Eura Nofshin (Harvard University)
|
|
Predicting
and Protecting the Cognitive Health of Operators in Isolated, Confined,
and Extreme Environments (Page
2764)
Erin E. Richardson (University of Colorado Boulder)
|
(Return
to Top) |
Generalizing
Objective-Specification in Markov Decision Processes (Page
2767)
Pedro P. Santos (INESC-ID, Instituto Superior Técnico)
Cooperative
Multi-Agent Reinforcement Learning in Convention Reliant Environments (Page
2773)
Jarrod Shipton (University of the Witwatersrand Johannesburg)
Formal
and Natural Language assisted Curriculum Generation for Reinforcement
Learning Agents (Page
2776)
Yash Shukla (Tufts University)
Distributive
and Temporal Fairness in Algorithmic Collective Decision-Making (Page
2779)
Nicholas Teh (University of Oxford)
Bayesian
Model-Free Deep Reinforcement Learning (Page
2782)
Pascal R. van der Vaart (Delft University of Technology)
Autonomous
Skill Acquisition for Robots Using Graduated Learning (Page
2785)
Gautham Vasan (University of Alberta)
Allocating
Resources with Imperfect Information (Page
2788)
Shiji Xing (The Hong Kong Polytechnic University)
Advancing
Sample Efficiency and Explainability in Multi-Agent Reinforcement Learning (Page
2791)
Zhicheng Zhang (Carnegie Mellon University) |
(Return
to Top) |
Demonstration
Track
EVtonomy:
A Personalised Route Planner for Electric Vehicles (Page
2794)
Alexandry Augustin (University of Southampton)
Elnaz Shafipour (University of Southampton)
Sebastian Stein (University of Southampton)
End
to End Camera only Drone Detection and Tracking Demo within a Multi-agent
Framework with a CNN-LSTM Model for Range Estimation (Page
2797)
Maxence de Rochechouart (Sorbonne University, Abu Dhabi)
Raed Abu Zitar (Sorbonne University, Abu Dhabi)
Amal El Fallah Seghrouchni (Sorbonne University)
Frederic Barbaresco (Thales Group)
Imitation
Learning Datasets: A Toolkit For Creating Datasets, Training Agents and
Benchmarking (Page
2800)
Nathan Gavenski (King's College Longon)
Michael Luck (University of Sussex)
Odinaldo Rodrigues (King's College London)
A
Symbolic Sequential Equilibria Solver for Game Theory Explorer (Page
2803)
Moritz Graf (University of Freiburg)
Thorsten Engesser (IRIT)
Bernhard Nebel (University of Freiburg)
Naphtha
Cracking Center Scheduling Optimization using Multi-Agent Reinforcement
Learning (Page 2806)
Sunghoon Hong (LG AI Research)
Deunsol Yoon (LG AI Research)
Whiyoung Jung (LG AI Research)
Jinsang Lee (LG AI Research)
Hyundam Yoo (LG AI Research)
Jiwon Ham (LG AI Research)
Suhyun Jung (LG AI Research)
Chanwoo Moon (LG AI Research)
Yeontae Jung (LG AI Research)
Kanghoon Lee (LG AI Research)
Woohyung Lim (LG AI Research)
Somin Jeon (LG Chem)
Myounggu Lee (LG Chem)
Sohui Hong (LG Chem)
Jaesang Lee (LG Chem)
Hangyoul Jang (LG Chem)
Changhyun Kwak (LG Chem)
Jeonghyeon Park (LG Chem)
Changhoon Kang (LG Chem)
Jungki Kim (LG Chem)
|
(Return
to Top) |
Conversational
Language Models for Human-in-the-Loop Multi-Robot Coordination (Page
2809)
William Hunt (University of Southampton)
Toby Godfrey (University of Southampton)
Mohammad D. Soorati (University of Southampton)
STV+KH:
Towards Practical Verification of Strategic Ability for Knowledge and
Information Flow (Page
2812)
Mateusz Kamiński (Institute of Computer Science, Polish Academy of
Sciences & Faculty of Mathematics and Computer Science, Nicolaus Copernicus University in Toruń)
Damian Kurpiewski (Institute of Computer Science, Polish Academy of
Sciences & Faculty of Mathematics and Computer Science, Nicolaus Copernicus University in Toruń)
Wojciech Jamroga (Interdisciplinary Centre for Security, Reliability
and Trust, SnT, University of Luxembourg & Institute of Computer Science,
Polish Academy of Sciences)
SMT4SMTL:
A Tool for SMT-Based Satisfiability Checking of SMTL (Page
2815)
Artur Niewiadomski (University of Siedlce)
Maciej Nazarczuk (University of Siedlce)
Mateusz Przychodzki (University of Siedlce)
Magdalena Kacprzak (Bialystok University of Technology)
Wojciech Penczek (Institute of Computer Science, PAS)
Andrzej Zbrzezny (Jan Dlugosz University in Czestochowa)
Engaging
the Elderly in Exercise with Agents: A Gamified Stationary Bike System
for Sarcopenia Management (Page
2818)
Yang Qiu (Nanyang Technological University)
Ping Chen (Nanyang Technological University)
Huiguo Zhang (Nanyang Technological University)
Bo Huang (Nanyang Technological University)
Di Wang (Nanyang Technological University)
Zhiqi Shen (Nanyang Technological University)
pgeon
applied to Overcooked-AI to explain agents' behaviour (Page
2821)
Adrian Tormos (Barcelona Supercomputing Center)
Victor Gimenez-Abalos (Barcelona Supercomputing Center)
Javier Vázquez-Salceda (Universitat Politčcnica de Catalunya)
Sergio Alvarez-Napagao (Universitat Politčcnica de Catalunya& Barcelona
Supercomputing Center) |
(Return
to Top) |
JAAMAS
Track
Generating
and Choosing Organizations for Multi-Agent Systems (Page
2824)
Cleber J. Amaral (Instituto Federal de Santa Catarina)
Jomi F. Hübner (Universidade Federal de Santa Catarina)
Stephen Cranefield (University of Otago)
A
Summary of the RGS⊕: an RDF Graph Synchronization System for
Collaborative Robotics (Page
2827)
Cyrille Berger (Linköping University)
Patrick Doherty (Linköping University)
Piotr Rudol (Linköping University)
Mariusz Wzorek (Linköping University)
A
Summary of Online Markov Decision Processes with Non-oblivious Strategic
Adversary (Page 2830)
Le Cong Dinh (University of Southampton)
David Henry Mguni (Huawei R&D UK)
Long Tran-Thanh (University of Warwick)
Jun Wang (University College London)
Yaodong Yang (Institute for AI, Peking University)
Extended
Abstract of Diffusion Auction Design with Transaction Costs (Page
2833)
Bin Li (Nanjing University of Science and Technology)
Dong Hao (University of Electronic Science and Technology of China)
Dengji Zhao (ShanghaiTech University)
|
(Return
to Top) |
Toward
a Normative Approach for Resilient Multiagent Systems: A Summary (Page
2836)
Geeta Mahala (University of Wollongong)
Ozgur Kafali (University of Kent)
Hoa Khanh Dam (University of Wollongong)
Aditya Ghose (University of Wollongong)
Munindar P. Singh (North Carolina State university)
Combining
Theory of Mind and Abductive Reasoning in Agent-Oriented Programming (Page
2839)
Nieves Montes (Artificial Intelligence Research Institute (IIIA-CSIC))
Michael Luck (University of Sussex)
Nardine Osman (Artificial Intelligence Research Institute (IIIA-CSIC))
Odinaldo Rodrigues (King's College London)
Carles Sierra (Artificial Intelligence Research Institute (IIIA-CSIC))
Extended
Abstract: Price of Anarchy of Traffic Assignment with Exponential Cost
Functions (Page 2842)
Jianglin Qiao (University of South Australia)
Dave De Jonge (IIIA-CSIC)
Dongmo Zhang (Western Sydney University)
Simeon Simoff (Western Sydney University)
Carles Sierra (IIIA-CSIC)
Bo Du (Griffith University)
A
Survey of Multi-Agent Deep Reinforcement Learning with Communication (Page
2845)
Changxi Zhu (Department of Information and Computing Sciences, Utrecht
University)
Mehdi Dastani (Department of Information and Computing Sciences, Utrecht
University)
Shihan Wang (Department of Information and Computing Sciences, Utrecht
University)
|