Main Page

Table of Contents

Author Index

Author Index


Faghan, Yaser (University of Lisbon)

Extended Abstract ~ Gambler Bandits and the Regret of Being Ruined (Page 1664)


Faltings, Boi (École Polytechnique Fédérale de Lausanne (EPFL))

Main Track ~ Improved Cooperation by Exploiting a Common Signal (Page 395)


Farges, Jean-Loup (ONERA DTIS, Université de Toulouse)

Blue Sky Ideas Track ~ Autonomous Agents and Multiagent Systems Challenges in Earth Observation Satellite Constellations (Page 39)


Fargier, Hélène (IRIT-CNRS, Université de Toulouse)

Main Track ~ A Knowledge Compilation Map for Conditional Preference Statements-based Languages (Page 492)


Farinelli, Alessandro (Università degli Studi di Verona)

Main Track ~ Identification of Unexpected Decisions in Partially Observable Monte-Carlo Planning: A Rule-Based Approach (Page 889)


Feng, Lu (University of Virginia)

Main Track ~ Safe Multi-Agent Reinforcement Learning via Shielding (Page 483)


Ferrando, Angelo (The University of Manchester)

Demonstration Track ~ Strategy RV: A Tool to Approximate ATL Model Checking under Imperfect Information and Perfect Recall (Page 1764)


Ferret, Johan (Google Research & Inria Lille Nord Europe)

Main Track ~ Self-Imitation Advantage Learning (Page 501)


Ferstl, Ylva (Trinity College Dublin)

Extended Abstract ~ It's A Match! Gesture Generation Using Expressive Parameter Matching (Page 1495)


Figiel, Aleksander (TU Berlin)

Main Track ~ High-Multiplicity Fair Allocation Made More Practical (Page 260)


Filimonov, Alina (Technion - Israel Institute of Technology)

Main Track ~ Strategyproof Facility Location Mechanisms on Discrete Trees (Page 510)


Filliat, David (U2IS, ENSTA Paris, Institut Polytechnique de Paris & INRIA)

Extended Abstract ~ On the Sensory Commutativity of Action Sequences for Embodied Agents (Page 1472)


Fioretto, Ferdinando (Syracuse University)

Extended Abstract ~ Privacy-Preserving and Accountable Multi-agent Learning (Page 1605)


Fisher, Michael (University of Manchester)

JAAMAS Track ~ Summarising a Framework for the Certification of Reliable Autonomous Systems (Page 1733)


Fochetta, Luca (Politecnico di Milano)

Main Track ~ Exploration of Indoor Environments through Predicting the Layout of Partially Observed Rooms (Page 836)


Foerster, Jakob (Facebook AI Research)

Extended Abstract ~ Trajectory Diversity for Zero-Shot Coordination (Page 1593)


Formisano, Andrea (University of Udine)

Extended Abstract ~ A Logic of Inferable in Multi-Agent Systems with Budget and Costs (Page 1483)


Fowler, Patrick (Washington University in St. Louis)

Main Track ~ Efficient Nonmyopic Online Allocation of Scarce Reusable Resources (Page 447)


Fox, James (University of Oxford)

Main Track ~ Equilibrium Refinements for Multi-Agent Influence Diagrams: Theory and Practice (Page 574)


Fraga Pereira, Ramon (Sapienza University of Rome)

Extended Abstract ~ Combining LSTMs and Symbolic Approaches for Robust Plan Recognition (Page 1634)


Francisco, Alexandre P. (Universidade de Lisboa)

Extended Abstract ~ Eliciting Fairness in Multiplayer Bargaining through Network-Based Role Assignment (Page 1683)


Gabai Schlosberg, Yuval (Ben Gurion University of the Negev)

Extended Abstract ~ Partially Cooperative Multi-Agent Periodic Indivisible Resource Allocation (Page 1498)


Gabriel, Iason (DeepMind)

Extended Abstract ~ Modelling Cooperation in Network Games with Spatio-Temporal Complexity (Page 1455)


Gaignier, Fabrice (University of Paris)

Main Track ~ Probabilistic Control Argumentation Frameworks (Page 519)


Gajane, Pratik (University of Leoben)

Extended Abstract ~ Gambler Bandits and the Regret of Being Ruined (Page 1664)


Galimullin, Rustam (University of Bergen)

Main Track ~ Quantified Announcements and Common Knowledge (Page 528)


Ganapathi Subramanian, Sriram (University of Waterloo)

Main Track ~ Partially Observable Mean Field Reinforcement Learning (Page 537)


Gao, Zhimin (Auburn University at Montgomery)

Main Track ~ A Game Theoretical Analysis of Non-Linear Blockchain System (Page 323)


García, Julian (Monash University)

Main Track ~ Cooperation and Reputation Dynamics with Reinforcement Learning (Page 115)


Garcia-Ortiz, Michael (CitAI, SMCSE, City University of London)

Extended Abstract ~ On the Sensory Commutativity of Action Sequences for Embodied Agents (Page 1472)


Gargouri, Anis (University Artois, CNRS, UMR 8188, CRIL)

Main Track ~ On a Notion of Monotonic Support for Bipolar Argumentation Frameworks (Page 546)


Garnelo, Marta (DeepMind)

Extended Abstract ~ Pick Your Battles: Interaction Graphs as Population-Level Objectives for Strategic Diversity (Page 1501)


Gatti, Nicola (Politecnico di Milano)

Main Track ~ Multi-Agent Coordination in Adversarial Environments through Signal Mediated Strategies (Page 269)


Geist, Matthieu (Google Research)

Main Track ~ Self-Imitation Advantage Learning (Page 501)

Main Track ~ Show Me the Way: Intrinsic Motivation from Demonstrations (Page 620)


Georgara, Athina (Artificial Intelligence Research Institute (IIIA-CSIC))

Extended Abstract ~ A Succinct Representation Scheme for Cooperative Games under Uncertainty (Page 1661)

Extended Abstract ~ Towards a Competence-Based Approach to Allocate Teams to Tasks (Page 1504)


Gerding, Enrico H. (University of Southampton)

Blue Sky Ideas Track ~ Responsibility Research for Trustworthy Autonomous Systems (Page 57)


Giacobbe, Mirco (University of Oxford)

Extended Abstract ~ Shielding Atari Games with Bounded Prescience (Page 1507)


Gianessi, Paolo (Mines Saint-Étienne, CNRS, UMR 6158, LIMOS Institut Henri Fayol)

Extended Abstract ~ A Generic Multi-Agent Model for Resource Allocation Strategies in Online On-Demand Transport with Autonomous Vehicles (Page 1489)


Gibescu, Madeleine (Utrecht University)

Blue Sky Ideas Track ~ Sparse Training Theory for Scalable and Efficient Agents (Page 34)


Gidel, Gauthier (DeepMind)

Extended Abstract ~ Pick Your Battles: Interaction Graphs as Population-Level Objectives for Strategic Diversity (Page 1501)


Gijswijt, Dion (TU Delft)

Main Track ~ Achieving Sybil-Proofness in Distributed Work Systems (Page 1263)


Gilbert, Hugo (Université Paris-Dauphine, Université PSL, CNRS, LAMSADE)

Extended Abstract ~ Maximizing Influence-Based Group Shapley Centrality (Page 1461)


Giordano, Joseph P. (University of Central Florida)

Extended Abstract ~ Comparison of Desynchronization Methods for a Decentralized Swarm on a Logistical Resupply Problem (Page 1510)


Goel, Shivam (Tufts University)

Main Track ~ A Novelty-Centric Agent Architecture for Changing Worlds (Page 925)

Main Track ~ SPOTTER: Extending Symbolic Planning Operators through Targeted Reinforcement Learning (Page 1118)


Goindani, Mahak (Purdue University)

Extended Abstract ~ Towards Decentralized Social Reinforcement Learning via Ego-Network Extrapolation (Page 1512)


Goldweber, Eli (University of Michigan)

JAAMAS Track ~ On Teammate-Pattern-Aware Autonomy (Page 1730)


Gombolay, Matthew (Georgia Institute of Technology)

Main Track ~ Multi-Agent Graph-Attention Communication and Teaming (Page 964)


Gomez, Andres (University of St. Gallen)

Demonstration Track ~ Autonomous Agents on the Edge of Things (Page 1767)


Gómez, Jonatan (Universidad Nacional de Colombia)

JAAMAS Track ~ A Decentralised Self-Healing Approach for Network Topology Maintenance (Page 1743)


Gonen, Rica (The Open University of Israel)

Extended Abstract ~ A Global Multi-Sided Market with Ascending-Price Mechanism (Page 1515)


Goranko, Valentin (Stockholm University)

JAAMAS Track ~ Logic-based Specification and Verification of Homogeneous Dynamic Multi-agent Systems (Page 1727)


Gourvès, Laurent (Université Paris-Dauphine & Université PSL, CNRS, LAMSADE)

Main Track ~ Worst-case Bounds for Spending a Common Budget (Page 288)


Gow, Jeremy (Queen Mary University of London)

Main Track ~ Action Advising with Advice Imitation in Deep Reinforcement Learning (Page 629)


Goyal, Aastha (University of Texas at Austin)

Main Track ~ Scalable Multiagent Driving Policies for Reducing Traffic Congestion (Page 386)


Graves, Daniel (Huawei Canada)

Blue Sky Ideas Track ~ Diverse Auto-Curriculum is Critical for Successful Real-World Multiagent Learning Systems (Page 51)


Greenwald, Amy (Brown University)

Extended Abstract ~ Learning Competitive Equilibria in Noisy Combinatorial Markets (Page 1446)


Grivet Sébert, Arnaud (Université Paris-Saclay, CEA, List)

Extended Abstract ~ Rank Aggregation by Dissatisfaction Minimisation in the Unavailable Candidate Model (Page 1518)


Gross-Humbert, Nathanaël (LIP6, Sorbonne Université, CNRS)

Extended Abstract ~ Sequential and Swap Mechanisms for Public Housing Allocation with Quotas and Neighbourhood-Based Utilities (Page 1521)


Guaman, Mateo (Tufts University)

Main Track ~ A Novelty-Centric Agent Architecture for Changing Worlds (Page 925)


Guerra, Carla (INESC-ID & Universidade de Lisboa)

Extended Abstract ~ Teaching Unknown Learners to Classify via Feature Importance (Page 1524)


Guerra, Jonathan (Airbus Defence and Space)

Blue Sky Ideas Track ~ Autonomous Agents and Multiagent Systems Challenges in Earth Observation Satellite Constellations (Page 39)


Guimarães, Manuel (INESC-ID & Universidade de Lisboa)

Extended Abstract ~ CHARET: Character-centered Approach to Emotion Tracking in Stories (Page 1469)


Gujar, Sujit (International Institute of Information Technology Hyderabad)

Extended Abstract ~ A Multi-Arm Bandit Approach To Subset Selection Under Constraints (Page 1492)

Extended Abstract ~ We might walk together, but I run faster: Network Fairness and Scalability in Blockchains (Page 1539)


Guo, Mingyu (University of Adelaide)

Main Track ~ Mechanism Design for Public Projects via Neural Networks (Page 1380)


Guo, Runqi (University of Adelaide)

Main Track ~ Mechanism Design for Public Projects via Neural Networks (Page 1380)


Guo, Shangwei (Nanyang Technological University)

Main Track ~ Temporal Watermarks for Deep Reinforcement Learning Models (Page 314)


Guo, Xiaoxi (Peking University)

Main Track ~ Sequential Mechanisms for Multi-type Resource Allocation (Page 1209)


Gupta, Jayesh K. (Stanford University)

Main Track ~ Deep Implicit Coordination Graphs for Multi-agent Reinforcement Learning (Page 764)

Main Track ~ Scalable Anytime Planning for Multi-Agent MDPs (Page 341)


Gupta, Siddharth (Ben-Gurion University of the Negev)

Main Track ~ Multivariate Analysis of Scheduling Fair Competitions (Page 555)


Gupta, Vaibhav (International Institute of Information Technology, Hyderabad)

Main Track ~ Action Selection for Composable Modular Deep Reinforcement Learning (Page 565)


Gutierrez, Julian (Monash University)

Main Track ~ Mean-Payoff Games with ω-Regular Specifications (Page 1272)

Main Track ~ Multi-Agent Reinforcement Learning with Temporal Logic Specifications (Page 583)


Gyawali, Saurav (Tufts University)

Main Track ~ A Novelty-Centric Agent Architecture for Changing Worlds (Page 925)


Haeri, Hossein (University of Massachusetts, Lowell)

Doctoral Consortium ~ Reward-Sharing Relational Networks in Multi-Agent Reinforcement Learning as a Framework for Emergent Behavior (Page 1808)


Hailes, Stephen (University College London)

Main Track ~ Cooperation and Reputation Dynamics with Reinforcement Learning (Page 115)


Hammond, Lewis (University of Oxford)

Main Track ~ Equilibrium Refinements for Multi-Agent Influence Diagrams: Theory and Practice (Page 574)

Main Track ~ Multi-Agent Reinforcement Learning with Temporal Logic Specifications (Page 583)


Han, The Anh (Teesside Univeresity)

Extended Abstract ~ Promoting Fair Proposers, Fair Responders or Both? Cost-Efficient Interference in the Spatial Ultimatum Game (Page 1480)


Harabor, Daniel (Monash University)

Extended Abstract ~ Anytime Multi-Agent Path Finding via Large Neighborhood Search (Page 1581)


Harrenstein, Paul (University of Oxford)

Main Track ~ A Hotelling-Downs Framework for Party Nominees (Page 593)


Hasanbeig, Mohammadhosein (University of Oxford)

Extended Abstract ~ Shielding Atari Games with Bounded Prescience (Page 1507)


Hatanaka, Wataru (RICOH COMPANY, LTD.)

Extended Abstract ~ Simultaneous Learning of Moving and Active Perceptual Policies for Autonomous Robot (Page 1527)


Hayes, Conor F. (National University of Ireland Galway)

Extended Abstract ~ Distributional Monte Carlo Tree Search for Risk-Aware and Multi-Objective Reinforcement Learning (Page 1530)


Hazrati, Naieme (Free University of Bolzano)

Doctoral Consortium ~ Impact of Recommender Systems on the Dynamics of Users' Choices (Page 1811)


He, Keyang (University of Georgia)

Main Track ~ Cooperative-Competitive Reinforcement Learning with History-Dependent Rewards (Page 602)


He, Chloe (Imperial College London)

Demonstration Track ~ Argflow: A Toolkit for Deep Argumentative Explanations for Neural Networks (Page 1761)


He, Tairan (Shanghai Jiao Tong University)

Main Track ~ Energy-Based Imitation Learning (Page 809)


Helsen, Jan (Vrije Universiteit Brussel)

Main Track ~ Scalable Optimization for Wind Farm Control using Coordination Graphs (Page 1362)


Henderson, Peter (Stanford University)

Main Track ~ TDprop: Does Adaptive Optimization With Jacobi Preconditioning Help Temporal Difference Learning? (Page 1082)


Hickert, Cameron (Harvard University)

Extended Abstract ~ Stratified Experience Replay: Correcting Multiplicity Bias in Off-Policy Reinforcement Learning (Page 1486)


Hindriks, Koen (Vrije Universiteit Amsterdam)

JAAMAS Track ~ Agent Programming in the Cognitive Era (Page 1718)


Hirashima, Kazuki (Mitsubishi Electric Corporation)

Extended Abstract ~ Coverage Control under Connectivity Constraints (Page 1554)


Ho, Chien-Ju (Washington University in St. Louis)

Main Track ~ Efficient Nonmyopic Online Allocation of Scarce Reusable Resources (Page 447)


Hofmann, Katja (Microsoft Research)

Extended Abstract ~ Deep Interactive Bayesian Reinforcement Learning via Meta-Learning (Page 1712)

Extended Abstract ~ Evaluating the Robustness of Collaborative Agents (Page 1560)


Hogrebe, Tobias Alexander (Heinrich-Heine-Universität Düsseldorf)

Main Track ~ Complexity of Scheduling and Predicting Round-Robin Tournaments (Page 178)


Hoogeveen, Han (Utrecht University)

Demonstration Track ~ TORS: A Train Unit Shunting and Servicing Simulator (Page 1785)


Howley, Enda (National University of Ireland Galway)

Extended Abstract ~ Distributional Monte Carlo Tree Search for Risk-Aware and Multi-Objective Reinforcement Learning (Page 1530)


Hsiao, Vincent (University of Maryland)

Extended Abstract ~ Approximating Spatial Evolutionary Games using Bayesian Networks (Page 1533)


Hu, Hengyuan (Facebook AI Research)

Extended Abstract ~ Trajectory Diversity for Zero-Shot Coordination (Page 1593)


Hu, Xiaohui (The Institute of Software, Chinese Academy of Sciences)

Extended Abstract ~ Intrinsic Motivated Multi-Agent Communication (Page 1668)


Hua, Yun (East China Normal University)

Main Track ~ Structured Diversification Emergence via Reinforced Organization Control and Hierachical Consensus Learning (Page 773)


Huang, Taoan (University of Southern California)

Main Track ~ Learning Node-Selection Strategies in Bounded-Suboptimal Conflict-Based Search for Multi-Agent Path Finding (Page 611)


Hübner, Jomi F. (Universidade Federal de Santa Catarina)

Main Track ~ Active Perception within BDI Agents Reasoning Cycle (Page 1218)


Hughes, Edward (DeepMind)

Extended Abstract ~ Modelling Cooperation in Network Games with Spatio-Temporal Complexity (Page 1455)


Huisman, Bob (Nederlandse Spoorwegen)

Demonstration Track ~ TORS: A Train Unit Shunting and Servicing Simulator (Page 1785)


Hung, Hayley (Delft University of Technology)

JAAMAS Track ~ Facial Feedback for Reinforcement Learning: A Case Study and Offline Analysis Using the TAMER Framework (Page 1735)


Hunter, Julie (LINAGORA Labs)

Extended Abstract ~ Interpretive Blindness and the Impossibility of Learning from Testimony (Page 1449)


Hussenot, Léonard (Google Research, Brain Team & Univ. Lille, CNRS, Inria Scool, UMR 9189 CRIStAL)

Main Track ~ Show Me the Way: Intrinsic Motivation from Demonstrations (Page 620)


İlhan, Ercüment (Queen Mary University of London)

Main Track ~ Action Advising with Advice Imitation in Deep Reinforcement Learning (Page 629)


Imajo, Kentaro (Preferred Networks, Inc.)

Main Track ~ Trader-Company Method: A Metaheuristics for Interpretable Stock Price Prediction (Page 656)


Imber, Aviram (Technion - Israel Institute of Technology)

Main Track ~ Computing the Extremal Possible Ranks with Incomplete Preferences (Page 638)

Main Track ~ Probabilistic Inference of Winners in Elections by Independent Random Voters (Page 647)


Inoue, Katsumi (National Institute of Informatics & The Graduate University for Advanced Studies)

Main Track ~ Partial Robustness in Team Formation: Bridging the Gap between Robustness and Resilience (Page 1154)


Irissappane, Athirai A. (University of Washington)

Main Track ~ Learning Complex Policy Distribution with CEM Guided Adversarial Hypernetwork (Page 1308)


Isaac, William S. (DeepMind)

Extended Abstract ~ Modelling Cooperation in Network Games with Spatio-Temporal Complexity (Page 1455)


Islam, Zahoor Ul (Umeå University)

Doctoral Consortium ~ Software Engineering Methods for Responsible Artificial Intelligence (Page 1814)


Istrate, Gabriel (West University of Timişoara & the e-Austria Research Institute)

Blue Sky Ideas Track ~ Models We Can Trust: Toward a Systematic Discipline of (Agent-Based) Model Interpretation and Validation (Page 6)


Ito, Katsuya (Preferred Networks, Inc.)

Main Track ~ Trader-Company Method: A Metaheuristics for Interpretable Stock Price Prediction (Page 656)


Ivanov, Dmitry (JetBrains Research & HSE University)

Extended Abstract ~ Balancing Rational and Other-Regarding Preferences in Cooperative-Competitive Environments (Page 1536)


Jabbari, Shahin (Harvard University)

Main Track ~ Active Screening for Recurrent Diseases: A Reinforcement Learning Approach (Page 992)


Jafarkhani, Hamid (University of California, Irvine)

Extended Abstract ~ Distributed Q-Learning with State Tracking for Multi-agent Networked Control (Page 1692)


Jain, Anurag (International Institute of Information Technology, Hyderabad)

Extended Abstract ~ We might walk together, but I run faster: Network Fairness and Scalability in Blockchains (Page 1539)


Jain, Pallavi (Indian Institute of Technology Jodhpur)

Main Track ~ Partition Aggregation for Participatory Budgeting (Page 665)

Extended Abstract ~ Preserving Consistency for Liquid Knapsack Voting (Page 1542)


Jamroga, Wojciech (Polish Academy of Sciences & University of Luxembourg)

Extended Abstract ~ Strategic Abilities of Asynchronous Agents: Semantic Side Effects (Page 1545)

Demonstration Track ~ STV+Reductions: Towards Practical Verification of Strategic Ability Using Model Reductions (Page 1770)


Jamshidi, Pooyan (University of South Carolina)

Main Track ~ Accelerating Recursive Partition-Based Causal Structure Learning (Page 1028)


Jansen, Nils (Radboud University)

Main Track ~ AlwaysSafe: Reinforcement Learning without Safety Constraint Violations during Training (Page 1226)


Javidian, Mohammad Ali (Purdue University)

Main Track ~ Accelerating Recursive Partition-Based Causal Structure Learning (Page 1028)


Jeong, JiHyun (Cornell University)

Doctoral Consortium ~ Leveraging Social Interactions in Human-Agent Decision-Making (Page 1816)


Ji, Jianmin (University of Science and Technology of China)

Main Track ~ Parallel Curriculum Experience Replay in Distributed Reinforcement Learning (Page 782)


Jiang, Arthur (Microsoft ARD Incubation Team)

Main Track ~ Cooperative Policy Learning with Pre-trained Heterogeneous Observation Representations (Page 1191)


Jiang, Minqi (University College London)

Main Track ~ Grid-to-Graph: Flexible Spatial Relational Inductive Biases for Reinforcement Learning (Page 674)


Jiang, Yuan (Nanjing University)

Main Track ~ Imitation Learning from Pixel-Level Demonstrations by HashReward (Page 279)

Extended Abstract ~ Solving 3D Bin Packing Problem via Multimodal Deep Reinforcement Learning (Page 1548)


Jiang, Zhengyao (University College London)

Main Track ~ Grid-to-Graph: Flexible Spatial Relational Inductive Biases for Reinforcement Learning (Page 674)


Jin, Bo (East China Normal University & SRIAS)

Main Track ~ Structured Diversification Emergence via Reinforced Organization Control and Hierachical Consensus Learning (Page 773)


Jin, Zhi (Peking University)

Extended Abstract ~ Fast Adaptation to External Agents via Meta Imitation Counterfactual Regret Advantage (Page 1709)


Jonker, Catholijn M. (Delft University of Technology)

Main Track ~ Axies: Identifying and Evaluating Context-Specific Values (Page 799)

Blue Sky Ideas Track ~ Responsibility Research for Trustworthy Autonomous Systems (Page 57)

Demonstration Track ~ A Collaborative Platform for Identifying Context-Specific Values (Page 1773)


Kaczmarczyk, Andrzej (TU Berlin)

Main Track ~ High-Multiplicity Fair Allocation Made More Practical (Page 260)


Kagita, Venkateswara Rao (National Institute of Technology)

Main Track ~ Committee Selection using Attribute Approvals (Page 683)


Kamarthi, Harshavardhan (Georgia Institute of Technology)

Main Track ~ Reinforcement Learning for Unified Allocation and Patrolling in Signaling Games with Uncertainty (Page 1353)


Kampik, Timotheus (Umeå University)

Extended Abstract ~ Toward Consistent Agreement Approximation in Abstract Argumentation and Beyond (Page 1551)

Demonstration Track ~ Autonomous Agents on the Edge of Things (Page 1767)


Kanaa, David (Polytechnique Montréal & Mila)

Main Track ~ TDprop: Does Adaptive Optimization With Jacobi Preconditioning Help Temporal Difference Learning? (Page 1082)


Kaneko, Yusuke (CyberAgent, Inc.)

Main Track ~ Off-Policy Exploitability-Evaluation in Two-Player Zero-Sum Markov Games (Page 78)


Kanthan, Leslie (Turing Intelligence Technology)

Extended Abstract ~ Call Markets with Adaptive Clearing Intervals (Page 1587)


Kasenberg, Daniel (Tufts University)

Main Track ~ SPOTTER: Extending Symbolic Planning Operators through Targeted Reinforcement Learning (Page 1118)


Kashima, Hisashi (Kyoto University)

Main Track ~ Grab the Reins of Crowds: Estimating the Effects of Crowd Movement Guidance Using Causal Inference (Page 1290)


Kasichainula, Keshav (University of Houston)

Main Track ~ A Game Theoretical Analysis of Non-Linear Blockchain System (Page 323)


Katz, Jonathan (University of Maryland)

Extended Abstract ~ RPPLNS: Pay-per-last-N-shares with a Randomised Twist (Page 1569)


Kaul, Bharat (Intel Corporation)

Main Track ~ SEERL: Sample Efficient Ensemble Reinforcement Learning (Page 1100)


Kawaguchi, Atsuo (RICOH COMPANY, LTD.)

Extended Abstract ~ Simultaneous Learning of Moving and Active Perceptual Policies for Autonomous Robot (Page 1527)


Kawajiri, Shota (Mitsubishi Electric Corporation)

Extended Abstract ~ Coverage Control under Connectivity Constraints (Page 1554)


Kawasaki, Takehiro (Kyushu University)

Main Track ~ Mechanism Design for Housing Markets over Social Networks (Page 692)


Kelkar, Amol (i3AI.org)

Blue Sky Ideas Track ~ Cognitive Homeostatic Agents (Page 12)


Kephart, Jeffrey O. (IBM Thomas J. Research Center)

Blue Sky Ideas Track ~ Multi-modal Agents for Business Intelligence (Page 17)


Keskin, Mehmet Onur (Özyeğin University)

Extended Abstract ~ Solver Agent: Towards Emotional and Opponent-Aware Agent for Human-Robot Negotiation (Page 1557)


Ketter, Wolfgang (University of Cologne)

Main Track ~ Siting and Sizing of Charging Infrastructure for Shared Autonomous Electric Fleets (Page 88)


Khadilkar, Harshad (TCS Research)

Extended Abstract ~ SIBRE: Self Improvement Based REwards for Adaptive Feedback in Reinforcement Learning (Page 1607)


Khadka, Shauharda (MIcrosoft)

Extended Abstract ~ Dynamic Skill Selection for Learning Joint Actions (Page 1637)


Khaluf, Yara (Ghent University - imec)

Main Track ~ No More Hand-Tuning Rewards: Masked Constrained Policy Optimization for Safe Reinforcement Learning (Page 1344)


Khan, Arindam (Indian Institute of Science)

Main Track ~ Group Fairness for Knapsack Problems (Page 1001)


Khan, Md. Mosaddek (University of Dhaka)

Main Track ~ A Local Search Based Approach to Solve Continuous DCOPs (Page 1127)

Main Track ~ Accelerating Recursive Partition-Based Causal Structure Learning (Page 1028)


Khan, Shakil M. (Ronin Institute)

Main Track ~ Knowing Why – On the Dynamics of Knowledge about Actual Causes in the Situation Calculus (Page 701)


Killian, Jackson A. (Harvard University)

Main Track ~ Beyond "To Act or Not to Act": Fast Lagrangian Approaches to General Multi-Action Restless Bandits (Page 710)


Kim, Yan (University of Luxembourg)

Demonstration Track ~ STV+Reductions: Towards Practical Verification of Strategic Ability Using Model Reductions (Page 1770)


Kimelfeld, Benny (Technion - Israel Institute of Technology)

Main Track ~ Computing the Extremal Possible Ranks with Incomplete Preferences (Page 638)

Main Track ~ Probabilistic Inference of Winners in Elections by Independent Random Voters (Page 647)


Kirley, Michael (The University of Melbourne)

Main Track ~ Extended Goal Recognition: A Planning-Based Model for Strategic Deception (Page 871)


Kjellström, Hedvig (KTH Royal Institute of Technology)

Demonstration Track ~ A Framework for Integrating Gesture Generation Models into Interactive Conversational Agents (Page 1779)


Knop, Dušan (Czech Technical University in Prague)

Main Track ~ High-Multiplicity Fair Allocation Made More Practical (Page 260)


Knott, Paul (University of Nottingham)

Extended Abstract ~ Evaluating the Robustness of Collaborative Agents (Page 1560)


Kochenderfer, Mykel J. (Stanford University)

Main Track ~ Deep Implicit Coordination Graphs for Multi-agent Reinforcement Learning (Page 764)

Main Track ~ Scalable Anytime Planning for Multi-Agent MDPs (Page 341)


Koenig, Sven (University of Southern California)

Main Track ~ Learning Node-Selection Strategies in Bounded-Suboptimal Conflict-Based Search for Multi-Agent Path Finding (Page 611)

Extended Abstract ~ Anytime Multi-Agent Path Finding via Large Neighborhood Search (Page 1581)


Kolaitis, Phokion G. (University of California, Santa Cruz & IBM Research)

Main Track ~ Classifying the Complexity of the Possible Winner Problem on Partial Chains (Page 297)


Kompella, Varun (Sony AI)

Main Track ~ Multiagent Epidemiologic Inference through Realtime Contact Tracing (Page 1182)


Konieczny, Sébastien (CNRS, University Artois, UMR 8188, CRIL)

Main Track ~ On a Notion of Monotonic Support for Bipolar Argumentation Frameworks (Page 546)


Kraiczy, Sonja (University of Oxford)

Extended Abstract ~ On Weakly and Strongly Popular Rankings (Page 1563)


Krausburg, Tabajara (Pontifical Catholic University of Rio Grande do Sul & Clausthal University of Technology)

Main Track ~ Feasible Coalition Sequences (Page 719)


Kroening, Daniel (Amazon, Inc.)

Extended Abstract ~ Shielding Atari Games with Bounded Prescience (Page 1507)


Kucherenko, Taras (KTH Royal Institute of Technology)

Demonstration Track ~ A Framework for Integrating Gesture Generation Models into Interactive Conversational Agents (Page 1779)


Kumar, Akshat (Singapore Management University)

Main Track ~ Action Selection for Composable Modular Deep Reinforcement Learning (Page 565)

Extended Abstract ~ Approximate Difference Rewards for Scalable Multigent Reinforcement Learning (Page 1655)

Demonstration Track ~ Ship-GAN: Generative Modeling Based Maritime Traffic Simulator (Page 1755)


Kumar, Rajiv Ranjan (Singapore Management University)

Main Track ~ Adaptive Operating Hours for Improved Performance of Taxi Fleets (Page 728)


Kumar, Vikas (University of Delhi & Central University of Rajasthan)

Main Track ~ Committee Selection using Attribute Approvals (Page 683)


Kurpiewski, Damian (Polish Academy of Sciences)

Demonstration Track ~ STV+Reductions: Towards Practical Verification of Strategic Ability Using Model Reductions (Page 1770)


Kutay, Cat (Charles Darwin University)

Extended Abstract ~ Personalising the Dialogue of Relational Agents for First-Time Users (Page 1622)


Lackner, Martin (TU Wien)

Main Track ~ Approval-Based Shortlisting (Page 737)

Extended Abstract ~ Fairness in Long-Term Participatory Budgeting (Page 1566)


LaCroix, Travis (Université de Montréal & University of Toronto)

Main Track ~ Emergent Communication under Competition (Page 974)


Lagniez, Jean-Marie (CRIL-CNRS, Université d'Artois)

Main Track ~ Partial Robustness in Team Formation: Bridging the Gap between Robustness and Resilience (Page 1154)


Lai, Edmund (Auckland University of Technology)

Demonstration Track ~ Graph-based Self-Adaptive Conversational Agent (Page 1791)


Lam, Alexander (University of New South Wales Sydney)

Doctoral Consortium ~ Balancing Fairness, Efficiency and Strategy-Proofness in Voting and Facility Location Problems (Page 1818)


Lanctot, Marc (DeepMind)

Extended Abstract ~ Sound Algorithms in Imperfect Information Games (Page 1674)


Lau, Hoong Chuin (Singapore Management University)

Extended Abstract ~ Approximate Difference Rewards for Scalable Multigent Reinforcement Learning (Page 1655)


Lauren, Stefan (Imperial College London)

Main Track ~ Aggregating Bipolar Opinions (Page 746)


Law, Matthew V. (Cornell University)

Doctoral Consortium ~ Intention-Aware Human-Robot Collaborative Design (Page 1820)


Lazaridou, Angeliki (Deepmind)

Main Track ~ Emergent Communication under Competition (Page 974)


Lazos, Philip (Sapienza University of Rome)

Extended Abstract ~ RPPLNS: Pay-per-last-N-shares with a Randomised Twist (Page 1569)


Lederer, Patrick (Technische Universität München)

Main Track ~ On the Indecisiveness of Kelly-Strategyproof Social Choice Functions (Page 251)

Doctoral Consortium ~ Non-manipulability in Set-valued and Probabilistic Social Choice Theory (Page 1823)


Lee, Cheng-Kuang (NVIDIA Corporation)

Extended Abstract ~ A Distributional Perspective on Value Function Factorization Methods for Multi-Agent Reinforcement Learning (Page 1671)


Lee, Chun-Yi (National Tsing Hua University)

Extended Abstract ~ A Distributional Perspective on Value Function Factorization Methods for Multi-Agent Reinforcement Learning (Page 1671)


Lee, Wee Sun (National University of Singapore)

Main Track ~ State-Aware Variational Thompson Sampling for Deep Q-Networks (Page 124)


Leibo, Joel Z. (DeepMind)

Extended Abstract ~ Modelling Cooperation in Network Games with Spatio-Temporal Complexity (Page 1455)


León, Borja G. (Imperial College London)

Main Track ~ An Abstraction-based Method to Check Multi-Agent Deep Reinforcement-Learning Behaviors (Page 474)


Lertvittayakumjorn, Piyawat (Imperial College London)

Demonstration Track ~ Argflow: A Toolkit for Deep Argumentative Explanations for Neural Networks (Page 1761)


Lesca, Julien (Université Paris-Dauphine & Université PSL, CNRS, LAMSADE)

Main Track ~ Worst-case Bounds for Spending a Common Budget (Page 288)


Lespérance, Yves (York University)

Main Track ~ Knowing Why – On the Dynamics of Knowledge about Actual Causes in the Situation Calculus (Page 701)


Lev, Omer (Ben-Gurion University of the Negev)

Main Track ~ Predicting Voting Outcomes in Presence of Communities (Page 151)

Main Track ~ The Price is (Probably) Right: Learning Market Equilibria from Samples (Page 755)

Extended Abstract ~ Learning Cooperative Solution Concepts from Voting Behavior: A Case Study on the Israeli Knesset (Page 1572)


Lev Lehman, Rotem (Ben Gurion University of the Negev)

Extended Abstract ~ Partial Disclosure of Private Dependencies in Privacy Preserving Planning (Page 1575)


Levin, Simon A. (Princeton University)

Extended Abstract ~ Social Network Interventions to Prevent Reciprocity-driven Polarization (Page 1643)


Levy, Jordi (IIIA-CSIC)

Main Track ~ A Heuristic Algorithm for Multi-Agent Vehicle Routing with Automated Negotiation (Page 404)


Li, Bo (The Hong Kong Polytechnic University)

Main Track ~ Multi-Robot Task Allocation-Complexity and Approximation (Page 133)


Li, Chenghao (Tsinghua University)

Main Track ~ Modeling the Interaction between Agents in Cooperative Multi-Agent Reinforcement Learning (Page 853)


Li, Fu (University of Texas at Austin)

Extended Abstract ~ Object Allocation Over a Network of Objects: Mobile Agents with Strict Preferences (Page 1578)


Li, Guangliang (Ocean University of China)

JAAMAS Track ~ Facial Feedback for Reinforcement Learning: A Case Study and Offline Analysis Using the TAMER Framework (Page 1735)


Li, Jiaoyang (University of Southern California)

Extended Abstract ~ Anytime Multi-Agent Path Finding via Large Neighborhood Search (Page 1581)


Li, Lingbo (Turing Intelligence Technology)

Extended Abstract ~ Call Markets with Adaptive Clearing Intervals (Page 1587)


Li, Mickey (University of Bristol)

Extended Abstract ~ Reliability-Aware Multi-UAV Coverage Path Planning using a Genetic Algorithm (Page 1584)


Li, Sheng (Stanford University)

Main Track ~ Deep Implicit Coordination Graphs for Multi-agent Reinforcement Learning (Page 764)


Li, Shuxin (Nanyang Technological University)

Main Track ~ Temporal Watermarks for Deep Reinforcement Learning Models (Page 314)


Li, Weihua (Auckland University of Technology)

Extended Abstract ~ Learning Policies for Effective Incentive Allocation in Unknown Social Networks (Page 1701)

Demonstration Track ~ Graph-based Self-Adaptive Conversational Agent (Page 1791)


Li, Wenhao (East China Normal University)

Main Track ~ Structured Diversification Emergence via Reinforced Organization Control and Hierachical Consensus Learning (Page 773)


Li, Yuyu (University of Science and Technology of China)

Main Track ~ Parallel Curriculum Experience Replay in Distributed Reinforcement Learning (Page 782)


Liang, Yu (Penn State University)

Main Track ~ Let the DOCTOR Decide Whom to Test: Adaptive Testing Strategies to Tackle the COVID-19 Pandemic (Page 790)


Libin, Pieter J.K. (Hasselt University)

Main Track ~ Scalable Optimization for Wind Farm Control using Coordination Graphs (Page 1362)


Lin, Anthony W. (Technical University of Kaiserslautern, MPI SWS)

Main Track ~ Regular Model Checking Approach to Knowledge Reasoning over Parameterized Systems (Page 1254)


Lin, Sen (Arizona State University)

Extended Abstract ~ Distributed Q-Learning with State Tracking for Multi-agent Networked Control (Page 1692)


Liscio, Enrico (Delft University of Technology)

Main Track ~ Axies: Identifying and Evaluating Context-Specific Values (Page 799)

Demonstration Track ~ A Collaborative Platform for Identifying Context-Specific Values (Page 1773)


Lisowski, Grzegorz (University of Warwick)

Main Track ~ A Hotelling-Downs Framework for Party Nominees (Page 593)


Liu, Bo (Institute of Automation, Chinese Academy of Sciences)

Main Track ~ Learning Correlated Communication Topology in Multi-Agent Reinforcement learning (Page 456)


Liu, Buhong (King's College London)

Extended Abstract ~ Call Markets with Adaptive Clearing Intervals (Page 1587)


Liu, Kun (Peking University)

Extended Abstract ~ Fast Adaptation to External Agents via Meta Imitation Counterfactual Regret Advantage (Page 1709)


Liu, Minghuan (Shanghai Jiao Tong University)

Main Track ~ Energy-Based Imitation Learning (Page 809)


Liu, Siqi (DeepMind)

Extended Abstract ~ Pick Your Battles: Interaction Graphs as Population-Level Objectives for Strategic Diversity (Page 1501)


Liu, Tie-Yan (Microsoft Research)

Main Track ~ Cooperative Policy Learning with Pre-trained Heterogeneous Observation Representations (Page 1191)


Liu, Xiaolong (Sun Yat-sen University)

Extended Abstract ~ Solid Semantics and Extension Aggregation Using Quota Rules under Integrity Constraints (Page 1590)


Liu, Yang (Nanyang Technological University)

Main Track ~ Temporal Watermarks for Deep Reinforcement Learning Models (Page 314)


Liu, Zhengshang (The University of Melbourne)

Main Track ~ Deceptive Reinforcement Learning for Privacy-Preserving Planning (Page 818)


Liu, Ziqi (University College London)

Main Track ~ Learning Correlated Communication Topology in Multi-Agent Reinforcement learning (Page 456)


Logan, Brian (Utrecht University)

Main Track ~ Intention Progression using Quantitative Summary Information (Page 1416)


Logan, Brian (University of Nottingham)

JAAMAS Track ~ Agent Programming in the Cognitive Era (Page 1718)


Lomuscio, Alessio (AAMAS-2021 General Chair)

Welcome Message from the Chairs


Lopes, Manuel (INESC-ID & Universidade de Lisboa)

Extended Abstract ~ Teaching Unknown Learners to Classify via Feature Importance (Page 1524)


Lorini, Emiliano (IRIT-CNRS, Toulouse University)

Main Track ~ A Logic of Evaluation (Page 827)


Louis, Anand (Indian Institute of Science)

Main Track ~ Group Fairness for Knapsack Problems (Page 1001)


Lu, Wei (Massachusetts Institute of Technology)

Extended Abstract ~ Learning Cooperative Solution Concepts from Voting Behavior: A Case Study on the Israeli Knesset (Page 1572)


Lu, Yiwen (Tsinghua University)

Main Track ~ Modeling the Interaction between Agents in Cooperative Multi-Agent Reinforcement Learning (Page 853)


Luo, Jun (Huawei Canada)

Blue Sky Ideas Track ~ Diverse Auto-Curriculum is Critical for Successful Real-World Multiagent Learning Systems (Page 51)


Luo, Tie (Missouri University of Science and Technology)

Extended Abstract ~ A Blockchain-Enabled Quantitative Approach to Trust and Reputation Management with Sparse Evidence (Page 1707)


Luperto, Matteo (Università degli Studi di Milano)

Main Track ~ Exploration of Indoor Environments through Predicting the Layout of Partially Observed Rooms (Page 836)


Lupu, Andrei (McGill University & Mila)

Extended Abstract ~ Trajectory Diversity for Zero-Shot Coordination (Page 1593)


Lyu, Xueguang (Northeastern University)

Main Track ~ Contrasting Centralized and Decentralized Critics in Multi-Agent Reinforcement Learning (Page 844)


Ma, Biyang (Northumbria University)

Demonstration Track ~ ATPT: Automate Typhoon Contingency Plan Generation from Text (Page 1788)


Ma, Chaofan (Zhongyuan University of Technology)

Main Track ~ Drone Formation Control via Belief-Correlated Imitation Learning (Page 1407)


Ma, Xiaoteng (Tsinghua University)

Main Track ~ Modeling the Interaction between Agents in Cooperative Multi-Agent Reinforcement Learning (Page 853)


Mańdziuk, Jacek (Warsaw University of Technology)

Main Track ~ Evolution of Strategies in Sequential Security Games (Page 1434)


Macke, William (University of Texas at Austin)

Main Track ~ Scalable Multiagent Driving Policies for Reducing Traffic Congestion (Page 386)


Mailly, Jean-Guy (University of Paris)

Main Track ~ Probabilistic Control Argumentation Frameworks (Page 519)


Maity, Suman Kalyan (Northwestern University)

Main Track ~ Network Robustness via Global k-cores (Page 438)


Majumdar, Somdeb (Intel Labs)

Extended Abstract ~ Dynamic Skill Selection for Learning Joint Actions (Page 1637)


Malladi, Tejasvi (TCS Research)

Main Track ~ To hold or not to hold? - Reducing Passenger Missed Connections in Airlines using Reinforcement Learning (Page 862)


Malvone, Vadim (Télécom Paris)

Demonstration Track ~ Strategy RV: A Tool to Approximate ATL Model Checking under Imperfect Information and Perfect Recall (Page 1764)


Maly, Jan (TU Wien)

Main Track ~ Approval-Based Shortlisting (Page 737)

Extended Abstract ~ Fairness in Long-Term Participatory Budgeting (Page 1566)


Mamun-Or-Rashid, Md. (University of Dhaka)

Main Track ~ Accelerating Recursive Partition-Based Causal Structure Learning (Page 1028)


Mancheva, Lyuba (University Grenoble Alpes, LIG)

Main Track ~ An Agent-Based Model to Predict Pedestrians Trajectories with an Autonomous Vehicle in Shared Spaces (Page 1010)


Mangal, Pranav (Imperial College London)

Demonstration Track ~ Argflow: A Toolkit for Deep Argumentative Explanations for Neural Networks (Page 1761)


Manlove, David (University of Glasgow)

Extended Abstract ~ On Weakly and Strongly Popular Rankings (Page 1563)


Mannion, Patrick (National University of Ireland Galway)

Extended Abstract ~ Distributional Monte Carlo Tree Search for Risk-Aware and Multi-Objective Reinforcement Learning (Page 1530)

Extended Abstract ~ Tunable Behaviours in Sequential Social Dilemmas using Multi-Objective Reinforcement Learning (Page 1610)


Manoonpong, Poramate (Vidyasirimedhi Institute of Science and Technology (VISTEC))

Extended Abstract ~ Learning to Cooperate with Unseen Agents Through Meta-Reinforcement Learning (Page 1478)


Marathe, Madhav (University of Virginia)

Extended Abstract ~ Cohorting to Isolate Asymptomatic Spreaders: An Agent-Based Simulation Study on the Mumbai Suburban Railway (Page 1680)


Marcon dos Santos, Gilberto (Oregon State University)

Demonstration Track ~ Scalable Multiple Robot Task Planning with Plan Merging and Conflict Resolution (Page 1776)


Marden, Jason R. (University of California, Santa Barbara)

Main Track ~ Tractable Mechanisms for Computing Near-Optimal Utility Functions (Page 306)


Marmolejo Cossío, Francisco J. (University of Oxford & IOHK)

Extended Abstract ~ RPPLNS: Pay-per-last-N-shares with a Randomised Twist (Page 1569)


Marquis, Pierre (University Artois, CNRS, IUF, UMR 8188, CRIL)

Main Track ~ On a Notion of Monotonic Support for Bipolar Argumentation Frameworks (Page 546)


Marsella, Stacy (Northeastern University)

Main Track ~ A Computational Model of Coping for Simulating Human Behavior in High-Stress Situations (Page 1425)

Main Track ~ CMCF: An Architecture for Realtime Gesture Generation by Clustering Gestures by Motion and Communicative Function (Page 1136)


Martín Rico, Francisco (Rey Juan Carlos University)

Extended Abstract ~ Optimized Execution of PDDL Plans using Behavior Trees (Page 1596)


Mascardi, Viviana (Università di Genova)

JAAMAS Track ~ Logic-based Technologies for Multi-agent Systems: Summary of a Systematic Literature Review (Page 1721)

JAAMAS Track ~ Summarising a Framework for the Certification of Reliable Autonomous Systems (Page 1733)


Masters, Peta (The University of Melbourne)

Main Track ~ Deceptive Reinforcement Learning for Privacy-Preserving Planning (Page 818)

Main Track ~ Extended Goal Recognition: A Planning-Based Model for Strategic Deception (Page 871)


Mate, Aditya (Harvard University)

Main Track ~ Risk-Aware Interventions in Public Health: Planning with Restless Multi-Armed Bandits (Page 880)


Matellán Olivera, Vicente (University of León)

Extended Abstract ~ Optimized Execution of PDDL Plans using Behavior Trees (Page 1596)


Mathias, H. David (University of Wisconsin-La Crosse)

Extended Abstract ~ Comparison of Desynchronization Methods for a Decentralized Swarm on a Logistical Resupply Problem (Page 1510)


Maudet, Nicolas (LIP6, UMR 7606, Sorbonne Université)

Extended Abstract ~ Rank Aggregation by Dissatisfaction Minimisation in the Unavailable Candidate Model (Page 1518)

Extended Abstract ~ Sequential and Swap Mechanisms for Public Housing Allocation with Quotas and Neighbourhood-Based Utilities (Page 1521)


Mayer, Simon (University of St. Gallen)

Demonstration Track ~ Autonomous Agents on the Edge of Things (Page 1767)


Mayo, Katherine (University of Michigan)

Extended Abstract ~ A Strategic Analysis of Portfolio Compression (Page 1599)


Mazzi, Giulio (Università degli Studi di Verona)

Main Track ~ Identification of Unexpected Decisions in Partially Observable Monte-Carlo Planning: A Rule-Based Approach (Page 889)


McCoy, Joshua (University of California, Davis)

Extended Abstract ~ HOAD: The Hanabi Open Agent Dataset (Page 1646)


McDonnell, Rachel (Trinity College Dublin)

Extended Abstract ~ It's A Match! Gesture Generation Using Expressive Parameter Matching (Page 1495)


Medya, Sourav (Northwestern University)

Main Track ~ Network Robustness via Global k-cores (Page 438)


Mehrotra, Siddharth (Delft University of Technology)

Doctoral Consortium ~ Modelling Trust in Human-AI Interaction (Page 1826)


Meir, Reshef (Technion - Israel Institute of Technology)

Main Track ~ Strategyproof Facility Location Mechanisms on Discrete Trees (Page 510)


Melissourgos, Themistoklis (Technical University of Munich)

Main Track ~ Walrasian Equilibria in Markets with Small Demands (Page 413)


Melo, Francisco S. (INESC-ID & Universidade de Lisboa)

Main Track ~ Cooperation between Independent Reinforcement Learners under Wealth Inequality and Collective Risks (Page 898)

Extended Abstract ~ Teaching Unknown Learners to Classify via Feature Importance (Page 1524)


Meneguzzi, Felipe (Pontifical Catholic University of Rio Grande do Sul)

Extended Abstract ~ Combining LSTMs and Symbolic Approaches for Robust Plan Recognition (Page 1634)


Mengin, Jérôme (IRIT-CNRS, Université de Toulouse)

Main Track ~ A Knowledge Compilation Map for Conditional Preference Statements-based Languages (Page 492)


Merhej, Ramona (Universidade de Lisboa & Sorbonne University)

Main Track ~ Cooperation between Independent Reinforcement Learners under Wealth Inequality and Collective Risks (Page 898)


Mey, Alexander (Delft University of Technology)

Main Track ~ Loss Bounds for Approximate Influence-Based Abstraction (Page 377)

Blue Sky Ideas Track ~ Environment Shift Games: Are Multiple Agents the Solution, and not the Problem, to Non-Stationarity? (Page 23)


Micalizio, Roberto (Università di Torino)

Main Track ~ Robustness Based on Accountability in Multiagent Organizations (Page 142)

Demonstration Track ~ Distributing Responsibilities for Exception Handling in JaCaMo (Page 1752)


Michalak, Tomasz P. (University of Warsaw)

Main Track ~ Strategic Evasion of Centrality Measures (Page 1389)


Miller, Tim (The University of Melbourne)

Main Track ~ Deceptive Reinforcement Learning for Privacy-Preserving Planning (Page 818)


Minami, Kentaro (Preferred Networks, Inc.)

Main Track ~ Trader-Company Method: A Metaheuristics for Interpretable Stock Price Prediction (Page 656)


Minarsch, David (Fetch.ai)

Main Track ~ Peer-to-peer Autonomous Agent Communication Network (Page 1037)


Minervini, Pasquale (University College London)

Main Track ~ Grid-to-Graph: Flexible Spatial Relational Inductive Biases for Reinforcement Learning (Page 674)


Ming, Zhong (Shenzhen University)

Demonstration Track ~ ATPT: Automate Typhoon Contingency Plan Generation from Text (Page 1788)


Mirsky, Reuth (The University of Texas at Austin)

Blue Sky Ideas Track ~ The Seeing-Eye Robot Grand Challenge: Rethinking Automated Care (Page 28)


Mittelmann, Munyque (Université de Toulouse - IRIT)

Extended Abstract ~ A General Framework for the Logical Representation of Combinatorial Exchange Protocols (Page 1602)


Mocanu, Decebal Constantin (University of Twente & Eindhoven University of Technology)

Blue Sky Ideas Track ~ Sparse Training Theory for Scalable and Efficient Agents (Page 34)

Extended Abstract ~ Self-Attention Meta-Learner for Continual Learning (Page 1658)


Mocanu, Elena (University of Twente)

Blue Sky Ideas Track ~ Sparse Training Theory for Scalable and Efficient Agents (Page 34)


Moell, Birger (KTH Royal Institute of Technology)

Demonstration Track ~ A Framework for Integrating Gesture Generation Models into Interactive Conversational Agents (Page 1779)


Moens, Vincent (Huawei R&D UK)

Main Track ~ Learning Correlated Communication Topology in Multi-Agent Reinforcement learning (Page 456)


Mohammad, Yasser (NEC Corporation)

Main Track ~ An Autonomous Negotiating Agent Framework with Reinforcement Learning based Strategies and Adaptive Strategy Switching Mechanism (Page 1163)


Mohsin, Hasan (Imperial College London)

Demonstration Track ~ Argflow: A Toolkit for Deep Argumentative Explanations for Neural Networks (Page 1761)


Montes, Nieves (Artificial Intelligence Research Institute (IIIA-CSIC))

Main Track ~ Value-Guided Synthesis of Parametric Normative Systems (Page 907)


Moraitis, Pavlos (University of Paris and Argument Theory)

Main Track ~ Probabilistic Control Argumentation Frameworks (Page 519)


Morales, Peter (Microsoft)

Main Track ~ Deep Implicit Coordination Graphs for Multi-agent Reinforcement Learning (Page 764)

Main Track ~ Scalable Anytime Planning for Multi-Agent MDPs (Page 341)


Moravćík, Matej (DeepMind)

Extended Abstract ~ Sound Algorithms in Imperfect Information Games (Page 1674)


Morelli, Matteo (CEA list)

Extended Abstract ~ Optimized Execution of PDDL Plans using Behavior Trees (Page 1596)


Mosca, Francesca (King's College London)

Main Track ~ ELVIRA: An Explainable Agent for Value and Utility-Driven Multiuser Privacy (Page 916)


Mouter, Niek (Delft University of Technology)

Main Track ~ Axies: Identifying and Evaluating Context-Specific Values (Page 799)


Mudigere, Dheevatsa (Facebook Inc.)

Main Track ~ SEERL: Sample Efficient Ensemble Reinforcement Learning (Page 1100)


Muhammad, Faizan (Tufts University)

Main Track ~ A Novelty-Centric Agent Architecture for Changing Worlds (Page 925)


Mujumdar, Anusha (Ericsson Research)

Demonstration Track ~ Symbolic Reinforcement Learning for Safe RAN Control (Page 1782)


Mulderij, Jesse (Delft University of Technology)

Demonstration Track ~ TORS: A Train Unit Shunting and Servicing Simulator (Page 1785)


Murugappan, Karpagam (TCS Research)

Main Track ~ To hold or not to hold? - Reducing Passenger Missed Connections in Airlines using Reinforcement Learning (Page 862)


Murukannaiah, Pradeep K. (Delft University of Technology)

Main Track ~ Axies: Identifying and Evaluating Context-Specific Values (Page 799)

Demonstration Track ~ A Collaborative Platform for Identifying Context-Specific Values (Page 1773)


Musolesi, Mirco (University College London & University of Bologna)

Main Track ~ Cooperation and Reputation Dynamics with Reinforcement Learning (Page 115)