Session | B3 |
Title | Learning I |
Chair | Sandip Sen |
734 | Adaptive Budgeted Bandit Algorithms for Trust Development in a Supply-Chain |
Sandip Sen, Anton Ridgway, Michael Ripley | |
126 | Improving the Performance of Mobile Phone Crowdsourcing Applications |
Erfan Davami, Gita Sukthankar | |
761 | Selecting Robust Strategies in RTS Games via Concurrent Plan Augmentation |
Abdelrahman Elogeel, Andrey Kolobov, Matthew Alden, Ankur Teredesai | |
643 | CFQI: Fitted Q-Iteration with Complex Returns |
Robert W Wright, Xingye Qiao, Steven Loscalzo, Lei Yu | |
323 | Counterfactual Exploration for Improving Multiagent Learning |
Mitchell K. Colby, Sepideh Kharaghani, Chris HolmesParker, Kagan Tumer | |
38 | Policy Transfer using Reward Shaping |
Tim Brys, Anna Harutyunyan, Matthew E. Taylor, Ann Nowé |