Publications tagged "Partial observability"


Conferences

Reinforcement Learning from Delayed Observations via World Models

Armin Karamzade, Kyungmin Kim, Montek Kalsi, and Roy Fox

1st Reinforcement Learning Conference (RLC), 2024


Toward Optimal Policy Population Growth in Two-Player Zero-Sum Games

Stephen McAleer, JB Lanier, Kevin Wang, Pierre Baldi, Tuomas Sandholm, and Roy Fox

12th International Conference on Learning Representations (ICLR), 2024


Learning to Query Internet Text for Informing Reinforcement Learning Agents

Kolby Nottingham, Alekhya Pyla, Sameer Singh, and Roy Fox

Reinforcement Learning and Decision Making (RLDM), 2022


XDO: A Double Oracle Algorithm for Extensive-Form Games

Stephen McAleer, JB Lanier, Kevin Wang, Pierre Baldi, and Roy Fox

35th Conference on Neural Information Processing Systems (NeurIPS), 2021


Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games

Stephen McAleer*, JB Lanier*, Roy Fox, and Pierre Baldi

34th Conference on Neural Information Processing Systems (NeurIPS), 2020


A Multi-Agent Control Framework for Co-Adaptation in Brain-Computer Interfaces

Josh Merel*, Roy Fox*, Tony Jebara, and Liam Paninski

27th Conference on Neural Information Processing Systems (NeurIPS), 2013


A Reinforcement Learning Algorithm with Polynomial Interaction Complexity for Only-Costly-Observable MDPs

Roy Fox and Moshe Tennenholtz

22nd Conference on Artificial Intelligence (AAAI), 2007


Workshops

Feasible Adversarial Robust Reinforcement Learning for Underspecified Environments

JB Lanier, Stephen McAleer, Pierre Baldi, and Roy Fox

Deep Reinforcement Learning workshop (DRL @ NeurIPS), 2022


Anytime PSRO for Two-Player Zero-Sum Games

Stephen McAleer, Kevin Wang, JB Lanier, Marc Lanctot, Pierre Baldi, Tuomas Sandholm, and Roy Fox

Reinforcement Learning in Games workshop (RLG @ AAAI), 2022


CFR-DO: A Double Oracle Algorithm for Extensive-Form Games

Stephen McAleer, JB Lanier, Pierre Baldi, and Roy Fox

Reinforcement Learning in Games workshop (RLG @ AAAI), 2021


Task-Relevant Embeddings for Robust Perception in Reinforcement Learning

Eric Liang, Roy Fox, Joseph Gonzalez, and Ion Stoica

Prediction and Generative Modeling in Reinforcement Learning workshop (PGMRL @ ICML), 2018


Preprints