All Publications

Conferences

REG: In-Sample RL via Regularizing the Evaluation Gap

Hanpu Shen, Weining Shen, and Roy Fox

43rd International Conference on Machine Learning (ICML), 2026

Adapting World Models with Latent-State Dynamics Residuals

JB Lanier, Kyungmin Kim, Armin Karamzade, Yifei Liu, Ankita Sinha, Kathleen He, Davide Corsi, and Roy Fox

8th Annual Learning for Dynamics and Control Conference (L4DC), 2026

Model-Based Reinforcement Learning under Random Observation Delays

Armin Karamzade, Kyungmin Kim, JB Lanier, Davide Corsi, and Roy Fox

8th Annual Learning for Dynamics and Control Conference (L4DC), 2026

Moonwalk: Inverse-Forward Differentiation

Dmitrii Krylov, Armin Karamzade, and Roy Fox

29th Annual Conference on Artificial Intelligence and Statistics (AISTATS), 2026

Explanations for Unrealizability of Infinite-State Safety Shields

Andoni Rodríguez, Irfansha Shaik, Davide Corsi, Roy Fox, and César Sánchez

22nd International Conference on Principles of Knowledge Representation and Reasoning (KR), 2025

Make the Pertinent Salient: Task-Relevant Reconstruction for Visual Control with Distractions

Kyungmin Kim, JB Lanier, and Roy Fox

2nd Reinforcement Learning Conference (RLC), 2025

A Variational Neural Network Model of Resource-Rational Reward Encoding in Human Planning

Zhuojun Ying, Frederick Callaway, Roy Fox, Anastasia Kiyonaga, and Marcelo Mattar

47th Annual Meeting of the Cognitive Science Society (CogSci), 2025

Realizable Continuous-Space Shields for Safe Reinforcement Learning

Kyungmin Kim*, Davide Corsi*, Andoni Rodríguez*, JB Lanier, Benjami Parellada, Pierre Baldi, César Sánchez, and Roy Fox

7th Annual Learning for Dynamics & Control Conference (L4DC), 2025

Verification-Guided Shielding for Deep Reinforcement Learning

Davide Corsi, Guy Amir, Andoni Rodríguez, César Sánchez, Guy Katz, and Roy Fox

1st Reinforcement Learning Conference (RLC), 2024

Reinforcement Learning from Delayed Observations via World Models

Armin Karamzade, Kyungmin Kim, Montek Kalsi, and Roy Fox

1st Reinforcement Learning Conference (RLC), 2024

Skill Set Optimization: Reinforcing Language Model Behavior via Transferable Skills

Kolby Nottingham, Bodhisattwa Prasad Majumder, Bhavana Dalvi Mishra, Sameer Singh, Peter Clark, and Roy Fox

41st International Conference on Machine Learning (ICML), 2024

Selective Perception: Learning Concise State Descriptions for Language Model Actors

Kolby Nottingham, Yasaman Razeghi, Kyungmin Kim, JB Lanier, Pierre Baldi, Roy Fox, and Sameer Singh

2024 Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL), 2024

Toward Optimal Policy Population Growth in Two-Player Zero-Sum Games

Stephen McAleer, JB Lanier, Kevin Wang, Pierre Baldi, Tuomas Sandholm, and Roy Fox

12th International Conference on Learning Representations (ICLR), 2024

Do Embodied Agents Dream of Pixelated Sheep?: Embodied Decision Making using Language Guided World Modelling

Kolby Nottingham, Prithviraj Ammanabrolu, Alane Suhr, Yejin Choi, Hannaneh Hajishirzi, Sameer Singh, and Roy Fox

40th International Conference on Machine Learning (ICML), 2023

Learning to Design Analog Circuits to Meet Threshold Specifications

Dmitrii Krylov, Pooya Khajeh, Junhan Ouyang, Thomas Reeves, Tongkai Liu, Hiba Ajmal, Hamidreza Aghasi, and Roy Fox

40th International Conference on Machine Learning (ICML), 2023

Reducing Variance in Temporal-Difference Value Estimation via Ensemble of Deep Networks

Litian Liang, Yaosheng Xu, Stephen McAleer, Dailin Hu, Alexander Ihler, Pieter Abbeel, and Roy Fox

39th International Conference on Machine Learning (ICML), 2022

Independent Natural Policy Gradient Always Converges in Markov Potential Games

Roy Fox, Stephen McAleer, William Overman, and Ioannis Panageas

25th International Conference on Artificial Intelligence and Statistics (AISTATS), 2022

XDO: A Double Oracle Algorithm for Extensive-Form Games

Stephen McAleer, JB Lanier, Kevin Wang, Pierre Baldi, and Roy Fox

35th Conference on Neural Information Processing Systems (NeurIPS), 2021

Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games

Stephen McAleer*, JB Lanier*, Roy Fox, and Pierre Baldi

34th Conference on Neural Information Processing Systems (NeurIPS), 2020

AutoPandas: Neural-Backed Generators for Program Synthesis

Rohan Bavishi, Caroline Lemieux, Roy Fox, Koushik Sen, and Ion Stoica

10th ACM SIGPLAN Conference on Systems, Programming, Languages, and Applications: Software for Humanity (SPLASH OOPSLA), 2019

Multi-Task Hierarchical Imitation Learning for Home Automation

Roy Fox*, Ron Berenstein*, Ion Stoica, and Ken Goldberg

15th IEEE International Conference on Automation Science and Engineering (CASE), 2019

Generalizing Robot Imitation Learning with Invariant Hidden Semi-Markov Models

Ajay Kumar Tanwani, Jonathan Lee, Brijen Thananjeyan, Michael Laskey, Sanjay Krishnan, Roy Fox, Ken Goldberg, and Sylvain Calinon

13th International Workshop on the Algorithmic Foundations of Robotics (WAFR), 2018

Constraint Estimation and Derivative-Free Recovery for Robot Learning from Demonstrations

Jonathan Lee, Michael Laskey, Roy Fox, and Ken Goldberg

14th IEEE International Conference on Automation Science and Engineering (CASE), 2018

RLlib: Abstractions for Distributed Reinforcement Learning

Eric Liang*, Richard Liaw*, Robert Nishihara, Philipp Moritz, Roy Fox, Ken Goldberg, Joseph Gonzalez, Michael Jordan, and Ion Stoica

35th International Conference on Machine Learning (ICML), 2018

Fast and Reliable Autonomous Surgical Debridement with Cable-Driven Robots Using a Two-Phase Calibration Procedure

Daniel Seita, Sanjay Krishnan, Roy Fox, Stephen McKinley, John Canny, and Ken Goldberg

35th IEEE International Conference on Robotics and Automation (ICRA), 2018

Robustly Adjusting Indoor Drip Irrigation Emitters with the Toyota HSR Robot

Ron Berenstein, Roy Fox, Stephen McKinley, Stefano Carpin, and Ken Goldberg

35th IEEE International Conference on Robotics and Automation (ICRA), 2018

Parametrized Hierarchical Procedures for Neural Programming

Roy Fox, Richard Shin, Sanjay Krishnan, Ken Goldberg, Dawn Song, and Ion Stoica

6th International Conference on Learning Representations (ICLR), 2018

DDCO: Discovery of Deep Continuous Options for Robot Learning from Demonstrations

Sanjay Krishnan*, Roy Fox*, Ion Stoica, and Ken Goldberg

1st Conference on Robot Learning (CoRL), 2017

DART: Noise Injection for Robust Imitation Learning

Michael Laskey, Jonathan Lee, Roy Fox, Anca Dragan, and Ken Goldberg

1st Conference on Robot Learning (CoRL), 2017

An Algorithm and User Study for Teaching Bilateral Manipulation via Iterated Best Response Demonstrations

Carolyn Chen, Sanjay Krishnan, Michael Laskey, Roy Fox, and Ken Goldberg

13th IEEE International Conference on Automation Science and Engineering (CASE), 2017

Statistical Data Cleaning for Deep Learning of Automation Tasks from Demonstrations

Caleb Chuck, Michael Laskey, Sanjay Krishnan, Ruta Joshi, Roy Fox, and Ken Goldberg

13th IEEE International Conference on Automation Science and Engineering (CASE), 2017

Minimum-Information LQG Control — Part I: Memoryless Controllers

Roy Fox and Naftali Tishby

55th IEEE Conference on Decision and Control (CDC), 2016

Minimum-Information LQG Control — Part II: Retentive Controllers

Roy Fox and Naftali Tishby

55th IEEE Conference on Decision and Control (CDC), 2016

Principled Option Learning in Markov Decision Processes

Roy Fox*, Michal Moshkovitz*, and Naftali Tishby

13th European Workshop on Reinforcement Learning (EWRL), 2016

Taming the Noise in Reinforcement Learning via Soft Updates

Roy Fox*, Ari Pakman*, and Naftali Tishby

32nd Conference on Uncertainty in Artificial Intelligence (UAI), 2016

A Multi-Agent Control Framework for Co-Adaptation in Brain-Computer Interfaces

Josh Merel*, Roy Fox*, Tony Jebara, and Liam Paninski

27th Conference on Neural Information Processing Systems (NeurIPS), 2013

Bounded Planning in Passive POMDPs

Roy Fox and Naftali Tishby

29th International Conference on Machine Learning (ICML), 2012

A Reinforcement Learning Algorithm with Polynomial Interaction Complexity for Only-Costly-Observable MDPs

Roy Fox and Moshe Tennenholtz

22nd Conference on Artificial Intelligence (AAAI), 2007

Theses

Information-Theoretic Methods for Planning and Learning in Partially Observable Markov Decision Processes

Roy Fox

PhD Thesis, 2016

Reinforcement Learning in Partially Observable Decision Processes

Roy Fox

MSc Thesis, 2008