All Publications
Conferences
Verification-Guided Shielding for Deep Reinforcement Learning
Davide Corsi, Guy Amir, Andoni Rodríguez, César Sánchez, Guy Katz, and Roy Fox
1st Reinforcement Learning Conference (RLC), 2024
Reinforcement Learning from Delayed Observations via World Models
Armin Karamzade, Kyungmin Kim, Montek Kalsi, and Roy Fox
1st Reinforcement Learning Conference (RLC), 2024
Skill Set Optimization: Reinforcing Language Model Behavior via Transferable Skills
Kolby Nottingham, Bodhisattwa Prasad Majumder, Bhavana Dalvi Mishra, Sameer Singh, Peter Clark, and Roy Fox
41st International Conference on Machine Learning (ICML), 2024
Selective Perception: Learning Concise State Descriptions for Language Model Actors
Kolby Nottingham, Yasaman Razeghi, Kyungmin Kim, JB Lanier, Pierre Baldi, Roy Fox, and Sameer Singh
2024 Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL), 2024
Toward Optimal Policy Population Growth in Two-Player Zero-Sum Games
Stephen McAleer, JB Lanier, Kevin Wang, Pierre Baldi, Tuomas Sandholm, and Roy Fox
12th International Conference on Learning Representations (ICLR), 2024
Do Embodied Agents Dream of Pixelated Sheep?: Embodied Decision Making using Language Guided World Modelling
Kolby Nottingham, Prithviraj Ammanabrolu, Alane Suhr, Yejin Choi, Hannaneh Hajishirzi, Sameer Singh, and Roy Fox
40th International Conference on Machine Learning (ICML), 2023
Learning to Design Analog Circuits to Meet Threshold Specifications
Dmitrii Krylov, Pooya Khajeh, Junhan Ouyang, Thomas Reeves, Tongkai Liu, Hiba Ajmal, Hamidreza Aghasi, and Roy Fox
40th International Conference on Machine Learning (ICML), 2023
Reducing Variance in Temporal-Difference Value Estimation via Ensemble of Deep Networks
Litian Liang, Yaosheng Xu, Stephen McAleer, Dailin Hu, Alexander Ihler, Pieter Abbeel, and Roy Fox
39th International Conference on Machine Learning (ICML), 2022
Learning to Query Internet Text for Informing Reinforcement Learning Agents
Kolby Nottingham, Alekhya Pyla, Sameer Singh, and Roy Fox
Reinforcement Learning and Decision Making (RLDM), 2022
Independent Natural Policy Gradient Always Converges in Markov Potential Games
Roy Fox, Stephen McAleer, William Overman, and Ioannis Panageas
25th International Conference on Artificial Intelligence and Statistics (AISTATS), 2022
XDO: A Double Oracle Algorithm for Extensive-Form Games
Stephen McAleer, JB Lanier, Kevin Wang, Pierre Baldi, and Roy Fox
35th Conference on Neural Information Processing Systems (NeurIPS), 2021
Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games
Stephen McAleer*, JB Lanier*, Roy Fox, and Pierre Baldi
34th Conference on Neural Information Processing Systems (NeurIPS), 2020
AutoPandas: Neural-Backed Generators for Program Synthesis
Rohan Bavishi, Caroline Lemieux, Roy Fox, Koushik Sen, and Ion Stoica
10th ACM SIGPLAN Conference on Systems, Programming, Languages, and Applications: Software for Humanity (SPLASH OOPSLA), 2019
Multi-Task Hierarchical Imitation Learning for Home Automation
Roy Fox*, Ron Berenstein*, Ion Stoica, and Ken Goldberg
15th IEEE Conference on Automation Science and Engineering (CASE), 2019
Generalizing Robot Imitation Learning with Invariant Hidden Semi-Markov Models
Ajay Kumar Tanwani, Jonathan Lee, Brijen Thananjeyan, Michael Laskey, Sanjay Krishnan, Roy Fox, Ken Goldberg, and Sylvain Calinon
13th International Workshop on the Algorithmic Foundations of Robotics (WAFR), 2018
Constraint Estimation and Derivative-Free Recovery for Robot Learning from Demonstrations
Jonathan Lee, Michael Laskey, Roy Fox, and Ken Goldberg
14th IEEE Conference on Automation Science and Engineering (CASE), 2018
RLlib: Abstractions for Distributed Reinforcement Learning
Eric Liang*, Richard Liaw*, Robert Nishihara, Philipp Moritz, Roy Fox, Ken Goldberg, Joseph Gonzalez, Michael Jordan, and Ion Stoica
35th International Conference on Machine Learning (ICML), 2018
Fast and Reliable Autonomous Surgical Debridement with Cable-Driven Robots Using a Two-Phase Calibration Procedure
Daniel Seita, Sanjay Krishnan, Roy Fox, Stephen McKinley, John Canny, and Ken Goldberg
35th IEEE International Conference on Robotics and Automation (ICRA), 2018
Robustly Adjusting Indoor Drip Irrigation Emitters with the Toyota HSR Robot
Ron Berenstein, Roy Fox, Stephen McKinley, Stefano Carpin, and Ken Goldberg
35th IEEE International Conference on Robotics and Automation (ICRA), 2018
Parametrized Hierarchical Procedures for Neural Programming
Roy Fox, Richard Shin, Sanjay Krishnan, Ken Goldberg, Dawn Song, and Ion Stoica
6th International Conference on Learning Representations (ICLR), 2018
DDCO: Discovery of Deep Continuous Options for Robot Learning from Demonstrations
Sanjay Krishnan*, Roy Fox*, Ion Stoica, and Ken Goldberg
1st Conference on Robot Learning (CoRL), 2017
DART: Noise Injection for Robust Imitation Learning
Michael Laskey, Jonathan Lee, Roy Fox, Anca Dragan, and Ken Goldberg
1st Conference on Robot Learning (CoRL), 2017
An Algorithm and User Study for Teaching Bilateral Manipulation via Iterated Best Response Demonstrations
Carolyn Chen, Sanjay Krishnan, Michael Laskey, Roy Fox, and Ken Goldberg
13th IEEE Conference on Automation Science and Engineering (CASE), 2017
Statistical Data Cleaning for Deep Learning of Automation Tasks from Demonstrations
Caleb Chuck, Michael Laskey, Sanjay Krishnan, Ruta Joshi, Roy Fox, and Ken Goldberg
13th IEEE Conference on Automation Science and Engineering (CASE), 2017
Principled Option Learning in Markov Decision Processes
Roy Fox*, Michal Moshkovitz*, and Naftali Tishby
13th European Workshop on Reinforcement Learning (EWRL), 2016
Taming the Noise in Reinforcement Learning via Soft Updates
Roy Fox*, Ari Pakman*, and Naftali Tishby
32nd Conference on Uncertainty in Artificial Intelligence (UAI), 2016
A Multi-Agent Control Framework for Co-Adaptation in Brain-Computer Interfaces
Josh Merel*, Roy Fox*, Tony Jebara, and Liam Paninski
27th Conference on Neural Information Processing Systems (NeurIPS), 2013
Workshops
Make the Pertinent Salient: Task-Relevant Reconstruction for Visual Control with Distraction
Kyungmin Kim, Charless Fowlkes, and Roy Fox
Training Agents with Foundation Models workshop (TAFM @ RLC), 2024
Q* Search: Heuristic Search with Deep Q-Networks
Forest Agostinelli, Shahaf Shperberg, Alexander Shmakov, Stephen McAleer, Roy Fox, and Pierre Baldi
Bridging the Gap Between AI Planning and Reinforcement Learning workshop (PRL @ ICAPS), 2024
Selective Perception: Learning Concise State Descriptions for Language Model Actors
Kolby Nottingham, Yasaman Razeghi, Kyungmin Kim, JB Lanier, Pierre Baldi, Roy Fox, and Sameer Singh
Foundation Models for Decision Making workshop (FMDM @ NeurIPS), 2023
Do Embodied Agents Dream of Pixelated Sheep?: Embodied Decision Making using Language Guided World Modelling
Kolby Nottingham, Prithviraj Ammanabrolu, Alane Suhr, Yejin Choi, Hannaneh Hajishirzi, Sameer Singh, and Roy Fox
Reincarnating Reinforcement Learning workshop (RRL @ ICLR), 2023
Feasible Adversarial Robust Reinforcement Learning for Underspecified Environments
JB Lanier, Stephen McAleer, Pierre Baldi, and Roy Fox
Deep Reinforcement Learning workshop (DRL @ NeurIPS), 2022
Anytime PSRO for Two-Player Zero-Sum Games
Stephen McAleer, Kevin Wang, JB Lanier, Marc Lanctot, Pierre Baldi, Tuomas Sandholm, and Roy Fox
Reinforcement Learning in Games workshop (RLG @ AAAI), 2022
Temporal-Difference Value Estimation via Uncertainty-Guided Soft Updates
Litian Liang, Yaosheng Xu, Stephen McAleer, Dailin Hu, Alexander Ihler, Pieter Abbeel, and Roy Fox
Deep Reinforcement Learning workshop (DRL @ NeurIPS), 2021
Target Entropy Annealing for Discrete Soft Actor–Critic
Yaosheng Xu, Dailin Hu, Litian Liang, Stephen McAleer, Pieter Abbeel, and Roy Fox
Deep Reinforcement Learning workshop (DRL @ NeurIPS), 2021
Obtaining Approximately Admissible Heuristic Functions through Deep Reinforcement Learning and A* Search
Forest Agostinelli, Stephen McAleer, Alexander Shmakov, Roy Fox, Marco Valtorta, Biplav Srivastava, and Pierre Baldi
Bridging the Gap Between AI Planning and Reinforcement Learning workshop (PRL @ ICAPS), 2021
Modular Framework for Visuomotor Language Grounding
Kolby Nottingham, Litian Liang, Daeyun Shin, Charless Fowlkes, Roy Fox, and Sameer Singh
Embodied AI workshop (EmbodiedAI @ CVPR), 2021
CFR-DO: A Double Oracle Algorithm for Extensive-Form Games
Stephen McAleer, JB Lanier, Pierre Baldi, and Roy Fox
Reinforcement Learning in Games workshop (RLG @ AAAI), 2021
Multi-Task Learning via Task Multi-Clustering
Andy Yan, Xin Wang, Ion Stoica, Joseph Gonzalez, and Roy Fox
Adaptive & Multitask Learning workshop (AMTL @ ICML), 2019
Hierarchical Imitation Learning via Variational Inference of Control Programs
Roy Fox, Richard Shin, William Paul, Yitian Zou, Dawn Song, Ken Goldberg, Pieter Abbeel, and Ion Stoica
Infer to Control: Probabilistic Reinforcement Learning and Structured Control workshop (Infer2Control @ NeurIPS), 2018
An Empirical Exploration of Gradient Correlations in Deep Learning
Daniel Rothchild, Roy Fox, Noah Golmant, Joseph Gonzalez, Michael Mahoney, Kai Rothauge, Ion Stoica, and Zhewei Yao
Integration of Deep Learning Theories workshop (DLT @ NeurIPS), 2018
Neural Inference of API Functions from Input–Output Examples
Rohan Bavishi, Caroline Lemieux, Neel Kant, Roy Fox, Koushik Sen, and Ion Stoica
Machine Learning for Systems workshop (ML for Sys @ NeurIPS), 2018
Imitation Learning of Hierarchical Programs via Variational Inference
Roy Fox*, Richard Shin*, Pieter Abbeel, Ken Goldberg, Dawn Song, and Ion Stoica
Neural Abstract Machines & Program Induction workshop (NAMPI @ ICML), 2018
Task-Relevant Embeddings for Robust Perception in Reinforcement Learning
Eric Liang, Roy Fox, Joseph Gonzalez, and Ion Stoica
Prediction and Generative Modeling in Reinforcement Learning workshop (PGMRL @ ICML), 2018
Robot Learning with Invariant Hidden Semi-Markov Models
Ajay Kumar Tanwani, Jonathan Lee, Michael Laskey, Sanjay Krishnan, Roy Fox, and Ken Goldberg
Perspectives on Robot Learning: Imitation and Causality workshop (Causal Imit. @ RSS), 2018
Ray RLlib: A Composable and Scalable Reinforcement Learning Library
Eric Liang*, Richard Liaw*, Robert Nishihara, Philipp Moritz, Roy Fox, Joseph Gonzalez, Ken Goldberg, and Ion Stoica
Deep Reinforcement Learning symposium (DeepRL @ NeurIPS), 2017
Theses
Preprints
Hierarchical Variational Imitation Learning of Control Programs
Roy Fox, Richard Shin, William Paul, Yitian Zou, Dawn Song, Ken Goldberg, Pieter Abbeel, and Ion Stoica
arXiv:1912.12612, 2019