Publications tagged "Reinforcement learning"


Conferences

RLlib: Abstractions for Distributed Reinforcement Learning

Eric Liang*, Richard Liaw*, Robert Nishihara, Philipp Moritz, Roy Fox, Ken Goldberg, Joseph Gonzalez, Michael Jordan, and Ion Stoica

35th International Conference on Machine Learning (ICML), 2018


Principled Option Learning in Markov Decision Processes

Roy Fox*, Michal Moshkovitz*, and Naftali Tishby

13th European Workshop on Reinforcement Learning (EWRL), 2016


Taming the Noise in Reinforcement Learning via Soft Updates

Roy Fox*, Ari Pakman*, and Naftali Tishby

32nd Conference on Uncertainty in Artificial Intelligence (UAI), 2016


Bounded Planning in Passive POMDPs

Roy Fox, and Naftali Tishby

29th International Conference on Machine Learning (ICML), 2012


A Reinforcement Learning Algorithm with Polynomial Interaction Complexity for Only-Costly-Observable MDPs

Roy Fox, and Moshe Tennenholtz

22nd Conference on Artificial Intelligence (AAAI), 2007


Workshops

Toward Provably Unbiased Temporal-Difference Value Estimation

Roy Fox

Optimization Foundations for Reinforcement Learning workshop (OPTRL @ NeurIPS), 2019


Task-Relevant Embeddings for Robust Perception in Reinforcement Learning

Eric Liang, Roy Fox, Joseph Gonzalez, and Ion Stoica

Prediction and Generative Modeling in Reinforcement Learning workshop (PGMRL @ ICML), 2018


Ray RLlib: A Composable and Scalable Reinforcement Learning Library

Eric Liang*, Richard Liaw*, Robert Nishihara, Philipp Moritz, Roy Fox, Joseph Gonzalez, Ken Goldberg, and Ion Stoica

Deep Reinforcement Learning symposium (DeepRL @ NeurIPS), 2017


Theses

Information-Theoretic Methods for Planning and Learning in Partially Observable Markov Decision Processes

Roy Fox

PhD Thesis, 2016



Preprints

Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games

Stephen McAleer*, John Lanier*, Roy Fox, and Pierre Baldi

arXiv:2006.08555, 2020


Multi-Level Discovery of Deep Options

Roy Fox*, Sanjay Krishnan*, Ion Stoica, and Ken Goldberg

arXiv:1703.08294, 2017