Bounded Planning in Passive POMDPs
Roy Fox and Naftali Tishby
29th International Conference on Machine Learning (ICML), 2012
In Passive POMDPs actions do not affect the world state, but still incur costs. When the agent is bounded by information-processing constraints, it can only keep an approximation of the belief. We present a variational principle for the problem of maintaining the information which is most useful for minimizing the cost, and introduce an efficient and simple algorithm for finding an optimum.