Journal article

Improving cognitive agent decision making: Experience trajectories as plans

J Pfau, S Karim, M Kirley, E Sonenberg

Web Intelligence and Agent Systems: An International Journal | IOS Press | Published : 2014


In task environments with large state and action spaces, the use of temporal and state abstraction can potentially improve the decision making performance of agents. However, existing approaches within a reinforcement learning framework typically identify possible subgoal states and instantly learn stochastic subpolicies to reach them from other states. In these circumstances, exploration of the reinforcement learner is unfavorably biased towards local behavior around these subgoals; temporal abstractions are not exploited to reduce required deliberation; and the benefit of employing temporal abstractions is conflated with the benefit of additional learning done to define subpolicies. In thi..

View full abstract