دورية أكاديمية

A Unifying Framework for Reinforcement Learning and Planning

التفاصيل البيبلوغرافية
العنوان: A Unifying Framework for Reinforcement Learning and Planning
المؤلفون: Moerland, Thomas M., Broekens, Joost, Plaat, Aske, Jonker, Catholijn M.
المساهمون: Universiteit Leiden
المصدر: Frontiers in Artificial Intelligence ; volume 5 ; ISSN 2624-8212
بيانات النشر: Frontiers Media SA
سنة النشر: 2022
المجموعة: Frontiers (Publisher - via CrossRef)
مصطلحات موضوعية: Artificial Intelligence
الوصف: Sequential decision making, commonly formalized as optimization of a Markov Decision Process, is a key challenge in artificial intelligence. Two successful approaches to MDP optimization are reinforcement learning and planning , which both largely have their own research communities. However, if both research fields solve the same problem, then we might be able to disentangle the common factors in their solution approaches. Therefore, this paper presents a unifying algorithmic framework for reinforcement learning and planning (FRAP), which identifies underlying dimensions on which MDP planning and learning algorithms have to decide. At the end of the paper, we compare a variety of well-known planning, model-free and model-based RL algorithms along these dimensions. Altogether, the framework may help provide deeper insight in the algorithmic design space of planning and reinforcement learning.
نوع الوثيقة: article in journal/newspaper
اللغة: unknown
DOI: 10.3389/frai.2022.908353
DOI: 10.3389/frai.2022.908353/full
الإتاحة: https://doi.org/10.3389/frai.2022.908353Test
حقوق: https://creativecommons.org/licenses/by/4.0Test/
رقم الانضمام: edsbas.D6ED1F0
قاعدة البيانات: BASE