Structure and Randomness in Planning and Reinforcement Learning

Konrad Czechowski; Piotr Januszewski; Piotr Kozakowski; Łukasz Kuciński; Piotr Miłoś

doi:10.1109/ijcnn52387.2021.9533317

Planning in large state spaces inevitably needs to balance the depth and breadth of the search. It has a crucial impact on the performance of a planner and most manage this interplay implicitly. We present a novel method \textit{Shoot Tree Search (STS)}, which makes it possible to control this trade-off more explicitly. Our algorithm can be understood as an interpolation between two celebrated search mechanisms: MCTS and random shooting. It also lets the user control the bias-variance trade-off, akin to TD(n), but in the tree search context. In experiments on challenging domains, we show that STS can get the best of both worlds consistently achieving higher scores.

Authors

magister Konrad Czechowski,
Piotr Januszewski link open in new tab ,
magister Piotr Kozakowski,
doktor Łukasz Kuciński,
doktor habilitowany Piotr Miłoś

Additional information

DOI: Digital Object Identifier link open in new tab 10.1109/ijcnn52387.2021.9533317
Category: Aktywność konferencyjna
Type: materiały konferencyjne indeksowane w Web of Science
Language: angielski
Publication year: 2021

Source: MOSTWiedzy.pl - publication "Structure and Randomness in Planning and Reinforcement Learning" link open in new tab

link open in new tab

Publications Repository - Gdańsk University of Technology

Treść strony