Structure and Randomness in Planning and Reinforcement Learning

Konrad Czechowski; Piotr Januszewski; Piotr Kozakowski; Łukasz Kuciński; Piotr Miłoś

doi:10.1109/ijcnn52387.2021.9533317

Planning in large state spaces inevitably needs to balance the depth and breadth of the search. It has a crucial impact on the performance of a planner and most manage this interplay implicitly. We present a novel method \textit{Shoot Tree Search (STS)}, which makes it possible to control this trade-off more explicitly. Our algorithm can be understood as an interpolation between two celebrated search mechanisms: MCTS and random shooting. It also lets the user control the bias-variance trade-off, akin to TD(n), but in the tree search context. In experiments on challenging domains, we show that STS can get the best of both worlds consistently achieving higher scores.

Autorzy

magister Konrad Czechowski,
Piotr Januszewski link otwiera się w nowej karcie ,
magister Piotr Kozakowski,
doktor Łukasz Kuciński,
doktor habilitowany Piotr Miłoś

Informacje dodatkowe

DOI: Cyfrowy identyfikator dokumentu elektronicznego link otwiera się w nowej karcie 10.1109/ijcnn52387.2021.9533317
Kategoria: Aktywność konferencyjna
Typ: materiały konferencyjne indeksowane w Web of Science
Język: angielski
Rok wydania: 2021

Źródło danych: MOSTWiedzy.pl - publikacja "Structure and Randomness in Planning and Reinforcement Learning" link otwiera się w nowej karcie

link otwiera się w nowej karcie

Repozytorium publikacji - Politechnika Gdańska

Treść strony