RLVS 2021 - Day 3 - Regret bounds of model-based reinforcement learning

Speaker: Mengdi Wang
Chair: Sébastien Gerchinovitz

Abstract. We discuss some recent results on model-based methods for online reinforcement learning (RL). The goal of online RL is to adaptively explore an unknown environment and learn to act with provable regret bounds. In particular, we focus on finite-horizon episodic RL where the unknown transition law belongs to a generic family of models. We propose a model-based "value-targeted regression" RL algorithm based on the optimism principle: in each episode, the algorithm constructs the set of models that are "consistent" with the data collected so far. Consistency is measured by the total squared error the model incurs when predicting the values, as determined by the latest value estimate, of the observed next states. The next value function is then chosen by solving an optimistic planning problem over the constructed set of models. We derive a regret bound for an arbitrary family of transition models, using the Eluder dimension proposed by Russo & Van Roy (2014).
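As a sketch of the consistency criterion described in the abstract (the notation below is assumed for illustration, not taken verbatim from the talk): the loss is a least-squares error on value predictions, and the confidence set collects the models that nearly minimize it.

% Assumed notation: \mathcal{P} is the model family, V_j the value
% estimate used in episode j, (s_h^j, a_h^j, s_{h+1}^j) the transition
% observed at step h of episode j, and \beta_k a confidence width.
\[
  L_k(P') \;=\; \sum_{j < k} \sum_{h=1}^{H}
    \Bigl( \mathbb{E}_{s' \sim P'(\cdot \mid s_h^j, a_h^j)}\bigl[V_j(s')\bigr]
           \;-\; V_j\bigl(s_{h+1}^j\bigr) \Bigr)^{2}
\]
\[
  \mathcal{P}_k \;=\; \Bigl\{ P' \in \mathcal{P} \;:\;
    L_k(P') \,\le\, \min_{P'' \in \mathcal{P}} L_k(P'') + \beta_k \Bigr\}
\]

Under this reading, the next value function comes from optimistic planning, V_k = max_{P' in \mathcal{P}_k} max_\pi V^\pi_{P'}, and the Eluder dimension of \mathcal{P} controls how quickly the sets \mathcal{P}_k shrink, which is what drives the regret bound.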

---------------
Reinforcement Learning Virtual School (March and April 2021).

Organized by the Toulouse AI institute ANITI, with the support of IRT Saint Exupéry, ISAE-SUPAERO, LAAS-CNRS, and Université Fédérale Toulouse Midi-Pyrénées.

Officially sponsored by DeepMind.

We also thank Institut de Mathématiques de Toulouse - Université Toulouse III - Paul Sabatier, and Toulouse School of Economics - Université Toulouse I - Capitole.

This virtual event took place in March and April 2021. It gathered 1500 registered participants from 43 different countries.