Все публикации

RL Theory Seminar

RL Theory Seminar 2024: Audrey Huang (October 22)

RL Theory Seminar

RL Theory Seminar 2024: Zakaria Mhammedi (October 15)

Zeyu Jia -

Zeyu Jia - Offline Reinforcement Learning: Role of State Aggregation and Trajectory Data

Quanquan Gu -

Quanquan Gu - Self-Play Preference Optimization for Language Model Alignment

Simon Du -

Simon Du - When are Offline Multi-Agent Games Solvable?

Shipra Agrawal -

Shipra Agrawal - Optimistic Q-learning for average reward and episodic RL

Sharan Vaswani -

Sharan Vaswani - Towards Principled, Practical Policy Gradient for Bandits and Tabular MDPs

Niao He -

Niao He - Reinforcement Learning in Mean Field Games: the pitfalls and promises

Philip Amortila -

Philip Amortila - Scalable Online Exploration via Coverability

Philip Amortila -

Philip Amortila - Statistical and Algorithmic Reductions for RL From Rich Observations

Gergely Neu -

Gergely Neu - Bisimulation Metrics are Optimal Transport Distances, and Can be Computed Efficiently

Ki Hong -

Ki Hong - Computationally Efficient Alg for Infinite-Horizon Average Reward RL with Linear MDPs

Ishani Aniruddha Karmarkar

Ishani Aniruddha Karmarkar - Truncated Variance Reduced Value Iteration

Kevin Jamieson -

Kevin Jamieson - On the Instance-dependent Sample Complexity of Tabular RL

Dongruo Zhou -

Dongruo Zhou - Uncertainty-Aware Reward-Free Exploration with General Function Approximation

Brendan O’Donoghue -

Brendan O’Donoghue - Efficient exploration in deep RL via utility theory

RL theory seminar

RL theory seminar 2024: Uri Sherman (May 14)

RL theory seminar

RL theory seminar 2024: Gene Li (May 7)

RL theory seminar

RL theory seminar 2024: Sergey Samsonov (Apr 30)

RL theory seminar

RL theory seminar 2024: Hamish Flynn (Apr 23)

RL theory seminar

RL theory seminar 2024: Andrew Wagenmaker (Apr 16)

RL theory seminar

RL theory seminar 2024: Ayush Sekhari (Apr 9)

RL theory seminar

RL theory seminar 2024: Matthew Zurek (Apr 2)

RL theory seminar

RL theory seminar 2024: Roberto Cipollone (Mar 26)