Zhang-Wei Hong on Explore and Exploit Data in Reinforcement Learning | Toronto AIR Seminar

Abstract:
Reinforcement learning (RL) is a data-driven method for solving sequential decision-making problems from experience gathered by interacting with the environment. RL has been shown to learn non-trivial controllers for robot locomotion and manipulation that are challenging for model-based planning. However, its intensive data requirements prevent RL from being widely applied in robotics. Even in simulators, training can take several weeks to obtain a satisfactory policy. Prior works circumvent these data requirements either by using curiosity-driven strategies (a.k.a. exploration bonuses or intrinsic rewards) to improve exploration (data collection), or by learning from datasets curated by humans or pre-programmed controllers (offline RL). In this talk, I will illustrate the fallacy of curiosity-driven exploration strategies and the sensitivity of offline RL algorithms to the data distribution.

Paper:

Bio:
Zhang-Wei Hong is a Ph.D. candidate in the Department of Electrical Engineering and Computer Science at the Massachusetts Institute of Technology (MIT). He received his B.S. and M.S. degrees from National Tsing Hua University in Taiwan and has conducted research internships at TU Darmstadt in Germany and Preferred Networks (PFN) in Japan. Zhang-Wei's research interests lie at the intersection of reinforcement learning and optimization, with a focus on developing principled algorithms to improve the usability of RL in real-world scenarios. His work has been published in top-tier conferences such as NeurIPS, ICLR, ICRA, and CoRL.

Toronto AIR Seminar:
The Toronto AI Robotics Seminar Series is a set of events featuring young robotics and AI experts. The talks are given by local and international speakers and are organized by the faculty and students of the University of Toronto's Department of Computer Science. We welcome students, researchers, and robotics enthusiasts from around the world to join us and interact with the Toronto robotics community.

AI Robotics Seminar - University of Toronto
