RSS 2021, Spotlight Talk 11: Safe Reinforcement Learning via Statistical Model Predictive Shielding

**Safe Reinforcement Learning via Statistical Model Predictive Shielding**
Osbert Bastani (University of Pennsylvania); Shuo Li (University of Pennsylvania); Anton Xue (University of Pennsylvania)

**Abstract**
Reinforcement learning is a promising approach to solving hard robotics tasks. An important challenge is ensuring safety--e.g., that a walking robot does not fall over or an autonomous car does not crash into an obstacle. We build on an approach that composes the learned policy with a backup policy--it uses the learned policy on the interior of the region where the backup policy is guaranteed to be safe, and switches to the backup policy on the boundary of this region. The key challenge is checking when the backup policy is guaranteed to be safe. Our algorithm, statistical model predictive shielding (SMPS), uses sampling-based verification and linear systems analysis to perform this check. We prove that SMPS ensures safety with high probability, and empirically evaluate its performance on several benchmarks.
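The composition of a learned policy with a backup policy described in the abstract can be illustrated with a small sketch. This is not the SMPS algorithm itself: in place of sampling-based verification and linear systems analysis, it checks recoverability with a single deterministic rollout on a toy 1D cart approaching a wall. All names (`step`, `learned_policy`, `backup_policy`, `is_recoverable`, `shielded_policy`) and constants are hypothetical and chosen only for illustration.

```python
from typing import Callable, List, Tuple

State = Tuple[float, float]  # (position, velocity); toy model, not from the paper

DT = 0.1          # integration step (assumed)
WALL = 1.0        # the cart is unsafe once position >= WALL (assumed)
MAX_ACCEL = 1.0   # actuator limit (assumed)


def step(state: State, accel: float) -> State:
    """Advance the toy cart one step; velocity is clamped at zero (no reversing)."""
    x, v = state
    accel = max(-MAX_ACCEL, min(MAX_ACCEL, accel))
    return (x + v * DT, max(0.0, v + accel * DT))


def is_safe(state: State) -> bool:
    return state[0] < WALL


def learned_policy(state: State) -> float:
    """Stand-in for a learned policy: always accelerate toward the wall."""
    return MAX_ACCEL


def backup_policy(state: State) -> float:
    """Backup policy: brake as hard as possible."""
    return -MAX_ACCEL


def is_recoverable(state: State, horizon: int = 200) -> bool:
    """Roll out the backup policy and check it stops the cart without a crash."""
    for _ in range(horizon):
        if not is_safe(state):
            return False
        if state[1] == 0.0:  # stopped: braking keeps the cart here forever
            return True
        state = step(state, backup_policy(state))
    return False


def shielded_policy(state: State) -> float:
    """Use the learned action only if the backup policy can still recover afterwards."""
    candidate = learned_policy(state)
    if is_recoverable(step(state, candidate)):
        return candidate
    return backup_policy(state)


def simulate(policy: Callable[[State], float], start: State, steps: int) -> List[State]:
    """Return the visited states, including the start state."""
    trajectory = [start]
    for _ in range(steps):
        trajectory.append(step(trajectory[-1], policy(trajectory[-1])))
    return trajectory


if __name__ == "__main__":
    shielded = simulate(shielded_policy, (0.0, 0.0), 300)
    print("max position with shield:", max(x for x, _ in shielded))
```

In this sketch the shield preserves an invariant: if the current state is recoverable, the next state is too, because either the learned action was checked to lead to a recoverable state or the backup action is taken. The real method has to establish the analogous check for a backup policy on a more complex system, which is where the statistical verification comes in.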

Robotics: Science and Systems
