10-701 Student Presentation - Robust Iterative Policy Search for Aerobatic Helicopter Flight


Jiaji Zhou and Kumar Shaurya Shankar

In many real-world robotics applications, where dynamics and sensor data are noisy, robustness of the control algorithm is essential. Additionally, most physical systems are high-dimensional, so data are unlikely to be available for every part of the state space. Because no formulated model is likely to capture all of the underlying system dynamics, learning-based control algorithms should exhibit some degree of robustness to undermodeling.

In this research project we intend to explore a hybrid reinforcement learning approach to a control problem. Specifically, we propose combining Iterative Linear Quadratic Regulator (iLQR) methods with Policy Search by Dynamic Programming (PSDP).
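For readers unfamiliar with the LQR building block, the following is a minimal sketch of a finite-horizon, discrete-time LQR backward pass (Riccati recursion) on a scalar linear system. It is not the presenters' method: iLQR would repeat a pass like this around a linearization of nonlinear dynamics, and the PSDP combination is not shown. The system parameters (`a`, `b`), cost weights (`q`, `r`, `qf`), and function names are all hypothetical values chosen for illustration.

```python
def lqr_backward_pass(a, b, q, r, qf, horizon):
    """Time-varying feedback gains for x[t+1] = a*x[t] + b*u[t]
    with cost sum(q*x^2 + r*u^2) + qf*x[T]^2 (all scalars, illustrative)."""
    p = qf                      # cost-to-go weight at the final step
    gains = []
    for _ in range(horizon):    # sweep backward from t = T-1 to t = 0
        k = (b * p * a) / (r + b * p * b)   # optimal gain for u = -k * x
        p = q + a * p * (a - b * k)         # Riccati update of cost-to-go
        gains.append(k)
    gains.reverse()             # reorder so gains[t] applies at time t
    return gains


def rollout(a, b, gains, x0):
    """Simulate the closed loop u[t] = -gains[t] * x[t] and return x[T]."""
    x = x0
    for k in gains:
        x = a * x + b * (-k * x)
    return x


# Hypothetical open-loop-unstable system (|a| > 1), assumed for this sketch.
a, b = 1.2, 1.0
gains = lqr_backward_pass(a, b, q=1.0, r=1.0, qf=1.0, horizon=50)
x_final = rollout(a, b, gains, x0=1.0)
x_open_loop = rollout(a, b, [0.0] * 50, x0=1.0)
```

With these assumed values the uncontrolled state grows as `1.2**50`, while the feedback gains from the backward pass shrink the state toward zero. A real helicopter model would be multivariate and nonlinear, so the scalars here would become matrices obtained by linearizing about a nominal trajectory.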

Alex Smola
Machine Learning
10-701
CMU
