Abstract
In this paper, we propose a novel algorithm for learning from demonstration, which can learn a policy function robustly from a large number of demonstrations with mixed qualities. While most of the existing approaches assume that demonstrations are collected from skillful experts, the proposed method alleviates such restrictions by estimating the proficiency level of each demonstration using the proposed leverage optimization. Furthermore, a novel leveraged cost function is proposed to represent a policy function using deep neural networks by reformulating the objective function of leveraged Gaussian process regression using the representer theorem. The proposed method is successfully applied to autonomous track driving tasks, where a large number of demonstrations with mixed qualities are given as training data without labels.
Original language | English |
---|---|
Title of host publication | IROS 2017 - IEEE/RSJ International Conference on Intelligent Robots and Systems |
Publisher | Institute of Electrical and Electronics Engineers Inc. |
Pages | 3926-3931 |
Number of pages | 6 |
ISBN (Electronic) | 9781538626825 |
DOIs | |
Publication status | Published - 2017 Dec 13 |
Externally published | Yes |
Event | 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2017 - Vancouver, Canada Duration: 2017 Sept 24 → 2017 Sept 28 |
Publication series
Name | IEEE International Conference on Intelligent Robots and Systems |
---|---|
Volume | 2017-September |
ISSN (Print) | 2153-0858 |
ISSN (Electronic) | 2153-0866 |
Other
Other | 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2017 |
---|---|
Country/Territory | Canada |
City | Vancouver |
Period | 17/9/24 → 17/9/28 |
Bibliographical note
Publisher Copyright:© 2017 IEEE.
ASJC Scopus subject areas
- Control and Systems Engineering
- Software
- Computer Vision and Pattern Recognition
- Computer Science Applications