Abstract
In this paper, we consider the problem of autonomous driving using imitation learning in a semi-supervised manner. In particular, both labeled and unlabeled demonstrations are leveraged during training by estimating the quality of each unlabeled demonstration. If the provided demonstrations are corrupted and have a low signal-to-noise ratio, the performance of the imitation learning agent can be degraded significantly. To mitigate this problem, we propose a method called semi-supervised imitation learning (SSIL). SSIL first learns how to discriminate and evaluate each state-action pair's reliability in unlabeled demonstrations by assigning higher reliability values to demonstrations similar to labeled expert demonstrations. This reliability value is called leverage. After this discrimination process, labeled and unlabeled demonstrations with estimated leverage values are utilized while training the policy in a semi-supervised manner. The experimental results demonstrate the validity of the proposed algorithm using unlabeled trajectories with mixed qualities. Moreover, the hardware experiments using an RC car are conducted to show that the proposed method can be applied to real-world applications.
Original language | English |
---|---|
Title of host publication | 2022 22nd International Conference on Control, Automation and Systems, ICCAS 2022 |
Publisher | IEEE Computer Society |
Pages | 20-25 |
Number of pages | 6 |
ISBN (Electronic) | 9788993215243 |
DOIs | |
Publication status | Published - 2022 |
Event | 22nd International Conference on Control, Automation and Systems, ICCAS 2022 - Busan, Korea, Republic of Duration: 2022 Nov 27 → 2022 Dec 1 |
Publication series
Name | International Conference on Control, Automation and Systems |
---|---|
Volume | 2022-November |
ISSN (Print) | 1598-7833 |
Conference
Conference | 22nd International Conference on Control, Automation and Systems, ICCAS 2022 |
---|---|
Country/Territory | Korea, Republic of |
City | Busan |
Period | 22/11/27 → 22/12/1 |
Bibliographical note
Funding Information:This work was supported by the Institute of Information and communications Technology Planning and Evaluation(IITP) grant funded by the Korea government(MSIT) (No. 2019-0-01309, Development of AI Technology for Guidance of a Mobile Robot to its Goal with Uncertain Maps in Indoor/Outdoor Environments).
Funding Information:
This work was supported by the Institute of Information & communications Technology Planning & Evaluation(IITP) grant funded by the Korea government(MSIT) (No. 2019-0-01309, Development of AI Technology for Guidance of a Mobile Robot to its Goal with Uncertain Maps in Indoor/Outdoor Environments).
Publisher Copyright:
© 2022 ICROS.
Keywords
- Autonomous driving
- Deep learning
- Imitation learning
ASJC Scopus subject areas
- Artificial Intelligence
- Computer Science Applications
- Control and Systems Engineering
- Electrical and Electronic Engineering