Abstract
For a service robot to assist humans, it should interact with objects of varying sizes and shapes existing in an indoor environment. 3D object detection must be preceded to achieve this goal since it provides the robot with the ability to perceive visual information. Most of the existing methods are anchor-based and predict the bounding box close to the ground truth among multiple candidates. However, it is complex to compute Intersection over Union (IoU) and Non-Maximum Suppression (NMS) per each anchor box. Therefore, we propose keypoint-based monocular 3D object detection, where each object's center location is only needed for reproducing predicted 3D bounding box without extra computation of the anchor boxes. Our 3D object detection also works well even if images are rotated corresponding to the robot's head movement. To properly train our network, the object center is based on a projected 3D location instead of 2D to take advantage of the exact center position of the object. Furthermore, we apply data augmentation using a perspective transformation. The method facilitates adding a small perturbation to the camera rotation angle randomly. We use the SUN RGB-D dataset, which has images taken indoor scenes with camera rotations for training and test set. Our approach particularly shows that the errors of object center location based on a single image reduce 15.4% and 24.2%, respectively, compared to the method without data augmentation.
Original language | English |
---|---|
Title of host publication | 29th IEEE International Conference on Robot and Human Interactive Communication, RO-MAN 2020 |
Publisher | Institute of Electrical and Electronics Engineers Inc. |
Pages | 438-445 |
Number of pages | 8 |
ISBN (Electronic) | 9781728160757 |
DOIs | |
Publication status | Published - 2020 Aug |
Event | 29th IEEE International Conference on Robot and Human Interactive Communication, RO-MAN 2020 - Virtual, Naples, Italy Duration: 2020 Aug 31 → 2020 Sept 4 |
Publication series
Name | 29th IEEE International Conference on Robot and Human Interactive Communication, RO-MAN 2020 |
---|
Conference
Conference | 29th IEEE International Conference on Robot and Human Interactive Communication, RO-MAN 2020 |
---|---|
Country/Territory | Italy |
City | Virtual, Naples |
Period | 20/8/31 → 20/9/4 |
Bibliographical note
Funding Information:ACKNOWLEDGMENT This work was supported by KIST flagship program under Project 2E30280.
Publisher Copyright:
© 2020 IEEE.
ASJC Scopus subject areas
- Artificial Intelligence
- Human-Computer Interaction
- Social Psychology
- Communication