Monocular 3D object detection for an indoor robot environment

Jiwon Kim, Gi Jae Lee, Jun Sik Kim, Hyunwoo J. Kim, Kang Geon Kim

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

1 Citation (Scopus)

Abstract

For a service robot to assist humans, it must interact with objects of varying sizes and shapes in an indoor environment. 3D object detection is a prerequisite for this goal, since it gives the robot the ability to perceive visual information. Most existing methods are anchor-based and select, from multiple candidates, the bounding box closest to the ground truth. However, computing Intersection over Union (IoU) and Non-Maximum Suppression (NMS) for every anchor box is complex. We therefore propose keypoint-based monocular 3D object detection, in which only each object's center location is needed to reconstruct the predicted 3D bounding box, with no extra computation over anchor boxes. Our 3D object detection also works well when images are rotated by the robot's head movement. To train the network properly, the object center is based on the projected 3D location rather than on the 2D center, which takes advantage of the exact center position of the object. Furthermore, we apply data augmentation using a perspective transformation, which adds a small random perturbation to the camera rotation angle. We use the SUN RGB-D dataset, which contains images of indoor scenes taken with camera rotations, for the training and test sets. In particular, our approach reduces the errors of object center location estimated from a single image by 15.4% and 24.2%, respectively, compared to the method without data augmentation.
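
The two geometric ideas in the abstract, using the projected 3D center as the keypoint and perturbing the camera rotation through a perspective transformation, can be illustrated with a short NumPy sketch. This is not the authors' implementation. It assumes a pinhole camera with a hypothetical intrinsic matrix `K`, a pure rotation `R` of the camera (no translation), and an arbitrary axis order for composing the rotation. All function names and numeric values are illustrative only. Under those assumptions, a pure camera rotation moves pixels by the homography `H = K R K^-1`, so the keypoint label can be moved together with the image.

```python
import numpy as np

def rotation_matrix(rx, ry, rz):
    """Rotation from angles in radians about the camera x, y, z axes.
    The composition order Rz @ Ry @ Rx is an assumption of this sketch."""
    cx, sx = np.cos(rx), np.sin(rx)
    cy, sy = np.cos(ry), np.sin(ry)
    cz, sz = np.cos(rz), np.sin(rz)
    Rx = np.array([[1.0, 0.0, 0.0], [0.0, cx, -sx], [0.0, sx, cx]])
    Ry = np.array([[cy, 0.0, sy], [0.0, 1.0, 0.0], [-sy, 0.0, cy]])
    Rz = np.array([[cz, -sz, 0.0], [sz, cz, 0.0], [0.0, 0.0, 1.0]])
    return Rz @ Ry @ Rx

def random_small_rotation(rng, max_deg=5.0):
    """Draw a small random rotation; the +/- max_deg range is hypothetical."""
    angles = np.deg2rad(rng.uniform(-max_deg, max_deg, size=3))
    return rotation_matrix(*angles)

def project(K, X):
    """Project a 3D point X (camera coordinates, z > 0) to pixel coordinates."""
    x = K @ X
    return x[:2] / x[2]

def rotation_homography(K, R):
    """Homography induced on the image by a pure camera rotation R."""
    return K @ R @ np.linalg.inv(K)

def warp_point(H, uv):
    """Apply a homography to one pixel location."""
    p = H @ np.array([uv[0], uv[1], 1.0])
    return p[:2] / p[2]

# Hypothetical intrinsics and object center, for illustration only.
K = np.array([[530.0, 0.0, 320.0],
              [0.0, 530.0, 240.0],
              [0.0, 0.0, 1.0]])
center_3d = np.array([0.4, -0.2, 2.5])   # meters, camera frame

rng = np.random.default_rng(0)
R = random_small_rotation(rng)
H = rotation_homography(K, R)

keypoint = project(K, center_3d)          # projected 3D center as the keypoint
keypoint_aug = warp_point(H, keypoint)    # keypoint after the augmentation
center_3d_aug = R @ center_3d             # 3D label in the perturbed camera frame
```

In this sketch, `keypoint_aug` coincides with `project(K, center_3d_aug)`, which is why the 2D keypoint and the 3D label stay consistent after the warp. To warp the image itself with the same `H`, one option would be OpenCV's `cv2.warpPerspective(image, H, (width, height))`.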

Original language: English
Title of host publication: 29th IEEE International Conference on Robot and Human Interactive Communication, RO-MAN 2020
Publisher: Institute of Electrical and Electronics Engineers Inc.
Pages: 438-445
Number of pages: 8
ISBN (Electronic): 9781728160757
Publication status: Published - 2020 Aug
Event: 29th IEEE International Conference on Robot and Human Interactive Communication, RO-MAN 2020 - Virtual, Naples, Italy
Duration: 2020 Aug 31 to 2020 Sept 4

Publication series

Name: 29th IEEE International Conference on Robot and Human Interactive Communication, RO-MAN 2020

Conference

Conference: 29th IEEE International Conference on Robot and Human Interactive Communication, RO-MAN 2020
Country/Territory: Italy
City: Virtual, Naples
Period: 20/8/31 to 20/9/4

Bibliographical note

Funding Information:
This work was supported by the KIST flagship program under Project 2E30280.

Publisher Copyright:
© 2020 IEEE.

ASJC Scopus subject areas

  • Artificial Intelligence
  • Human-Computer Interaction
  • Social Psychology
  • Communication
