Monocular Depth Estimation for Autonomous Driving Based on Instance Clustering Guidance

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

A novel monocular depth estimator for autonomous driving, which produces reliable instance depths via instance clustering guidance, is proposed in this work. First, we extract multi-scale feature maps from a road scene and initialize depth clusters. Second, we update the depth clusters using the feature maps through transposed cross-attention. To guide the update process, we develop the instance clustering membership (ICM) loss, which employs an instance segmentation map. Third, we transfer the updated depth clusters to the feature map at the finest resolution, from which we produce the final depth map. Extensive experimental results show that the proposed algorithm yields competitive results to state-of-the-art techniques on the KITTI, Cityscapes-DVPS, and SemKITTI-DVPS datasets.

Original languageEnglish
Title of host publicationAPSIPA ASC 2024 - Asia Pacific Signal and Information Processing Association Annual Summit and Conference 2024
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9798350367331
DOIs
Publication statusPublished - 2024
Event2024 Asia Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2024 - Macau, China
Duration: 2024 Dec 32024 Dec 6

Publication series

NameAPSIPA ASC 2024 - Asia Pacific Signal and Information Processing Association Annual Summit and Conference 2024

Conference

Conference2024 Asia Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2024
Country/TerritoryChina
CityMacau
Period24/12/324/12/6

Bibliographical note

Publisher Copyright:
© 2024 IEEE.

ASJC Scopus subject areas

  • Artificial Intelligence
  • Computer Science Applications
  • Hardware and Architecture
  • Signal Processing

Fingerprint

Dive into the research topics of 'Monocular Depth Estimation for Autonomous Driving Based on Instance Clustering Guidance'. Together they form a unique fingerprint.

Cite this