Abstract
In autonomous driving applications, there is a strong preference for modeling the world in Bird's-Eye View (BEV), as it leads to improved accuracy and performance. BEV features are widely used in perception tasks since they allow fusing information from multiple views in an efficient manner. However, BEV features generated from camera images are prone to be imprecise due to the difficulty of estimating depth in the perspective view. Improper placement of BEV features limits the accuracy of downstream tasks. We introduce a method for incorporating map information to improve perspective depth estimation from 2D camera images and thereby producing geometrically- and semantically-robust BEV features. We show that augmenting the camera images with the BEV map and map-to-camera projections can compensate for the depth uncertainty and enrich camera-only BEV features with road contexts. Experiments on the nuScenes dataset demonstrate that our method outperforms previous approaches using only camera images in segmentation and detection tasks.
Original language | English |
---|---|
Title of host publication | Proceedings - 2024 IEEE Winter Conference on Applications of Computer Vision, WACV 2024 |
Publisher | Institute of Electrical and Electronics Engineers Inc. |
Pages | 7404-7413 |
Number of pages | 10 |
ISBN (Electronic) | 9798350318920 |
DOIs | |
Publication status | Published - 2024 Jan 3 |
Event | 2024 IEEE Winter Conference on Applications of Computer Vision, WACV 2024 - Waikoloa, United States Duration: 2024 Jan 4 → 2024 Jan 8 |
Publication series
Name | Proceedings - 2024 IEEE Winter Conference on Applications of Computer Vision, WACV 2024 |
---|
Conference
Conference | 2024 IEEE Winter Conference on Applications of Computer Vision, WACV 2024 |
---|---|
Country/Territory | United States |
City | Waikoloa |
Period | 24/1/4 → 24/1/8 |
Bibliographical note
Publisher Copyright:© 2024 IEEE.
Keywords
- Algorithms
- Applications
- Autonomous Driving
- Image recognition and understanding
ASJC Scopus subject areas
- Artificial Intelligence
- Computer Science Applications
- Computer Vision and Pattern Recognition