Robust Camera Pose Refinement for Multi-Resolution Hash Encoding

  • Hwan Heo
  • Taekyung Kim
  • Jiyoung Lee
  • Jaewon Lee
  • Soohyun Kim
  • Hyunwoo J. Kim*
  • Jin Hwa Kim*

*Corresponding author for this work

Research output: Contribution to journal › Conference article › peer-review

Abstract

Multi-resolution hash encoding has recently been proposed to reduce the computational cost of neural renderings, such as NeRF. This method requires accurate camera poses for the neural renderings of given scenes. However, contrary to previous methods jointly optimizing camera poses and 3D scenes, the naïve gradient-based camera pose refinement method using multi-resolution hash encoding severely deteriorates performance. We propose a joint optimization algorithm to calibrate the camera pose and learn a geometric representation using efficient multi-resolution hash encoding. Showing that the oscillating gradient flows of hash encoding interfere with the registration of camera poses, our method addresses the issue by utilizing smooth interpolation weighting to stabilize the gradient oscillation for the ray samplings across hash grids. Moreover, the curriculum training procedure helps to learn the level-wise hash encoding, further increasing the pose refinement. Experiments on the novel-view synthesis datasets validate that our learning frameworks achieve state-of-the-art performance and rapid convergence of neural rendering.
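The contrast between plain linear interpolation and a smoothed interpolation weight can be illustrated with a minimal one-dimensional sketch. The abstract does not state the exact form of the smooth weighting, so the cosine-based weight below is an assumption chosen for illustration, and the names `linear_weight`, `smooth_weight`, `interpolate`, and `position_gradient` are hypothetical. The sketch shows only that the gradient with respect to the sample position jumps at a grid-cell boundary under linear weights and nearly vanishes there under the smooth weight.

```python
import math

def linear_weight(t):
    # Standard linear interpolation weight inside a grid cell, t in [0, 1].
    return t

def smooth_weight(t):
    # Assumed smooth alternative (cosine-based); its slope is zero at t = 0 and t = 1.
    return (1.0 - math.cos(math.pi * t)) / 2.0

def interpolate(grid, x, weight):
    # Interpolate a 1D feature grid at continuous position x.
    i = max(0, min(int(math.floor(x)), len(grid) - 2))
    t = x - i
    w = weight(t)
    return (1.0 - w) * grid[i] + w * grid[i + 1]

def position_gradient(grid, x, weight, eps=1e-5):
    # Central finite difference of the interpolated value with respect to x.
    hi = interpolate(grid, x + eps, weight)
    lo = interpolate(grid, x - eps, weight)
    return (hi - lo) / (2.0 * eps)

# Toy feature grid (illustrative values, not from the paper).
grid = [0.0, 1.0, -1.0, 0.5]
delta = 1e-3  # distance from the cell boundary at x = 1

# Size of the gradient jump across the boundary for each weighting.
linear_jump = abs(position_gradient(grid, 1.0 - delta, linear_weight)
                  - position_gradient(grid, 1.0 + delta, linear_weight))
smooth_jump = abs(position_gradient(grid, 1.0 - delta, smooth_weight)
                  - position_gradient(grid, 1.0 + delta, smooth_weight))

print("gradient jump with linear weights:", linear_jump)
print("gradient jump with smooth weights:", smooth_jump)
```

Both weightings return the stored grid values at the grid nodes; only the behaviour of the position gradient near cell boundaries differs. That gradient is the quantity that would reach the camera pose in a gradient-based refinement.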

Original language: English
Pages (from-to): 13000-13016
Number of pages: 17
Journal: Proceedings of Machine Learning Research
Volume: 202
Publication status: Published - 2023
Event: 40th International Conference on Machine Learning, ICML 2023 - Honolulu, United States
Duration: 2023 Jul 23 to 2023 Jul 29

Bibliographical note

Publisher Copyright:
© 2023 Proceedings of Machine Learning Research. All rights reserved.

ASJC Scopus subject areas

  • Artificial Intelligence
  • Software
  • Control and Systems Engineering
  • Statistics and Probability
