Kendall, A and Cipolla, R and IEEE, (2017) Geometric loss functions for camera pose regression with deep learning. In: 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017-7-22 to -- pp. 6555-6564..
Full text not available from this repository.Abstract
Deep learning has shown to be effective for robust and real-time monocular image relocalisation. In particular, PoseNet [22] is a deep convolutional neural network which learns to regress the 6-DOF camera pose from a single im- age. It learns to localize using high level features and is ro- bust to difficult lighting, motion blur and unknown camera intrinsics, where point based SIFT registration fails. How- ever, it is trained using a naive loss function, with hyper- parameters which require expensive tuning. In this paper, we give the problem a more fundamental theoretical treat- ment. We explore a number of novel loss functions for learning camera pose which are based on geometry and scene reprojection error. Additionally we show how to au- tomatically learn an optimal weighting to simultaneously regress position and orientation. By leveraging geometry, we demonstrate that our technique significantly improves PoseNet’s performance across datasets ranging from indoor rooms to a small city.
Item Type: | Conference or Workshop Item (UNSPECIFIED) |
---|---|
Subjects: | UNSPECIFIED |
Divisions: | Div F > Machine Intelligence |
Depositing User: | Cron Job |
Date Deposited: | 22 Jan 2018 20:11 |
Last Modified: | 11 Apr 2021 20:43 |
DOI: | 10.1109/CVPR.2017.694 |