Semantic Pose Verification for Outdoor Visual Localization With Self-Supervised Contrastive Learning

Loading...

Date

2022

Authors

Orhan, Semih
Baştanlar, Yalın

Journal Title

Journal ISSN

Volume Title

Publisher

IEEE

Open Access Color

Green Open Access

No

OpenAIRE Downloads

OpenAIRE Views

Publicly Funded

No
Impulse
Average
Influence
Average
Popularity
Average

relationships.isProjectOf

relationships.isJournalIssueOf

Abstract

Any city-scale visual localization system has to overcome long-term appearance changes, such as varying illumination conditions or seasonal changes between query and database images. Since semantic content is more robust to such changes, we exploit semantic information to improve visual localization. In our scenario, the database consists of gnomonic views generated from panoramic images (e.g. Google Street View) and query images are collected with a standard field-of-view camera at a different time. To improve localization, we check the semantic similarity between query and database images, which is not trivial since the position and viewpoint of the cameras do not exactly match. To learn similarity, we propose training a CNN in a self-supervised fashion with contrastive learning on a dataset of semantically segmented images. With experiments we showed that this semantic similarity estimation approach works better than measuring the similarity at pixel-level. Finally, we used the semantic similarity scores to verify the retrievals obtained by a state-of-the-art visual localization method and observed that contrastive learning-based pose verification increases top-1 recall value to 0.90 which corresponds to a 2% improvement.

Description

This work was supported by the Scientific and Technological Research Council of Turkey under Grant No. 120E500 and also under 2214-A International Researcher Fellowship Programme.

Keywords

Visual localization, Cameras, Semantic similarity, Query processing

Fields of Science

Citation

WoS Q

N/A

Scopus Q

Q1
OpenCitations Logo
OpenCitations Citation Count
6

Source

IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)

Volume

2022-June

Issue

Start Page

3988

End Page

3997
PlumX Metrics
Citations

Scopus : 6

Captures

Mendeley Readers : 10

SCOPUS™ Citations

6

checked on Apr 27, 2026

Web of Science™ Citations

7

checked on Apr 27, 2026

Page Views

524

checked on Apr 27, 2026

Downloads

236

checked on Apr 27, 2026

Google Scholar Logo
Google Scholar™
OpenAlex Logo
OpenAlex FWCI
2.91706358

Sustainable Development Goals

SDG data is not available