Orhan, Semih

Profile URL

https://hdl.handle.net/11147/16302

Main Affiliation

01.01. Units Affiliated to the Rectorate

Status

External

Sustainable Development Goals

SDG	Goal	Research Products
1	NO POVERTY	0
2	ZERO HUNGER	0
3	GOOD HEALTH AND WELL-BEING	0
4	QUALITY EDUCATION	0
5	GENDER EQUALITY	0
6	CLEAN WATER AND SANITATION	0
7	AFFORDABLE AND CLEAN ENERGY	0
8	DECENT WORK AND ECONOMIC GROWTH	0
9	INDUSTRY, INNOVATION AND INFRASTRUCTURE	1
10	REDUCED INEQUALITIES	0
11	SUSTAINABLE CITIES AND COMMUNITIES	0
12	RESPONSIBLE CONSUMPTION AND PRODUCTION	0
13	CLIMATE ACTION	0
14	LIFE BELOW WATER	0
15	LIFE ON LAND	0
16	PEACE, JUSTICE AND STRONG INSTITUTIONS	0
17	PARTNERSHIPS FOR THE GOALS	0

This researcher does not have a Scopus ID.

This researcher does not have a WoS ID.

No records found in other affiliations.

Scholarly Output

7

Articles

2

Views / Downloads

70535/2502

Supervised MSc Theses

1

Supervised PhD Theses

1

WoS Citation Count

64

Scopus Citation Count

76

Patents

0

Projects

0

WoS Citations per Publication

9.14

Scopus Citations per Publication

10.86

Open Access Source

5

Supervised Theses

2

Journal	Count
18th IEEE/CVF International Conference on Computer Vision Workshops, ICCVW 2021	1
25th Signal Processing and Communications Applications Conference (SIU) -- MAY 15-18, 2017 -- Antalya, TURKEY	1
Electronics Letters	1
IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)	1
Signal, Image and Video Processing	1

Scholarly Output Search Results

Now showing 1 - 7 of 7

Effect of Patch-Based Training on Object Localization with Convolutional Neural Networks (Parça Tabanlı Eğitimin Evrişimli Yapay Sinir Ağları ile Nesne Konumlandırma Üzerindeki Etkisi)
(IEEE, 2017) Orhan, Semih; Baştanlar, Yalın; 03.04. Department of Computer Engineering; 01.01. Units Affiliated to the Rectorate; 01. Izmir Institute of Technology; 03. Faculty of Engineering
In recent years, convolutional neural networks (CNNs) have shown great performance not only in image classification and recognition but also in several other computer vision tasks. Many models with different numbers of layers and depths have been proposed. In this work, we use deep neural networks to identify the locations of leopards in images. Two methods are applied: the first trains the network on entire images, and the second trains it on image patches cropped from the full-size images. The patch-trained model performed better than the model trained on full-size images.
Citation - WoS: 5
Citation - Scopus: 10
Efficient Search in a Panoramic Image Database for Long-Term Visual Localization
(IEEE, 2021) Baştanlar, Yalın; Orhan, Semih; 03.04. Department of Computer Engineering; 01.01. Units Affiliated to the Rectorate; 01. Izmir Institute of Technology; 03. Faculty of Engineering
In this work, we focus on a localization technique that is based on image retrieval. In this technique, database images are stored with GPS coordinates, and the geographic location of the retrieved database image serves as an approximate position of the query image. In our scenario, the database consists of panoramic images (e.g. Google Street View) and query images are collected with a standard field-of-view camera at a different time. When searching for the match of a perspective query image in a panoramic image database, unlike previous studies, we do not generate a number of perspective images from the panoramic image. Instead, taking advantage of CNNs, we slide a search window over the last convolutional layer of the panoramic image and compute the similarity with the descriptor extracted from the query image. In this way, more locations are visited in less time. We conducted experiments with state-of-the-art descriptors, and the results reveal that the proposed sliding-window approach reaches higher accuracy than generating 4 or 8 perspective images.
Citation - WoS: 7
Citation - Scopus: 6
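The sliding-window search described in the abstract above can be illustrated with a minimal sketch. It assumes the panorama's last convolutional layer is available as a nested list indexed [channel][row][column], pools each window with a per-channel maximum (a MAC-style descriptor chosen here only for brevity, not necessarily one of the descriptors the paper evaluates), wraps windows around the horizontal seam of the panorama, and compares each window to the query descriptor by cosine similarity. All function names are hypothetical.

```python
import math


def cosine(a, b):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a > 0 and norm_b > 0 else 0.0


def window_descriptor(fmap, col_start, width):
    """Max-pool every channel over a window of columns.

    fmap is indexed [channel][row][column]; columns wrap around because a
    panorama is horizontally periodic (an assumption of this sketch).
    """
    rows = len(fmap[0])
    cols = len(fmap[0][0])
    return [
        max(channel[r][(col_start + k) % cols]
            for r in range(rows) for k in range(width))
        for channel in fmap
    ]


def best_window(fmap, query_desc, width, stride=1):
    """Return (similarity, start_column) of the window most similar to the query."""
    cols = len(fmap[0][0])
    scored = [
        (cosine(window_descriptor(fmap, start, width), query_desc), start)
        for start in range(0, cols, stride)
    ]
    return max(scored, key=lambda pair: pair[0])
```

Because all windows are read from one feature map, the panorama passes through the CNN only once, which is consistent with the abstract's remark that more locations are visited in less time.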
Semantic Pose Verification for Outdoor Visual Localization With Self-Supervised Contrastive Learning
(IEEE, 2022) Guerrero, Jose J.; Orhan, Semih; Baştanlar, Yalın; 03.04. Department of Computer Engineering; 01.01. Units Affiliated to the Rectorate; 01. Izmir Institute of Technology; 03. Faculty of Engineering
Any city-scale visual localization system has to overcome long-term appearance changes, such as varying illumination conditions or seasonal changes between query and database images. Since semantic content is more robust to such changes, we exploit semantic information to improve visual localization. In our scenario, the database consists of gnomonic views generated from panoramic images (e.g. Google Street View) and query images are collected with a standard field-of-view camera at a different time. To improve localization, we check the semantic similarity between query and database images, which is not trivial since the position and viewpoint of the cameras do not exactly match. To learn similarity, we propose training a CNN in a self-supervised fashion with contrastive learning on a dataset of semantically segmented images. Our experiments show that this semantic similarity estimation approach works better than measuring similarity at the pixel level. Finally, we used the semantic similarity scores to verify the retrievals obtained by a state-of-the-art visual localization method and observed that contrastive learning-based pose verification increases the top-1 recall to 0.90, which corresponds to a 2% improvement.
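As a rough illustration of the contrastive objective mentioned above, the sketch below computes the NT-Xent loss popularised by SimCLR on plain Python lists. Whether the paper uses exactly this loss, this temperature, or this pairing layout is an assumption; in practice the embeddings would come from the CNN applied to two augmented views of each semantically segmented image.

```python
import math


def _cos(a, b):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a > 0 and norm_b > 0 else 0.0


def nt_xent(embeddings, temperature=0.5):
    """Average NT-Xent loss over a batch of 2N embeddings.

    Layout assumption: embeddings[2k] and embeddings[2k + 1] are two views
    of the same image (a positive pair); every other item is a negative.
    """
    n = len(embeddings)
    total = 0.0
    for i in range(n):
        pos = i + 1 if i % 2 == 0 else i - 1
        numerator = math.exp(_cos(embeddings[i], embeddings[pos]) / temperature)
        denominator = sum(
            math.exp(_cos(embeddings[i], embeddings[k]) / temperature)
            for k in range(n) if k != i
        )
        total += -math.log(numerator / denominator)
    return total / n
```

The loss is low when the two views of an image are embedded close together and far from the other images in the batch, which is the property a learned semantic similarity score relies on.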
Localization of Certain Animal Species in Images Via Training Neural Networks With Image Patches
(Izmir Institute of Technology, 2017) Orhan, Semih; Baştanlar, Yalın; 03.04. Department of Computer Engineering; 01.01. Units Affiliated to the Rectorate; 01. Izmir Institute of Technology; 03. Faculty of Engineering
Object detection is one of the most important tasks for computer vision systems. Varying object size, view angle, illumination conditions, occlusion, etc. affect the success rate. In recent years, convolutional neural networks (CNNs) have shown great performance in different problems of computer vision, including object detection and localization. In this work, we propose a novel training approach for CNNs to localize animal species whose bodies have distinctive patterns, such as the speckles of leopards and the black-and-white stripes of zebras. To learn characteristic patterns, small patches are taken from different body parts of the animals and used to train the models. To find the object location in a test image, all locations are visited in a sliding-window fashion. Crops are fed to the CNN, and the classification scores of all patches are recorded. To illustrate the object location, a heat map is generated from the classification scores of the patches. The heat maps are then converted to binary images, which yield bounding-box estimates of the objects. The localization performance of our patch-based training is compared with Faster R-CNN, a state-of-the-art CNN-based object detection and localization algorithm. In evaluating performance, in addition to the standard precision-recall metric, we use area-precision and area-recall, which better represent the potential of the patch-based model. Experimental results show that the proposed training method performs better than Faster R-CNN for most of the evaluated classes. We also show that the patch-based model can be used together with Faster R-CNN to increase its localization performance.
Citation - WoS: 9
Citation - Scopus: 13
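The localization pipeline described above (sliding-window crops, per-patch classification scores, heat map, binarization, bounding box) can be sketched as follows. The patch classifier is passed in as a hypothetical score_patch callable standing in for the trained CNN, and the sketch draws a single box around all above-threshold cells, which is a simplification assumed here rather than the thesis's exact post-processing.

```python
def sliding_window_heatmap(image, patch_size, stride, score_patch):
    """Score every patch of a 2-D image (list of rows) and return a grid of scores.

    score_patch is a hypothetical stand-in for the trained CNN: it takes a
    square crop (list of rows) and returns a classification score.
    """
    rows, cols = len(image), len(image[0])
    heatmap = []
    for top in range(0, rows - patch_size + 1, stride):
        row_scores = []
        for left in range(0, cols - patch_size + 1, stride):
            crop = [r[left:left + patch_size] for r in image[top:top + patch_size]]
            row_scores.append(score_patch(crop))
        heatmap.append(row_scores)
    return heatmap


def heatmap_to_bbox(heatmap, threshold, patch_size, stride):
    """Binarize the heat map and return (top, left, bottom, right) in pixels.

    Returns None when no cell reaches the threshold. Bottom and right are
    exclusive, so the box covers every pixel of every above-threshold patch.
    """
    hits = [
        (r, c)
        for r, row in enumerate(heatmap)
        for c, score in enumerate(row)
        if score >= threshold
    ]
    if not hits:
        return None
    hit_rows = [r for r, _ in hits]
    hit_cols = [c for _, c in hits]
    return (
        min(hit_rows) * stride,
        min(hit_cols) * stride,
        max(hit_rows) * stride + patch_size,
        max(hit_cols) * stride + patch_size,
    )
```

Varying the threshold trades recall for precision, so sweeping it is one plausible way to obtain bounding-box estimates at different confidence levels.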
Training CNNs With Image Patches for Object Localisation
(Institution of Engineering and Technology, 2018) Orhan, Semih; Baştanlar, Yalın; 03.04. Department of Computer Engineering; 01.01. Units Affiliated to the Rectorate; 01. Izmir Institute of Technology; 03. Faculty of Engineering
Recently, convolutional neural networks (CNNs) have shown great performance in different problems of computer vision, including object detection and localisation. A novel training approach is proposed for CNNs to localise animal species whose bodies have distinctive patterns, such as leopards and zebras. To learn characteristic patterns, small patches taken from different body parts of the animals are used to train the models. To find the object location in a test image, all locations are visited in a sliding-window fashion. Crops are fed into the trained CNN and their classification scores are combined into a heat map. The heat maps are then converted to bounding-box estimates for varying confidence scores. The localisation performance of the patch-based training approach is compared with Faster R-CNN, a state-of-the-art CNN-based object detection and localisation method. Experimental results reveal that patch-based training outperforms Faster R-CNN, especially for classes with distinctive patterns.
Citation - WoS: 43
Citation - Scopus: 47
Semantic Segmentation of Outdoor Panoramic Images
(Springer, 2021) Orhan, Semih; Baştanlar, Yalın; 03.04. Department of Computer Engineering; 01.01. Units Affiliated to the Rectorate; 01. Izmir Institute of Technology; 03. Faculty of Engineering
Omnidirectional cameras are capable of providing a 360° field of view in a single shot. This comprehensive view makes them preferable for many computer vision applications. An omnidirectional view is generally represented as a panoramic image with equirectangular projection, which suffers from distortions. Thus, standard camera approaches should be mathematically modified to be used effectively with panoramic images. In this work, we built a semantic segmentation CNN model that handles distortions in panoramic images using equirectangular convolutions. The proposed model, which we call UNet-equiconv, outperforms an equivalent CNN model with standard convolutions. To the best of our knowledge, ours is the first work on the semantic segmentation of real outdoor panoramic images. Experimental results reveal that using a distortion-aware CNN with equirectangular convolution increases the semantic segmentation performance (4% increase in mIoU). We also released a pixel-level annotated outdoor panoramic image dataset which can be used for various computer vision applications such as autonomous driving and visual localization. Source code of the project and the dataset were made available at the project page (https://github.com/semihorhan/semseg-outdoor-pano). © 2021, The Author(s), under exclusive licence to Springer-Verlag London Ltd., part of Springer Nature.
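For readers unfamiliar with the mIoU metric quoted above, here is a minimal sketch of how mean intersection-over-union is commonly computed from flattened per-pixel label lists. Skipping classes that appear in neither the prediction nor the ground truth is one common convention and is an assumption here; the paper's exact evaluation protocol is not stated in the abstract.

```python
def mean_iou(pred, target, num_classes):
    """Mean intersection-over-union over classes for two flat label lists.

    pred and target hold one integer class label per pixel. Classes that are
    absent from both lists are skipped (a convention assumed in this sketch).
    """
    ious = []
    for cls in range(num_classes):
        intersection = sum(1 for p, t in zip(pred, target) if p == cls and t == cls)
        union = sum(1 for p, t in zip(pred, target) if p == cls or t == cls)
        if union > 0:
            ious.append(intersection / union)
    return sum(ious) / len(ious) if ious else 0.0
```

For example, with pred = [0, 0, 1, 1] and target = [0, 1, 1, 1], class 0 has IoU 1/2 and class 1 has IoU 2/3, so the mean is 7/12.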
Semantic Segmentation of Panoramic Images and Panoramic Image Based Outdoor Visual Localization
(01. Izmir Institute of Technology, 2022) Baştanlar, Yalın; Orhan, Semih; 03.04. Department of Computer Engineering; 01.01. Units Affiliated to the Rectorate; 01. Izmir Institute of Technology; 03. Faculty of Engineering
360-degree views are captured by full omnidirectional cameras and are generally represented as panoramic images. Unfortunately, these images suffer heavily from spherical distortion at the poles of the sphere. In previous studies of convolutional neural networks (CNNs), several methods (e.g. equirectangular convolution) have been proposed to alleviate spherical distortion. Inspired by these efforts, we developed an equirectangular version of the UNet model. We evaluated the semantic segmentation performance of the UNet model and its equirectangular version on an outdoor panoramic dataset. Experimental results showed that the equirectangular version of UNet performed better than UNet. In addition, we released the pixel-level annotated dataset, which is one of the first semantic segmentation datasets of outdoor panoramic images. In visual localization, localizing perspective query images in a panoramic image dataset can alleviate the non-overlapping view problem between cameras. Generally, perspective query images are localized in a panoramic image database by generating 4 or 8 virtual gnomonic views from each panorama, which amounts to deforming the sphere into cube faces. Doing so reduces the problem to a perspective-to-perspective search, but there may still be a non-overlapping view problem between query and gnomonic database images. Therefore, we propose directly localizing perspective query images in panoramic images by applying sliding windows on the last convolutional layer of CNNs. Features are extracted with R-MAC, GeM, and SFRS. Experimental results showed that the sliding-window approach outperformed 4 gnomonic views and achieved competitive results compared with 8 and 12 gnomonic views. Any city-scale visual localization system has to be robust against long-term changes. Semantic information (e.g. the surface of a building) is more robust to such changes, and depth maps provide geometric clues. In our work, we utilized semantic and depth information for pose verification, that is, checking semantic and depth similarity to verify the poses (retrievals) obtained with an approach that uses only RGB image features. Semantic and depth information are represented with a self-supervised contrastive learning approach (SimCLR). Experimental results showed that pose verification with semantic and depth features improved the visual localization performance of the RGB-only model.
