A Semantic Search Engine for Turkish and English Research Resources
| dc.contributor.author | Karabacak, O. | |
| dc.contributor.author | Inan, E. | |
| dc.date.accessioned | 2025-12-25T21:39:44Z | |
| dc.date.available | 2025-12-25T21:39:44Z | |
| dc.date.issued | 2025 | |
| dc.description.abstract | Research resources are growing in volume at an exponential rate across disciplines and languages. This exponential increase has created a pressing need for intelligent search systems that can help researchers efficiently access relevant academic material. To overcome this issue, this study introduces a bilingual semantic search engine designed to retrieve academic articles written in both Turkish and English. The primary goal is to improve the accuracy and relevance of academic information retrieval by using modern Natural Language Processing techniques. Instead of relying on traditional keyword-based search methods, the system leverages transformer-based sentence embedding models. To capture semantic meaning more effectively, MiniLM-L6v2, paraphrase-multilingual-MiniLM-L12-v2 and multilingual-e5-base models were chosen for their multilingual capabilities and sentence-level embedding performance. To assess the quality of search results, Mean Average Precision (MAP) and Normalized Discounted Cumulative Gain (nDCG) were used. These metrics were calculated for each model across both language groups. Evaluation results show that the multilingual-e5-base model consistently outperformed the other models in both MAP and nDCG scores, demonstrating superior semantic understanding and multilingual alignment. The system also features a simple and responsive Streamlit-based interface that allows for real-time querying and result display. © 2025 IEEE. | en_US |
| dc.identifier.doi | 10.1109/ASYU67174.2025.11208360 | |
| dc.identifier.isbn | 9798331597276 | |
| dc.identifier.scopus | 2-s2.0-105022498489 | |
| dc.identifier.uri | https://doi.org/10.1109/ASYU67174.2025.11208360 | |
| dc.language.iso | en | en_US |
| dc.publisher | Institute of Electrical and Electronics Engineers Inc. | en_US |
| dc.relation.ispartof | -- 2025 Innovations in Intelligent Systems and Applications Conference, ASYU 2025 -- 2025-09-10 through 2025-09-12 -- Bursa -- 214381 | en_US |
| dc.rights | info:eu-repo/semantics/closedAccess | en_US |
| dc.subject | Cross-Lingual | en_US |
| dc.subject | Natural Language Processing | en_US |
| dc.subject | Semantic Search | en_US |
| dc.subject | Sentence Transformers | en_US |
| dc.subject | Turkish | en_US |
| dc.title | A Semantic Search Engine for Turkish and English Research Resources | en_US |
| dc.type | Conference Object | en_US |
| dspace.entity.type | Publication | |
| gdc.author.scopusid | 60203418300 | |
| gdc.author.scopusid | 55623306000 | |
| gdc.coar.type | text::conference output | |
| gdc.collaboration.industrial | false | |
| gdc.description.department | İzmir Institute of Technology | en_US |
| gdc.description.departmenttemp | [Karabacak] Omer, Department of Computer Engineering, Izmir Yüksek Teknoloji Enstitüsü, Izmir, Turkey; [Inan] Emrah, Department of Computer Engineering, Izmir Yüksek Teknoloji Enstitüsü, Izmir, Turkey | en_US |
| gdc.description.publicationcategory | Konferans Öğesi - Uluslararası - Kurum Öğretim Elemanı | en_US |
| gdc.description.scopusquality | N/A | |
| gdc.description.wosquality | N/A | |
| gdc.identifier.openalex | W4415709508 | |
| gdc.index.type | Scopus | |
| gdc.openalex.collaboration | National | |
| gdc.opencitations.count | 0 | |
| gdc.plumx.mendeley | 2 | |
| gdc.plumx.scopuscites | 0 | |
| relation.isAuthorOfPublication.latestForDiscovery | 8d120978-d9da-42e0-8cb3-17b3fe8e3af1 | |
| relation.isOrgUnitOfPublication.latestForDiscovery | 9af2b05f-28ac-4014-8abe-a4dfe192da5e |
