A Semantic Search Engine for Turkish and English Research Resources

dc.contributor.author Karabacak, O.
dc.contributor.author Inan, E.
dc.date.accessioned 2025-12-25T21:39:44Z
dc.date.available 2025-12-25T21:39:44Z
dc.date.issued 2025
dc.description.abstract Research resources are growing in volume at an exponential rate across disciplines and languages. This exponential increase has created a pressing need for intelligent search systems that can help researchers efficiently access relevant academic material. To overcome this issue, this study introduces a bilingual semantic search engine designed to retrieve academic articles written in both Turkish and English. The primary goal is to improve the accuracy and relevance of academic information retrieval by using modern Natural Language Processing techniques. Instead of relying on traditional keyword-based search methods, the system leverages transformer-based sentence embedding models. To capture semantic meaning more effectively, MiniLM-L6v2, paraphrase-multilingual-MiniLM-L12-v2 and multilingual-e5-base models were chosen for their multilingual capabilities and sentence-level embedding performance. To assess the quality of search results, Mean Average Precision (MAP) and Normalized Discounted Cumulative Gain (nDCG) were used. These metrics were calculated for each model across both language groups. Evaluation results show that the multilingual-e5-base model consistently outperformed the other models in both MAP and nDCG scores, demonstrating superior semantic understanding and multilingual alignment. The system also features a simple and responsive Streamlit-based interface that allows for real-time querying and result display. © 2025 IEEE. en_US
dc.identifier.doi 10.1109/ASYU67174.2025.11208360
dc.identifier.isbn 9798331597276
dc.identifier.scopus 2-s2.0-105022498489
dc.identifier.uri https://doi.org/10.1109/ASYU67174.2025.11208360
dc.language.iso en en_US
dc.publisher Institute of Electrical and Electronics Engineers Inc. en_US
dc.relation.ispartof -- 2025 Innovations in Intelligent Systems and Applications Conference, ASYU 2025 -- 2025-09-10 through 2025-09-12 -- Bursa -- 214381 en_US
dc.rights info:eu-repo/semantics/closedAccess en_US
dc.subject Cross-Lingual en_US
dc.subject Natural Language Processing en_US
dc.subject Semantic Search en_US
dc.subject Sentence Transformers en_US
dc.subject Turkish en_US
dc.title A Semantic Search Engine for Turkish and English Research Resources en_US
dc.type Conference Object en_US
dspace.entity.type Publication
gdc.author.scopusid 60203418300
gdc.author.scopusid 55623306000
gdc.coar.type text::conference output
gdc.collaboration.industrial false
gdc.description.department İzmir Institute of Technology en_US
gdc.description.departmenttemp [Karabacak] Omer, Department of Computer Engineering, Izmir Yüksek Teknoloji Enstitüsü, Izmir, Turkey; [Inan] Emrah, Department of Computer Engineering, Izmir Yüksek Teknoloji Enstitüsü, Izmir, Turkey en_US
gdc.description.publicationcategory Konferans Öğesi - Uluslararası - Kurum Öğretim Elemanı en_US
gdc.description.scopusquality N/A
gdc.description.wosquality N/A
gdc.identifier.openalex W4415709508
gdc.index.type Scopus
gdc.openalex.collaboration National
gdc.opencitations.count 0
gdc.plumx.mendeley 2
gdc.plumx.scopuscites 0
relation.isAuthorOfPublication.latestForDiscovery 8d120978-d9da-42e0-8cb3-17b3fe8e3af1
relation.isOrgUnitOfPublication.latestForDiscovery 9af2b05f-28ac-4014-8abe-a4dfe192da5e

Files