THQuAD: Turkish Historic Question Answering Dataset for Reading Comprehension

Question answering (QA) is a field of natural language processing and information retrieval that aims to answer questions posed in natural language. In this paper, we present THQuAD, a Turkish question answering dataset, together with baseline results obtained with contextualized word embeddings. THQuAD consists of two datasets: TQuad, on Turkish Islamic science history, from the Teknofest 2018 "Artificial Intelligence Competition", and a second dataset on Ottoman history, which we prepared within the scope of the Teknofest 2020 "Doğal Dil İşleme Yarışması" (Natural Language Processing Competition). THQuAD is a reading comprehension dataset consisting of questions, answers, and passages. Our objective is to answer a specific question by understanding the passage and extracting the answer from it. We generate contextualized word embeddings from pre-trained Turkish BERT, ELECTRA, and ALBERT language models after fine-tuning them with neural networks under different hyperparameter settings. © 2021 IEEE
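The extractive setup described in the abstract, where the answer is a span of the passage, can be illustrated with a minimal sketch. The `QAExample` class, its field names, and the sample record below are assumptions made for illustration only; they are not taken from THQuAD and do not reflect its actual file format.

```python
from dataclasses import dataclass
from typing import Optional, Tuple


@dataclass(frozen=True)
class QAExample:
    """Hypothetical reading comprehension record: passage, question, answer."""
    passage: str
    question: str
    answer: str

    def answer_span(self) -> Optional[Tuple[int, int]]:
        """Return (start, end) character offsets of the first occurrence
        of the answer in the passage, or None if it does not occur."""
        start = self.passage.find(self.answer)
        if start == -1:
            return None
        return start, start + len(self.answer)


# Made-up record for illustration; not an entry from the dataset.
example = QAExample(
    passage="Istanbul was conquered in 1453 by Sultan Mehmed II.",
    question="Who conquered Istanbul?",
    answer="Sultan Mehmed II",
)

span = example.answer_span()
if span is not None:
    start, end = span
    # Slicing the passage with the offsets recovers the answer text.
    print(example.passage[start:end])
```

In an extractive setting, a model is typically trained to predict such start and end positions rather than to generate free text, which is why the answer must appear verbatim in the passage.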

Keywords

Contextualized word embeddings, Deep learning, Information retrieval, Natural language understanding, Question answering

Fields of Science

0202 electrical engineering, electronic engineering, information engineering, 02 engineering and technology

WoS Q

N/A

Scopus Q

N/A

OpenCitations Citation Count

9

Source

Proceedings - 6th International Conference on Computer Science and Engineering, UBMK 2021 -- 15 September 2021 through 17 September 2021 -- Ankara -- 176826

Start Page

215

End Page

220

URI

https://doi.org/10.1109/UBMK52708.2021.9559013
https://hdl.handle.net/11147/14785

Collections

Scopus İndeksli Yayınlar Koleksiyonu / Scopus Indexed Publications Collection

PlumX Metrics

Citations

Scopus : 19

Captures

Mendeley Readers : 17

SCOPUS™ Citations

19

checked on Jun 12, 2026

Page Views

75

checked on Jun 12, 2026

OpenAlex FWCI

1.47143648

Sustainable Development Goals

4
