A Turkish Dataset for Gender Identification of Twitter Users

dc.contributor.author Sezerer, Erhan
dc.contributor.author Polatbilek, Ozan
dc.contributor.author Tekir, Selma
dc.date.accessioned 2025-12-25T21:40:38Z
dc.date.available 2025-12-25T21:40:38Z
dc.date.issued 2019
dc.description.abstract Author profiling is the identification of an author's gender, age, and language from his/her texts. With the increasing trend of using Twitter as a means to express thought, profiling the gender of an author from his/her tweets has become a challenge. Although several datasets in different languages have been released on this problem, there is still a need for multilingualism. In this work, we propose a dataset of tweets of Turkish Twitter users which are labeled with their gender information. The dataset has 3368 users in the training set and 1924 users in the test set where each user has 100 tweets. The dataset is publicly available(1). en_US
dc.identifier.isbn 9781950737383
dc.identifier.uri https://hdl.handle.net/11147/18810
dc.language.iso en en_US
dc.publisher Assoc Computational Linguistics-ACL en_US
dc.relation.ispartof 13th Linguistic Annotation Workshop (LAW) -- Aug 01, 2019 -- Florence, Italy en_US
dc.rights info:eu-repo/semantics/closedAccess en_US
dc.title A Turkish Dataset for Gender Identification of Twitter Users en_US
dc.type Conference Object en_US
dspace.entity.type Publication
gdc.coar.type text::conference output
gdc.description.department İzmir Institute of Technology en_US
gdc.description.departmenttemp [Sezerer, Erhan; Polatbilek, Ozan; Tekir, Selma] Izmir Inst Technol Comp Engn, Izmir, Turkey en_US
gdc.description.endpage 207 en_US
gdc.description.publicationcategory Konferans Öğesi - Uluslararası - Kurum Öğretim Elemanı en_US
gdc.description.scopusquality N/A
gdc.description.startpage 203 en_US
gdc.description.woscitationindex Conference Proceedings Citation Index - Science - Conference Proceedings Citation Index - Social Science &amp- Humanities
gdc.description.wosquality N/A
gdc.identifier.wos WOS:000538533900023
gdc.index.type WoS
gdc.wos.citedcount 10
relation.isAuthorOfPublication.latestForDiscovery 57639474-3954-4f77-a84c-db8a079648a8
relation.isOrgUnitOfPublication.latestForDiscovery 9af2b05f-28ac-4014-8abe-a4dfe192da5e

Files