A Turkish Dataset for Gender Identification of Twitter Users
| dc.contributor.author | Sezerer, Erhan | |
| dc.contributor.author | Polatbilek, Ozan | |
| dc.contributor.author | Tekir, Selma | |
| dc.date.accessioned | 2025-12-25T21:40:38Z | |
| dc.date.available | 2025-12-25T21:40:38Z | |
| dc.date.issued | 2019 | |
| dc.description.abstract | Author profiling is the identification of an author's gender, age, and language from his/her texts. With the increasing trend of using Twitter as a means to express thought, profiling the gender of an author from his/her tweets has become a challenge. Although several datasets in different languages have been released on this problem, there is still a need for multilingualism. In this work, we propose a dataset of tweets of Turkish Twitter users which are labeled with their gender information. The dataset has 3368 users in the training set and 1924 users in the test set where each user has 100 tweets. The dataset is publicly available. | en_US |
| dc.identifier.isbn | 9781950737383 | |
| dc.identifier.uri | https://hdl.handle.net/11147/18810 | |
| dc.language.iso | en | en_US |
| dc.publisher | Assoc Computational Linguistics-ACL | en_US |
| dc.relation.ispartof | 13th Linguistic Annotation Workshop (LAW) -- Aug 01, 2019 -- Florence, Italy | en_US |
| dc.rights | info:eu-repo/semantics/closedAccess | en_US |
| dc.title | A Turkish Dataset for Gender Identification of Twitter Users | en_US |
| dc.type | Conference Object | en_US |
| dspace.entity.type | Publication | |
| gdc.coar.type | text::conference output | |
| gdc.description.department | İzmir Institute of Technology | en_US |
| gdc.description.departmenttemp | [Sezerer, Erhan; Polatbilek, Ozan; Tekir, Selma] Izmir Inst Technol Comp Engn, Izmir, Turkey | en_US |
| gdc.description.endpage | 207 | en_US |
| gdc.description.publicationcategory | Conference Item - International - Institutional Faculty Member | en_US |
| gdc.description.scopusquality | N/A | |
| gdc.description.startpage | 203 | en_US |
| gdc.description.woscitationindex | Conference Proceedings Citation Index - Science; Conference Proceedings Citation Index - Social Science & Humanities | |
| gdc.description.wosquality | N/A | |
| gdc.identifier.wos | WOS:000538533900023 | |
| gdc.index.type | WoS | |
| gdc.wos.citedcount | 10 | |
| relation.isAuthorOfPublication.latestForDiscovery | 57639474-3954-4f77-a84c-db8a079648a8 | |
| relation.isOrgUnitOfPublication.latestForDiscovery | 9af2b05f-28ac-4014-8abe-a4dfe192da5e |
