WoS İndeksli Yayınlar Koleksiyonu / WoS Indexed Publications Collection

Permanent URI for this collectionhttps://hdl.handle.net/11147/7150

Browse

Search Results

Now showing 1 - 2 of 2
  • Conference Object
    Citation - WoS: 2
    Word2vec Kullanarak Eş Anlamlılık Temelinde Anahtar Kelime Çıkarımı
    (IEEE, 2019) Oğul, İskender Ülgen; Oğul, İskender Ülgen; Özcan, Caner; Hakdağlı, Özlem; 01. Izmir Institute of Technology
    Nowadays, the data revealed by the online individuals are increasing exponentially. The raw information that increasing data holds, transformed into meaningful outputs using machine learning and deep learning methods. Generally, supervised learning methods are used for information extraction and classification. Supervised learning is based on the training set that classification algorithms are trained. In the proposed approach, keyword extraction solution is proposed to classify text data more convenient. The developed solution is based on the Word2Vec algorithm, which works by taking into consideration the semantic meaning of the words unlike general approaches that based on word frequency. A new approach, word embedding algorithm named Word2Vec, works by calculating the word weights, semantic relationship, and the final weights of vectors. The obtained keywords are trained with Name Bayes and Decision Trees methods and the performance of the proposed method is shown by classification example.
  • Conference Object
    Citation - WoS: 2
    Stream Text Data Analysis on Twitter Using Apache Spark Streaming
    (Institute of Electrical and Electronics Engineers Inc., 2018) Hakdağlı, Özlem; Oğul, İskender Ülgen; Özcan, Caner; Oğul, İskender Ülgen; 01. Izmir Institute of Technology
    With today's developing technology, people's access to information and its production have reached a very fast level. These generated and obtained information are instantly created, entered into data systems and updated. Sources of streaming data can be transformed into valuable analysis results when they are handled with targeted methods. In this study, a text data field is determined to perform analysis on instantaneous generated data and Twitter, the richest platform for instant text data, is used. Twitter instantly generates a variety of data in large quantities and it presents it as open source using an API. A machine learning framework Apache Spark's stream analysis environment is used to analyze these resources. Situation analysis was performed using Support Vector Machine, Decision Trees and Logistic Regression algorithms presented under this environment. The results are presented in tables.