Comparison of Different Algorithms for Exploting the Hidden Trends in Data Sources

dc.contributor.advisor Püskülcü, Halis
dc.contributor.author Özsevim, Emrah
dc.date.accessioned 2014-07-22T13:52:21Z
dc.date.available 2014-07-22T13:52:21Z
dc.date.issued 2003
dc.description Thesis (Master)--Izmir Institute of Technology, Computer Engineering, Izmir, 2003 en_US
dc.description Includes bibliographical references (leaves: 92-97) en_US
dc.description Text in English; Abstract: Turkish and English en_US
dc.description 97 leaves en_US
dc.description.abstract The growth of large-scale transactional databases, time-series databases and other kinds of databases has been giving rise to the development of several efficient algorithms that cope with the computationally expensive task of association rule mining.In this study, different algorithms, Apriori, FP-tree and CHARM, for exploiting the hidden trends such as frequent itemsets, frequent patterns, closed frequent itemsets respectively, were discussed and their performances were evaluated. The perfomances of the algorithms were measured at different support levels, and the algorithms were tested on different data sets (on both synthetic and real data sets). The algorihms were compared according to their, data preparation performances, mining performance, run time performances and knowledge extraction capabilities.The Apriori algorithm is the most prevalent algorithm of association rule mining which makes multiple passes over the database aiming at finding the set of frequent itemsets for each level. The FP-Tree algorithm is a scalable algorithm which finds the crucial information as regards the complete set of prefix paths, conditional pattern bases and frequent patterns by using a compact FP-Tree based mining method. The CHARM is a novel algorithm which brings remarkable improvements over existing association rule mining algorithms by proving the fact that mining the set of closed frequent itemsets is adequate instead of mining the set of all frequent itemsets.Related to our experimental results, we conclude that the Apriori algorithm demonstrates a good performance on sparse data sets. The Fp-tree algorithm extracts less association in comparison to Apriori, however it is completelty a feasable solution that facilitates mining dense data sets at low support levels. On the other hand, the CHARM algorithm is an appropriate algorithm for mining closed frequent itemsets (a substantial portion of frequent itemsets) on both sparse and dense data sets even at low levels of support. en_US
dc.identifier.uri https://hdl.handle.net/11147/3785
dc.language.iso en en_US
dc.publisher Izmir Institute of Technology en_US
dc.rights info:eu-repo/semantics/openAccess en_US
dc.subject.lcc QA76.9.D343 O97 2003 en
dc.subject.lcsh Data mining en
dc.subject.lcsh Association rule mining en
dc.subject.lcsh Algorithms en
dc.title Comparison of Different Algorithms for Exploting the Hidden Trends in Data Sources en_US
dc.type Master Thesis en_US
dspace.entity.type Publication
gdc.author.institutional Özsevim, Emrah
gdc.coar.access open access
gdc.coar.type text::thesis::master thesis
gdc.description.department Thesis (Master)--İzmir Institute of Technology, Computer Engineering en_US
gdc.description.publicationcategory Tez en_US
gdc.description.scopusquality N/A
gdc.description.wosquality N/A
relation.isAuthorOfPublication.latestForDiscovery f3844554-c555-4f40-8a31-c2b1f5f2d3e6
relation.isOrgUnitOfPublication.latestForDiscovery 9af2b05f-28ac-4014-8abe-a4dfe192da5e

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Name:
T000258.pdf
Size:
1.21 MB
Format:
Adobe Portable Document Format
Description:
MasterThesis

License bundle

Now showing 1 - 1 of 1
Loading...
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: