Comparison of Two Association Rule Mining Algorithms Without Candidate Generation

dc.contributor.author Yıldız, Barış
dc.contributor.author Ergenç, Belgin
dc.date.accessioned 2017-01-04T07:19:51Z
dc.date.available 2017-01-04T07:19:51Z
dc.date.issued 2010
dc.description.abstract Association rule mining techniques play an important role in data mining research where the aim is to find interesting correlations among sets of items in databases. Although the Apriori algorithm of association rule mining is the one that boosted data mining research, it has a bottleneck in its candidate generation phase that requires multiple passes over the source data. FP-Growth and Matrix Apriori are two algorithms that overcome that bottleneck by keeping the frequent itemsets in compact data structures, eliminating the need of candidate generation. To our knowledge, there is no work to compare those two similar algorithms focusing on their performances in different phases of execution. In this study, we compare Matrix Apriori and FP-Growth algorithms. Two case studies analyzing the algorithms are carried out phase by phase using two synthetic datasets generated in order i) to see their performance with datasets having different characteristics, ii) to understand the causes of performance differences in different phases. Our findings are i) performances of algorithms are related to the characteristics of the given dataset and threshold value, ii) Matrix Apriori outperforms FP-Growth in total performance for threshold values below 10%, iii) although building matrix data structure has higher cost, finding itemsets is faster. en_US
dc.identifier.citation Yıldız, B., and Ergenç, B. (2010). Comparison of two association rule mining algorithms without candidate generation. Paper presented at the 10th IASTED International Conference on Artificial Intelligence and Applications, AIA 2010; Innsbruck; Austria; 15-17 February (450-457). en_US
dc.identifier.isbn 9780889868182
dc.identifier.scopus 2-s2.0-77954602647
dc.identifier.uri https://hdl.handle.net/11147/2709
dc.language.iso en en_US
dc.publisher ACTA Press en_US
dc.relation.ispartof 10th IASTED International Conference on Artificial Intelligence and Applications, AIA 2010 en_US
dc.rights info:eu-repo/semantics/openAccess en_US
dc.subject Data mining en_US
dc.subject Association rule mining en_US
dc.subject Matrix apriori en_US
dc.subject FP-growth algorithm en_US
dc.title Comparison of Two Association Rule Mining Algorithms Without Candidate Generation en_US
dc.type Conference Object en_US
dspace.entity.type Publication
gdc.author.institutional Yıldız, Barış
gdc.author.institutional Ergenç, Belgin
gdc.author.yokid 130596
gdc.coar.access open access
gdc.coar.type text::conference output
gdc.description.department İzmir Institute of Technology. Computer Engineering en_US
gdc.description.endpage 457 en_US
gdc.description.publicationcategory Konferans Öğesi - Uluslararası - Kurum Öğretim Elemanı en_US
gdc.description.scopusquality N/A
gdc.description.startpage 450 en_US
gdc.description.wosquality N/A
gdc.index.type Scopus
gdc.scopus.citedcount 16
relation.isAuthorOfPublication.latestForDiscovery 3b51d444-157d-4dff-a209-e28543a80dcd
relation.isOrgUnitOfPublication.latestForDiscovery 9af2b05f-28ac-4014-8abe-a4dfe192da5e

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Name:
2709.pdf
Size:
691.53 KB
Format:
Adobe Portable Document Format
Description:
Conference Paper

License bundle

Now showing 1 - 1 of 1
Loading...
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: