Itemset Hiding Under Multiple Sensitive Support Thresholds

dc.contributor.author Öztürk, Ahmet Cumhur
dc.contributor.author Ergenç Bostanoğlu, Belgin
dc.coverage.doi 10.5220/0006501502220231
dc.date.accessioned 2020-07-18T03:35:21Z
dc.date.available 2020-07-18T03:35:21Z
dc.date.issued 2017
dc.description Institute for Systems and Technologies of Information, Control and Communication (INSTICC) en_US
dc.description 9th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management, IC3K 2017 -- 1 November 2017 through 3 November 2017 en_US
dc.description.abstract Itemset mining is the challenging step of association rule mining that aims to extract patterns among items from transactional databases. In the case of applying itemset mining on the shared data of organizations, each party needs to hide its sensitive knowledge before extracting global knowledge for mutual benefit. Ensuring the privacy of the sensitive itemsets is not the only challenge in the itemset hiding process, also the distortion given to the non-sensitive knowledge and data should be kept at minimum. Most of the previous works related to itemset hiding allow database owner to assign unique sensitive threshold for each sensitive itemset however itemsets may have different count and utility. In this paper we propose a new heuristic based hiding algorithm which 1) allows database owner to assign multiple sensitive threshold values for sensitive itemsets, 2) hides all user defined sensitive itemsets, 3) uses heuristics that minimizes loss of information and distortion on the shared database. In order to speed up hiding steps we represent the database as Pseudo Graph and perform scan operations on this data structure rather than the actual database. Performance evaluation of our algorithm Pseudo Graph Based Sanitization (PGBS) is conducted on 4 real databases. Distortion given to the nonsensitive itemsets (information loss), distortion given to the shared data (distance) and execution time in comparison to three similar algorithms is measured. Experimental results show that PGBS is competitive in terms of execution time and distortion and achieves reasonable performance in terms of information loss amongst the other algorithms. © 2017 by SCITEPRESS - Science and Technology Publications, Lda. All rights reserved. en_US
dc.identifier.doi 10.5220/0006501502220231 en_US
dc.identifier.doi 10.5220/0006501502220231
dc.identifier.isbn 9789897582738
dc.identifier.scopus 2-s2.0-85055515458
dc.identifier.uri https://doi.org/10.5220/0006501502220231
dc.identifier.uri https://hdl.handle.net/11147/7901
dc.language.iso en en_US
dc.publisher SCITEPRESS en_US
dc.relation.ispartof IC3K 2017 - Proceedings of the 9th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management en_US
dc.rights info:eu-repo/semantics/openAccess en_US
dc.subject Itemset hiding en_US
dc.subject Multiple sensitive support thresholds en_US
dc.subject Privacy preserving association rule mining en_US
dc.title Itemset Hiding Under Multiple Sensitive Support Thresholds en_US
dc.type Conference Object en_US
dspace.entity.type Publication
gdc.author.institutional Öztürk, Ahmet Cumhur
gdc.author.institutional Ergenç Bostanoğlu, Belgin
gdc.bip.impulseclass C5
gdc.bip.influenceclass C5
gdc.bip.popularityclass C5
gdc.coar.access open access
gdc.coar.type text::conference output
gdc.collaboration.industrial false
gdc.description.department İzmir Institute of Technology. Computer Engineering en_US
gdc.description.endpage 231 en_US
gdc.description.publicationcategory Konferans Öğesi - Uluslararası - Kurum Öğretim Elemanı en_US
gdc.description.startpage 222 en_US
gdc.description.volume 3 en_US
gdc.identifier.openalex W2768222566
gdc.index.type Scopus
gdc.oaire.accesstype HYBRID
gdc.oaire.diamondjournal false
gdc.oaire.impulse 2.0
gdc.oaire.influence 2.7587528E-9
gdc.oaire.isgreen true
gdc.oaire.keywords Multiple sensitive support thresholds
gdc.oaire.keywords Itemset hiding
gdc.oaire.keywords Privacy preserving association rule mining
gdc.oaire.popularity 2.1227202E-9
gdc.oaire.publicfunded false
gdc.oaire.sciencefields 0202 electrical engineering, electronic engineering, information engineering
gdc.oaire.sciencefields 02 engineering and technology
gdc.openalex.collaboration National
gdc.openalex.fwci 0.91671855
gdc.openalex.normalizedpercentile 0.8
gdc.opencitations.count 3
gdc.plumx.mendeley 7
gdc.plumx.scopuscites 4
gdc.scopus.citedcount 4
relation.isAuthorOfPublication.latestForDiscovery 3b51d444-157d-4dff-a209-e28543a80dcd
relation.isOrgUnitOfPublication.latestForDiscovery 9af2b05f-28ac-4014-8abe-a4dfe192da5e

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Name:
3b4a2c2fb3db806e7d6fc8a8c7a056eaa056.pdf
Size:
388.5 KB
Format:
Adobe Portable Document Format