Efficient Privacy-Preserving Whole-Genome Variant Queries

dc.contributor.author Akgün, Mete
dc.contributor.author Pfeifer, Nico
dc.contributor.author Kohlbacher, Oliver
dc.date.accessioned 2022-07-18T13:00:45Z
dc.date.available 2022-07-18T13:00:45Z
dc.date.issued 2022
dc.description.abstract Motivation: Diagnosis and treatment decisions on genomic data have become widespread as the cost of genome sequencing decreases gradually. In this context, disease-gene association studies are of great importance. However, genomic data are very sensitive when compared to other data types and contains information about individuals and their relatives. Many studies have shown that this information can be obtained from the query-response pairs on genomic databases. In this work, we propose a method that uses secure multi-party computation to query genomic databases in a privacy-protected manner. The proposed solution privately outsources genomic data from arbitrarily many sources to the two non-colluding proxies and allows genomic databases to be safely stored in semi-honest cloud environments. It provides data privacy, query privacy and output privacy by using XOR-based sharing and unlike previous solutions, it allows queries to run efficiently on hundreds of thousands of genomic data. Results: We measure the performance of our solution with parameters similar to real-world applications. It is possible to query a genomic database with 3 000 000 variants with five genomic query predicates under 400 ms. Querying 1 048 576 genomes, each containing 1 000 000 variants, for the presence of five different query variants can be achieved approximately in 6 min with a small amount of dedicated hardware and connectivity. These execution times are in the right range to enable real-world applications in medical research and healthcare. Unlike previous studies, it is possible to query multiple databases with response times fast enough for practical application. To the best of our knowledge, this is the first solution that provides this performance for querying large-scale genomic data. en_US
dc.identifier.doi 10.1093/bioinformatics/btac070
dc.identifier.issn 13674803
dc.identifier.issn 1367-4803
dc.identifier.issn 1367-4811
dc.identifier.scopus 2-s2.0-85128785392
dc.identifier.uri https://doi.org/10.1093/bioinformatics/btac070
dc.identifier.uri https://hdl.handle.net/11147/12166
dc.language.iso en en_US
dc.publisher Oxford University Press en_US
dc.relation.ispartof Bioinformatics en_US
dc.rights info:eu-repo/semantics/openAccess en_US
dc.title Efficient Privacy-Preserving Whole-Genome Variant Queries en_US
dc.type Article en_US
dspace.entity.type Publication
gdc.author.id 0000-0003-4088-2784
gdc.author.institutional Akgün, Mete
gdc.bip.impulseclass C4
gdc.bip.influenceclass C5
gdc.bip.popularityclass C4
gdc.coar.access open access
gdc.coar.type text::journal::journal article
gdc.collaboration.industrial false
gdc.contributor.affiliation 01. Izmir Institute of Technology en_US
gdc.contributor.affiliation University of Tübingen en_US
gdc.contributor.affiliation University of Tübingen en_US
gdc.description.department İzmir Institute of Technology. Computer Engineering en_US
gdc.description.endpage 2210 en_US
gdc.description.issue 8 en_US
gdc.description.publicationcategory Makale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı en_US
gdc.description.scopusquality Q1
gdc.description.startpage 2202 en_US
gdc.description.volume 38 en_US
gdc.description.wosquality Q1
gdc.identifier.openalex W4210463533
gdc.identifier.pmid 35150254
gdc.identifier.wos WOS:000757951900001
gdc.index.type WoS
gdc.index.type Scopus
gdc.index.type PubMed
gdc.oaire.accesstype GOLD
gdc.oaire.diamondjournal false
gdc.oaire.impulse 10.0
gdc.oaire.influence 3.1515457E-9
gdc.oaire.isgreen true
gdc.oaire.keywords Databases, Factual
gdc.oaire.keywords Privacy
gdc.oaire.keywords Humans
gdc.oaire.keywords Genomics
gdc.oaire.keywords Original Papers
gdc.oaire.keywords Computer Security
gdc.oaire.popularity 9.213707E-9
gdc.oaire.publicfunded false
gdc.openalex.collaboration International
gdc.openalex.fwci 2.74118348
gdc.openalex.normalizedpercentile 0.87
gdc.openalex.toppercent TOP 10%
gdc.opencitations.count 7
gdc.plumx.crossrefcites 1
gdc.plumx.mendeley 10
gdc.plumx.pubmedcites 2
gdc.plumx.scopuscites 10
gdc.scopus.citedcount 10
gdc.wos.citedcount 6
relation.isAuthorOfPublication.latestForDiscovery bcaeb78e-77bd-4185-9e94-e507e9aadbe7
relation.isOrgUnitOfPublication.latestForDiscovery 9af2b05f-28ac-4014-8abe-a4dfe192da5e

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Name:
btac070.pdf
Size:
1.51 MB
Format:
Adobe Portable Document Format
Description:
Article

License bundle

Now showing 1 - 1 of 1
Loading...
Name:
license.txt
Size:
3.2 KB
Format:
Item-specific license agreed upon to submission
Description: