Locality-Aware Task Scheduling for Homogeneous Parallel Computing Systems

dc.contributor.author Bhatti, Muhammad Khurram
dc.contributor.author Öz, Işıl
dc.contributor.author Amin, Sarah
dc.contributor.author Mushtaq, Maria
dc.contributor.author Farooq, Umer
dc.contributor.author Popov, Konstantin
dc.contributor.author Brorsson, Mats
dc.coverage.doi 10.1007/s00607-017-0581-6
dc.date.accessioned 2020-01-15T07:04:53Z
dc.date.available 2020-01-15T07:04:53Z
dc.date.issued 2018
dc.description.abstract In systems with complex many-core cache hierarchy, exploiting data locality can significantly reduce execution time and energy consumption of parallel applications. Locality can be exploited at various hardware and software layers. For instance, by implementing private and shared caches in a multi-level fashion, recent hardware designs are already optimised for locality. However, this would all be useless if the software scheduling does not cast the execution in a manner that promotes locality available in the programs themselves. Since programs for parallel systems consist of tasks executed simultaneously, task scheduling becomes crucial for the performance in multi-level cache architectures. This paper presents a heuristic algorithm for homogeneous multi-core systems called locality-aware task scheduling (LeTS). The LeTS heuristic is a work-conserving algorithm that takes into account both locality and load balancing in order to reduce the execution time of target applications. The working principle of LeTS is based on two distinctive phases, namely; working task group formation phase (WTG-FP) and working task group ordering phase (WTG-OP). The WTG-FP forms groups of tasks in order to capture data reuse across tasks while the WTG-OP determines an optimal order of execution for task groups that minimizes the reuse distance of shared data between tasks. We have performed experiments using randomly generated task graphs by varying three major performance parameters, namely: (1) communication to computation ratio (CCR) between 0.1 and 1.0, (2) application size, i.e., task graphs comprising of 50-, 100-, and 300-tasks per graph, and (3) number of cores with 2-, 4-, 8-, and 16-cores execution scenarios. We have also performed experiments using selected real-world applications. The LeTS heuristic reduces overall execution time of applications by exploiting inter-task data locality. Results show that LeTS outperforms state-of-the-art algorithms in amortizing inter-task communication cost. en_US
dc.identifier.citation Bhatti, M. K., Öz, I., Amin, S., Mushtaq, M., Farooq, U., Popov, K., and Brorsson, M. (2018). Locality-aware task scheduling for homogeneous parallel computing systems. Computing, 100(6), 557-595. doi:10.1007/s00607-017-0581-6 en_US
dc.identifier.doi 10.1007/s00607-017-0581-6 en_US
dc.identifier.doi 10.1007/s00607-017-0581-6
dc.identifier.issn 0010-485X
dc.identifier.issn 1436-5057
dc.identifier.scopus 2-s2.0-85032798462
dc.identifier.uri https://doi.org/10.1007/s00607-017-0581-6
dc.identifier.uri https://hdl.handle.net/11147/7581
dc.language.iso en en_US
dc.publisher Springer Verlag en_US
dc.relation.ispartof Computing en_US
dc.rights info:eu-repo/semantics/openAccess en_US
dc.subject Directed acyclic graph en_US
dc.subject Embedded systems en_US
dc.subject Homogeneous systems en_US
dc.subject Multicore scheduling en_US
dc.subject Parallel computing en_US
dc.title Locality-Aware Task Scheduling for Homogeneous Parallel Computing Systems en_US
dc.type Article en_US
dspace.entity.type Publication
gdc.author.id 0000-0002-8310-1143
gdc.author.id 0000-0002-8310-1143 en_US
gdc.author.institutional Öz, Işıl
gdc.bip.impulseclass C4
gdc.bip.influenceclass C5
gdc.bip.popularityclass C4
gdc.coar.access open access
gdc.coar.type text::journal::journal article
gdc.collaboration.industrial false
gdc.description.department İzmir Institute of Technology. Computer Engineering en_US
gdc.description.endpage 595 en_US
gdc.description.issue 6 en_US
gdc.description.publicationcategory Makale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı en_US
gdc.description.scopusquality Q1
gdc.description.startpage 557 en_US
gdc.description.volume 100 en_US
gdc.description.wosquality Q2
gdc.identifier.openalex W2765463157
gdc.identifier.wos WOS:000432601500001
gdc.index.type WoS
gdc.index.type Scopus
gdc.oaire.diamondjournal false
gdc.oaire.impulse 5.0
gdc.oaire.influence 3.0598264E-9
gdc.oaire.isgreen true
gdc.oaire.keywords Parallel computing
gdc.oaire.keywords Homogeneous systems
gdc.oaire.keywords Multicore scheduling
gdc.oaire.keywords Embedded systems
gdc.oaire.keywords [INFO.INFO-ES]Computer Science [cs]/Embedded Systems
gdc.oaire.keywords Directed acyclic graph
gdc.oaire.popularity 5.1422937E-9
gdc.oaire.publicfunded false
gdc.oaire.sciencefields 0202 electrical engineering, electronic engineering, information engineering
gdc.oaire.sciencefields 02 engineering and technology
gdc.openalex.collaboration International
gdc.openalex.fwci 1.65101077
gdc.openalex.normalizedpercentile 0.85
gdc.opencitations.count 8
gdc.plumx.crossrefcites 8
gdc.plumx.mendeley 12
gdc.plumx.scopuscites 11
gdc.scopus.citedcount 11
gdc.wos.citedcount 7
relation.isAuthorOfPublication.latestForDiscovery e0de33d0-b187-47e9-bae7-9b17aaabeb67
relation.isOrgUnitOfPublication.latestForDiscovery 9af2b05f-28ac-4014-8abe-a4dfe192da5e

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Name:
7581.pdf
Size:
8.93 MB
Format:
Adobe Portable Document Format
Description:
Makale (Article)

License bundle

Now showing 1 - 1 of 1
Loading...
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: