Predicting the Soft Error Vulnerability of Gpgpu Applications
Loading...
Date
2022
Authors
Topçu, Burak
Öz, Işıl
Journal Title
Journal ISSN
Volume Title
Publisher
Institute of Electrical and Electronics Engineers Inc.
Open Access Color
Green Open Access
No
OpenAIRE Downloads
OpenAIRE Views
Publicly Funded
No
Abstract
As Graphics Processing Units (GPUs) have evolved to deliver performance increases for general-purpose computations as well as graphics and multimedia applications, soft error reliability becomes an important concern. The soft error vulnerability of the applications is evaluated via fault injection experiments. Since performing fault injection takes impractical times to cover the fault locations in complex GPU hardware structures, prediction-based techniques have been proposed to evaluate the soft error vulnerability of General-Purpose GPU (GPGPU) programs based on the hardware performance characteristics.In this work, we propose ML-based prediction models for the soft error vulnerability evaluation of GPGPU programs. We consider both program characteristics and hardware performance metrics collected from either the simulation or the profiling tools. While we utilize regression models for the prediction of the masked fault rates, we build classification models to specify the vulnerability level of the programs based on their silent data corruption (SDC) and crash rates. Our prediction models achieve maximum prediction accuracy rates of 96.6%, 82.6%, and 87% for masked fault rates, SDCs, and crashes, respectively.
Description
This work was supported by the Scientific and Technological Research Council of Turkey (TÜBİTAK), Grant No: 119E011.
Keywords
Computer graphics, Computer graphics equipment, Soft error, Radiation hardening
Fields of Science
Citation
WoS Q
N/A
Scopus Q
N/A

OpenCitations Citation Count
N/A
Source
Volume
Issue
Start Page
108
End Page
115
SCOPUS™ Citations
2
checked on Apr 27, 2026
Web of Science™ Citations
1
checked on Apr 27, 2026
Page Views
1049
checked on Apr 27, 2026
Downloads
510
checked on Apr 27, 2026
Google Scholar™


