Quantitative Performance Analysis of Blas Libraries on Gpu Architectures
Loading...
Date
Authors
Öz, Işıl
Journal Title
Journal ISSN
Volume Title
Publisher
Open Access Color
GOLD
Green Open Access
No
OpenAIRE Downloads
OpenAIRE Views
Publicly Funded
No
Abstract
Basic Linear Algebra Subprograms (BLAS) are a set of linear algebra routines commonly used by machine learning applications and scientific computing. BLAS libraries with optimized implementations of BLAS routines offer high performance by exploiting parallel execution units in target computing systems. With massively large number of cores, graphics processing units (GPUs) exhibit high performance for computationally-heavy workloads. Recent BLAS libraries utilize parallel cores of GPU architectures efficiently by employing inherent data parallelism. In this study, we analyze GPU-targeted functions from two BLAS libraries, cuBLAS and MAGMA, and evaluate their performance on a single-GPU NVIDIA architecture by considering architectural features and limitations. We collect architectural performance metrics and explore resource utilization characteristics. Our work aims to help researchers and programmers to understand the performance behavior and GPU resource utilization of the BLAS routines implemented by the libraries.
Description
Keywords
Engineering, Temel lineer cebir alt programları;Grafik işlemci birimleri;Performans analizi, Mühendislik, Basic linear algebra subprograms;Graphics processing units;Performance analysis
Fields of Science
Citation
WoS Q
Scopus Q

OpenCitations Citation Count
1
Volume
26
Issue
76
Start Page
40
End Page
48
PlumX Metrics
Captures
Mendeley Readers : 1
Page Views
115
checked on May 03, 2026
Google Scholar™


