Sparse matrix-vector multiplication (SpMV) is an important operation in computational science and needs be accelerated because it often represents the dominant cost in many widely used iterative methods and eigenvalue problems. We achieve this objective by proposing a novel SpMV algorithm based on the compressed sparse row (CSR) on the GPU. Our method dynamically assigns different numbers of rows to each thread block and executes different optimization implementations on the basis of the number of rows it involves for each block. The process of accesses to the CSR arrays is fully coalesced, and the GPU’s DRAM bandwidth is efficiently utilized by loading data into the shared memory, which alleviates the bottleneck of many existing CSR-based algorithms (i.e., CSR-scalar and CSR-vector). Test results on C2050 and K20c GPUs show that our method outperforms a perfect-CSR algorithm that inspires our work, the vendor tuned CUSPARSE V6.5 and CUSP V0.5.1, and three popular algorithms clSpMV, CSR5, and CSR-Adaptive.
from #AlexandrosSfakianakis via Alexandros G.Sfakianakis on Inoreader http://ift.tt/2dSNoKK
via IFTTT
Εγγραφή σε:
Σχόλια ανάρτησης (Atom)
Δημοφιλείς αναρτήσεις
-
Abstract Bromodomain proteins function as epigenetic readers that recognize acetylated histone tails to facilitate the transcription of t...
-
Objectives To optimise medical students’ early clerkship is a complex task since it is conducted in a context primarily organised to take ca...
-
Abstract Purpose Overcoming the flaws of current data management conditions in head and neck oncology could enable integrated informatio...
-
C. Julian Chen'Correspondence information about the author C. Julian ChenEmail the author C. Julian Chen, Donald A. Miller DOI: https://...
-
ACS Nano DOI: 10.1021/acsnano.7b00032 from #AlexandrosSfakianakis via Alexandros G.Sfakianakis on Inoreader http://ift.tt/2lNPpuk via...
-
Abstract Polychlorinated biphenyls (PCBs), a group of 209 congeners that differ in the number and position of chlorines on the biphenyl rin...
-
1 abqls-210rm.html Read the latest Journal of Clinical Neurophysiology - Vol. 37, No. 1, January 2020.eml 2 agx3v-nxz96.html Read the late...
Δεν υπάρχουν σχόλια:
Δημοσίευση σχολίου