Sparse matrix-vector multiplication (SpMV) is an important operation in computational science and needs to be accelerated because it often represents the dominant cost in many widely used iterative methods and eigenvalue problems. We achieve this objective by proposing a novel SpMV algorithm for the GPU based on the compressed sparse row (CSR) format. Our method dynamically assigns a different number of rows to each thread block and, for each block, executes a different optimized implementation depending on the number of rows the block covers. Accesses to the CSR arrays are fully coalesced, and the GPU’s DRAM bandwidth is efficiently utilized by loading data into shared memory, which alleviates the bottleneck of many existing CSR-based algorithms (i.e., CSR-scalar and CSR-vector). Test results on C2050 and K20c GPUs show that our method outperforms the perfect-CSR algorithm that inspired our work, the vendor-tuned CUSPARSE V6.5 and CUSP V0.5.1, and three popular algorithms: clSpMV, CSR5, and CSR-Adaptive.
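To make the idea of CSR storage and variable rows per thread block concrete, here is a minimal CPU-side sketch in Python. It is not the authors' GPU implementation: `csr_spmv` is a plain reference SpMV over the three CSR arrays, and `assign_rows_to_blocks` is a hypothetical greedy rule that groups consecutive rows until a nonzero budget (standing in for a shared-memory capacity, an assumption of this sketch) would be exceeded. The function names and the `nnz_budget` parameter are illustrative only.

```python
from typing import List, Sequence, Tuple


def csr_spmv(row_ptr: Sequence[int], col_idx: Sequence[int],
             values: Sequence[float], x: Sequence[float]) -> List[float]:
    """Reference y = A @ x for a matrix A stored in CSR form."""
    n_rows = len(row_ptr) - 1
    y = [0.0] * n_rows
    for row in range(n_rows):
        acc = 0.0
        # Nonzeros of `row` occupy positions row_ptr[row] .. row_ptr[row + 1] - 1.
        for k in range(row_ptr[row], row_ptr[row + 1]):
            acc += values[k] * x[col_idx[k]]
        y[row] = acc
    return y


def assign_rows_to_blocks(row_ptr: Sequence[int],
                          nnz_budget: int) -> List[Tuple[int, int]]:
    """Group consecutive rows into half-open ranges [start, end).

    Hypothetical rule (an assumption, not taken from the paper): keep adding
    rows while the group's nonzero count stays within nnz_budget. A single
    row that exceeds the budget still gets a group of its own.
    """
    n_rows = len(row_ptr) - 1
    blocks = []
    start = 0
    while start < n_rows:
        end = start + 1  # every group holds at least one row
        while end < n_rows and row_ptr[end + 1] - row_ptr[start] <= nnz_budget:
            end += 1
        blocks.append((start, end))
        start = end
    return blocks


# A = [[1, 0, 2],
#      [0, 3, 0],
#      [4, 5, 6]]
row_ptr = [0, 2, 3, 6]
col_idx = [0, 2, 1, 0, 1, 2]
values = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]

print(csr_spmv(row_ptr, col_idx, values, [1.0, 1.0, 1.0]))  # [3.0, 3.0, 15.0]
print(assign_rows_to_blocks(row_ptr, nnz_budget=3))         # [(0, 2), (2, 3)]
```

In this sketch a group containing many short rows and a group containing one long row would be natural candidates for different kernels, which mirrors the abstract's point that the per-block implementation depends on how many rows the block covers.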