Sparse matrix-vector multiplication (SpMV) is an important operation in computational science and needs be accelerated because it often represents the dominant cost in many widely used iterative methods and eigenvalue problems. We achieve this objective by proposing a novel SpMV algorithm based on the compressed sparse row (CSR) on the GPU. Our method dynamically assigns different numbers of rows to each thread block and executes different optimization implementations on the basis of the number of rows it involves for each block. The process of accesses to the CSR arrays is fully coalesced, and the GPU’s DRAM bandwidth is efficiently utilized by loading data into the shared memory, which alleviates the bottleneck of many existing CSR-based algorithms (i.e., CSR-scalar and CSR-vector). Test results on C2050 and K20c GPUs show that our method outperforms a perfect-CSR algorithm that inspires our work, the vendor tuned CUSPARSE V6.5 and CUSP V0.5.1, and three popular algorithms clSpMV, CSR5, and CSR-Adaptive.
from #AlexandrosSfakianakis via Alexandros G.Sfakianakis on Inoreader http://ift.tt/2dSNoKK
via IFTTT
Εγγραφή σε:
Σχόλια ανάρτησης (Atom)
Δημοφιλείς αναρτήσεις
-
Abstract Kenaf is a multipurpose crop, but a lack of genetic information hinders genetic and molecular research. In this study, we aimed t...
-
Spindle cell/pleomorphic lipoma is an uncommonly encountered benign neoplasm that is usually found in the subcutaneous tissues. Rare cases r...
-
As demonstrated by the market reactions to downgrades of various sovereign credit ratings in 2011, the credit rating agencies occupy an impo...
-
Lichtenstein intervention is currently the classic model of the regulated treatment of inguinal hernias by direct local approach. This “tens...
-
HPV vaccine now funded for boys Scoop.co.nz “Most countries who have to date introduced HPV vaccine have focused on the cervical canc...
-
Multi-organ segmentation of the head and neck area: an efficient hierarchical neural networks approach Abstract Purpose In radiation therapy...
-
ORIGINAL ARTICLES Cyclooxygenase-2 and estrogen receptor-β as possible therapeutic targets in desmoid tumors p. 47 Rasha A Khairy DOI :10....
-
Related Articles New alkylresorcinol metabolites in spot urine as biomarkers of whole grain wheat and rye intake in a Swedish middle-a...
-
2016-09-29T05-30-58Z Source: Journal of Applied Pharmaceutical Science Sadhana Nittur Holla, Meena Kumari Kamal Kishore, Mohan Babu Amber...
Δεν υπάρχουν σχόλια:
Δημοσίευση σχολίου