Sparse matrix-vector multiplication (SpMV) is an important operation in computational science and needs be accelerated because it often represents the dominant cost in many widely used iterative methods and eigenvalue problems. We achieve this objective by proposing a novel SpMV algorithm based on the compressed sparse row (CSR) on the GPU. Our method dynamically assigns different numbers of rows to each thread block and executes different optimization implementations on the basis of the number of rows it involves for each block. The process of accesses to the CSR arrays is fully coalesced, and the GPU’s DRAM bandwidth is efficiently utilized by loading data into the shared memory, which alleviates the bottleneck of many existing CSR-based algorithms (i.e., CSR-scalar and CSR-vector). Test results on C2050 and K20c GPUs show that our method outperforms a perfect-CSR algorithm that inspires our work, the vendor tuned CUSPARSE V6.5 and CUSP V0.5.1, and three popular algorithms clSpMV, CSR5, and CSR-Adaptive.
from #AlexandrosSfakianakis via Alexandros G.Sfakianakis on Inoreader http://ift.tt/2dSNoKK
via IFTTT
Εγγραφή σε:
Σχόλια ανάρτησης (Atom)
Δημοφιλείς αναρτήσεις
-
Abstract Purpose To test the effects of 4 weeks of unilateral low-load resistance training (LLRT), with and without blood flow restricti...
-
36 new pubmed citations were retrieved for your search. Click on the search hyperlink below to display the complete search results: quality...
-
The genital mucosa is a barrier that is constantly exposed to a variety of pathogens, allergens, and external stimuli. Although both allerge...
-
by Mark A. Valasek, Irene Thung, Esha Gollapalle, Alexey A. Hodkoff, Kaitlyn J. Kelly, Joel M. Baumgartner, Vera Vavinskaya, Grace Y. Lin, A...
-
The receptor tyrosine kinase KIT is an established oncogenic driver of tumor growth in certain tumor types, including gastrointestinal strom...
-
The main idea behind this work was demonstrated in a form of a new thermoelectrochromic sensor on a flexible substrate using graphene as an ...
-
Abstract There are limited published data on the burden of rare cancers in the United States. By using data from the North American Associ...
-
from #AlexandrosSfakianakis via Alexandros G.Sfakianakis on Inoreader http://ift.tt/2f9YA71 via IFTTT
-
Relativistic hydrodynamics has been quite successful in explaining the collective behaviour of the QCD matter produced in high energy heavy-...
Δεν υπάρχουν σχόλια:
Δημοσίευση σχολίου