Sparse matrix-vector multiplication (SpMV) is an important operation in computational science, and it needs to be accelerated because it is often the dominant cost in widely used iterative methods and eigenvalue problems. We address this by proposing a novel SpMV algorithm for the GPU based on the compressed sparse row (CSR) format. Our method dynamically assigns a different number of rows to each thread block and executes a different optimized implementation for each block, depending on the number of rows it covers. Accesses to the CSR arrays are fully coalesced, and the GPU’s DRAM bandwidth is used efficiently by loading data into shared memory, which alleviates the bottleneck of many existing CSR-based algorithms (e.g., CSR-scalar and CSR-vector). Test results on C2050 and K20c GPUs show that our method outperforms the perfect-CSR algorithm that inspired our work, the vendor-tuned CUSPARSE V6.5 and CUSP V0.5.1, and three popular algorithms: clSpMV, CSR5, and CSR-Adaptive.
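To make the CSR layout and the idea of giving each thread block a variable number of rows concrete, here is a minimal CPU-side sketch in Python. It is not the authors' GPU implementation. The abstract does not state the actual partitioning rule, so `assign_rows_to_blocks` and its `nnz_budget` parameter (a stand-in for how many nonzeros a block could stage in shared memory) are hypothetical and only illustrate the kind of grouping described.

```python
def csr_spmv(row_ptr, col_idx, values, x):
    """Reference y = A @ x for a matrix A stored in CSR form.

    row_ptr[i]..row_ptr[i+1] delimits the nonzeros of row i inside
    col_idx (column indices) and values (nonzero entries).
    """
    n_rows = len(row_ptr) - 1
    y = [0.0] * n_rows
    for row in range(n_rows):
        acc = 0.0
        for k in range(row_ptr[row], row_ptr[row + 1]):
            acc += values[k] * x[col_idx[k]]
        y[row] = acc
    return y


def assign_rows_to_blocks(row_ptr, nnz_budget):
    """Hypothetical greedy grouping of consecutive rows into blocks.

    Rows are packed into a block while the block's total nonzero count
    stays within nnz_budget (an assumed per-block capacity). A row that
    exceeds the budget on its own still gets a block to itself.
    Returns a list of (first_row, last_row_exclusive) pairs.
    """
    n_rows = len(row_ptr) - 1
    blocks = []
    start = 0
    while start < n_rows:
        end = start + 1  # every block takes at least one row
        while end < n_rows and row_ptr[end + 1] - row_ptr[start] <= nnz_budget:
            end += 1
        blocks.append((start, end))
        start = end
    return blocks


# Example matrix:
# [[1, 0, 2, 0],
#  [0, 3, 0, 0],
#  [4, 5, 6, 7],
#  [0, 0, 0, 8]]
row_ptr = [0, 2, 3, 7, 8]
col_idx = [0, 2, 1, 0, 1, 2, 3, 3]
values = [1, 2, 3, 4, 5, 6, 7, 8]

y = csr_spmv(row_ptr, col_idx, values, [1, 1, 1, 1])
blocks = assign_rows_to_blocks(row_ptr, nnz_budget=4)
```

With this budget the two short rows share a block, while the four-nonzero row and the final row each get their own. This mirrors the situation the abstract describes, in which a block covering many short rows and a block covering one long row would run different optimized code paths.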