Sparse matrix-vector multiplication (SpMV) is an important operation in computational science and needs be accelerated because it often represents the dominant cost in many widely used iterative methods and eigenvalue problems. We achieve this objective by proposing a novel SpMV algorithm based on the compressed sparse row (CSR) on the GPU. Our method dynamically assigns different numbers of rows to each thread block and executes different optimization implementations on the basis of the number of rows it involves for each block. The process of accesses to the CSR arrays is fully coalesced, and the GPU’s DRAM bandwidth is efficiently utilized by loading data into the shared memory, which alleviates the bottleneck of many existing CSR-based algorithms (i.e., CSR-scalar and CSR-vector). Test results on C2050 and K20c GPUs show that our method outperforms a perfect-CSR algorithm that inspires our work, the vendor tuned CUSPARSE V6.5 and CUSP V0.5.1, and three popular algorithms clSpMV, CSR5, and CSR-Adaptive.
from #AlexandrosSfakianakis via Alexandros G.Sfakianakis on Inoreader http://ift.tt/2dSNoKK
via IFTTT
Εγγραφή σε:
Σχόλια ανάρτησης (Atom)
Δημοφιλείς αναρτήσεις
-
Abstract Background A reported penicillin allergy may compromise receipt of recommended antibiotic prophylaxis intended to prevent surgica...
-
Related Articles Feasibility of Brain Atrophy Measurement in Clinical Routine without Prior Standardization of the MRI Protocol:...
-
Rejuvenation Research , Vol. 0, No. 0. from #AlexandrosSfakianakis via Alexandros G.Sfakianakis on Inoreader http://ift.tt/2EFILxo via I...
-
Letter to the editor of Acta Neurochirurgica: simultaneous pericranial and nasoseptal "double-flap" reconstruction after comb...
-
Abstract The core mission of the Early Stage Professionals in Molecular Imaging Sciences (ESPMIS) Interest Group is to help young scientist...
-
Adenylyl Cyclase-Associated Protein 1 in the Development of Head and Neck Squamous Cell Carcinomas. Bull Exp Biol Med. 2016 Mar 29; A...
-
In view of the performance requirements (e.g., ride comfort, road holding, and suspension space limitation) for vehicle suspension systems, ...
-
Ravikiran N Pawar, Sambhunath Banerjee, Subhajit Bramha, Shekhar Krishnan, Arpita Bhattacharya, Vaskar Saha, Anupam Chakrapani, Saurabh Bhav...
-
Purpose: This phase I study aimed to determine the recommended dose (RD), safety profile, and feasibility of a procedure combining intratum...
Δεν υπάρχουν σχόλια:
Δημοσίευση σχολίου