Lattice Boltzmann Method (LBM) is a powerful numerical simulation method of the fluid flow. With its data parallel nature, it is a promising candidate for a parallel implementation on a GPU. The LBM, however, is heavily data intensive and memory bound. In particular, moving the data to the adjacent cells in the streaming computation phase incurs a lot of uncoalesced accesses on the GPU which affects the overall performance. Furthermore, the main computation kernels of the LBM use a large number of registers per thread which limits the thread parallelism available at the run time due to the fixed number of registers on the GPU. In this paper, we develop high performance parallelization of the LBM on a GPU by minimizing the overheads associated with the uncoalesced memory accesses while improving the cache locality using the tiling optimization with the data layout change. Furthermore, we aggressively reduce the register uses for the LBM kernels in order to increase the run-time thread parallelism. Experimental results on the Nvidia Tesla K20 GPU show that our approach delivers impressive throughput performance: 1210.63 Million Lattice Updates Per Second (MLUPS).
from #AlexandrosSfakianakis via Alexandros G.Sfakianakis on Inoreader http://ift.tt/2iZGIcv
via IFTTT
Εγγραφή σε:
Σχόλια ανάρτησης (Atom)
Δημοφιλείς αναρτήσεις
-
from #AlexandrosSfakianakis via Alexandros G.Sfakianakis on Inoreader http://ift.tt/2o7K1Dm via IFTTT
-
AP ® United States Government and Politics 2014 Free-Response Questions © 2014 The College Board. College Board, Advanced Placement Program,...
-
You know the feeling: you're hanging out somewhere, you look across the room, and suddenly your stomach drops. You start to sweat. Your ...
-
Unit 5: Writing cohesively - Section index. This unit looks at the use of language strategies to create clear, cohesive writing. It shows yo...
-
Introduction Crisis management is a critical organizational function. Failure can result in serious harm to stakeholders, losses for an orga...
-
9781421620831 1421620839 Coral Reef 2008 Square Wall - Wall 9780160782732 0160782732 Code of Federal Regulations, Title 27, Alcohol, Tobacco...
-
918 quotes have been tagged as self-confidence: Edgar Allan Poe: ‘I have great faith in fools - self-confidence my friends will call it.’, R...
-
Abstract This randomized and longitudinal in vivo study aimed to assess different protocols for the treatment of dentin hypersensitivity w...
-
About IRF. The Incentive Research Foundation (IRF), a private not-for-profit foundation, funds research studies and develops products servin...
Δεν υπάρχουν σχόλια:
Δημοσίευση σχολίου