P. 1
Numerical Methods Implementation on CUDA

Numerical Methods Implementation on CUDA

|Views: 996|Likes:
Published by Devendra Sharma

More info:

Published by: Devendra Sharma on Mar 30, 2012
Copyright:Attribution Non-commercial

Availability:

Read on Scribd mobile: iPhone, iPad and Android.
download as PDF, TXT or read online from Scribd
See more
See less

01/16/2013

pdf

text

original

1. The use of shared memory to perform consecutive reads,which reduces the

time that would have been spent in performing the same reads and write

using global memory.

2. The code is generalised to run on very large number of values.

3. Better load balance

4. Repeated communications of a same value are avoided

5. Use of three kernel functions to increase the extent of parallelization at the

same time continuosly using shared memory.

32

Chapter 6 Implementation Of Parallel Quicksort By Regular Sampling

Algorithm On CUDA

You're Reading a Free Preview

Download
scribd
/*********** DO NOT ALTER ANYTHING BELOW THIS LINE ! ************/ var s_code=s.t();if(s_code)document.write(s_code)//-->