Happy Valley

48 bytes removed, 12:46, 9 April 2018
==== Switching to shared memory ====
Low memcpy/compute overlap is related to concurrent kernel execution. In theory, you can copy chunks of the input array to the device asynchronously, one chunk per kernel launch, so that each transfer overlaps with computation on another chunk. However, it seems to be hard to partition the input data in any meaningful way.
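For reference, the chunked asynchronous pattern described above could look roughly like the sketch below. It is not the project's code: the kernel <code>scale</code> and the helper <code>run_chunked_scale</code> are hypothetical, and the sketch assumes an element-wise workload in which every chunk can be processed independently, which is exactly the property that seems hard to get for this input.

```cuda
#include <cuda_runtime.h>
#include <vector>

// Hypothetical element-wise kernel: every element is independent, so the
// input can be split at arbitrary chunk boundaries (an assumption that may
// not hold for the real workload).
__global__ void scale(float *data, int n, float factor) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) data[i] *= factor;
}

// Doubles n floats using one chunk per stream.
// Returns the number of mismatched elements (0 on success), -1 on error.
int run_chunked_scale(int n, int num_streams) {
    if (n <= 0 || num_streams <= 0) return -1;

    float *h = nullptr;
    float *d = nullptr;
    const size_t bytes = static_cast<size_t>(n) * sizeof(float);

    // Pinned host memory is needed for cudaMemcpyAsync to overlap with kernels.
    if (cudaMallocHost((void **)&h, bytes) != cudaSuccess) return -1;
    if (cudaMalloc((void **)&d, bytes) != cudaSuccess) {
        cudaFreeHost(h);
        return -1;
    }
    for (int i = 0; i < n; ++i) h[i] = static_cast<float>(i);

    std::vector<cudaStream_t> streams(num_streams);
    for (auto &s : streams) cudaStreamCreate(&s);

    const int chunk = (n + num_streams - 1) / num_streams;  // ceiling division
    const int threads = 256;
    for (int s = 0; s < num_streams; ++s) {
        const int offset = s * chunk;
        if (offset >= n) break;
        const int count = (n - offset < chunk) ? (n - offset) : chunk;
        const size_t chunk_bytes = static_cast<size_t>(count) * sizeof(float);

        // Copy in, compute, copy out: all queued on the same stream, so they
        // run in order within a chunk but may overlap with other chunks.
        cudaMemcpyAsync(d + offset, h + offset, chunk_bytes,
                        cudaMemcpyHostToDevice, streams[s]);
        scale<<<(count + threads - 1) / threads, threads, 0, streams[s]>>>(
            d + offset, count, 2.0f);
        cudaMemcpyAsync(h + offset, d + offset, chunk_bytes,
                        cudaMemcpyDeviceToHost, streams[s]);
    }
    const cudaError_t err = cudaDeviceSynchronize();

    // Verify on the host: every element should have been doubled.
    int mismatches = 0;
    if (err != cudaSuccess) {
        mismatches = -1;
    } else {
        for (int i = 0; i < n; ++i) {
            if (h[i] != 2.0f * static_cast<float>(i)) ++mismatches;
        }
    }

    for (auto &s : streams) cudaStreamDestroy(s);
    cudaFree(d);
    cudaFreeHost(h);
    return mismatches;
}
```

Calling <code>run_chunked_scale(1 << 20, 4)</code> from <code>main</code> splits the array into four chunks. Whether the copies actually overlap with the kernels depends on the device's copy engines and is best checked in the profiler timeline.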
 
 
 
''' Source Code '''
 
<pre>
code here!
</pre>
==== Switching to x86 from x64 ====