Changes

Jump to: navigation, search

GPU610/Turing

234 bytes added, 12:58, 13 December 2015
Assignment 3
[[Image:ColinCampbellGPU610A3G1.png|600px| ]]
I then attempted to modify the code to use shared memory. Unfortunately the way the algorithm accesses rows and columns out of order made this not viable. I tried to convert the problem to use tiling to get around this but was not able to make it work correctly. Because of this I was not able to implement any more optimizations as most were based around using shared memory efficiently.

Navigation menu