Changes

Jump to: navigation, search

GPU610/gpuchill

4 bytes removed, 12:12, 4 March 2019
Results
===== Results =====
You need many billions of points and maybe even trillions to reach a high precision for the final result but using just 2 billion dots causes the program to take over 30 seconds to run. The most intensive part of the program is the loop which is what loops 2 billion times in my run of the program while profiling, which can all be parallelized. We can determine from the profiling that 100% of the time executing the program is spent in the loop but of course that is not possible so we will go with 99.9999%, using a GTX 1080 as an example GPU which has 20 SMX and each having 2048 threads, and using Amdahl's Law we can expect a speedup of 29058976.094x191x
=== Assignment 2 ===
=== Assignment 3 ===
33
edits

Navigation menu