Changes

Savy Cat

224 bytes added, 17:17, 10 April 2018


==== Unsigned Char vs. Float ====

The first real improvement came from changing PX_TYPE from float back to unsigned char, as used in the serial version. An unsigned char is sufficient for every .jpg colour value (0 to 255). GPUs are designed to perform operations on floating-point numbers; however, we are not performing any calculations outside of the indexing, and the performance of the kernel was the same for float or unsigned char. We copy the source image to the device once and back to the host 12 times, so the size of each pixel value directly affects transfer time.

{| class="wikitable"
! Image !! Unsigned char !! Float
|-
| Tiny_Shay.jpg || 1.93 KB || 7.73 KB
|-
| Medium_Shay.jpg || 5.71 MB || 22.8 MB
|-
| Large_Shay.jpg || 22.8 MB || 91.4 MB
|-
| Huge_Shay.jpg || 91.4 MB || 365 MB
|}
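
The fourfold difference in the table follows from the element size alone. The C++ sketch below is illustrative only: <code>bufferBytes</code> and <code>totalTransferBytes</code> are hypothetical helper names, and the three-channel layout and the assumption that every copy moves a buffer of the same size are not taken from the project code.

```cpp
#include <cstddef>

// Hypothetical helper: bytes needed for one image buffer whose elements
// are of type PxType (the role PX_TYPE plays in the text above).
// The default of 3 channels (RGB) is an assumption, not a project value.
template <typename PxType>
constexpr std::size_t bufferBytes(std::size_t width, std::size_t height,
                                  std::size_t channels = 3) {
    return width * height * channels * sizeof(PxType);
}

// Hypothetical helper: total bytes moved between host and device for
// one host-to-device copy plus `copiesBack` device-to-host copies
// (12 in the text above), assuming every copy has the same size.
template <typename PxType>
constexpr std::size_t totalTransferBytes(std::size_t width, std::size_t height,
                                         std::size_t copiesBack = 12,
                                         std::size_t channels = 3) {
    return (1 + copiesBack) * bufferBytes<PxType>(width, height, channels);
}

// unsigned char is one byte by definition, so the size ratio between the
// two buffers is exactly sizeof(float), which is 4 on common platforms.
static_assert(sizeof(unsigned char) == 1, "unsigned char is always one byte");
```

For a hypothetical 1000 x 1000 image, <code>bufferBytes<unsigned char>(1000, 1000)</code> is 3,000,000 bytes, and the float buffer is <code>sizeof(float)</code> times larger. Because the buffer crosses the bus 13 times in total, the saving per buffer is multiplied accordingly.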

This saves almost one second of latency for the largest file, bringing the cudaMemcpy time down to about the same as the kernel execution time:

[[File:Summary-5.png]]

Ermarinz

93

edits
