GPU621/Intel Advisor

2 bytes added, 18:43, 23 November 2018
Padding
To address this issue, add some padding.
For example, if you have a <code>4 x 19</code> array of floats and your system has 128-bit vector registers, add 1 column to make the array <code>4 x 20</code>. The number of columns is then evenly divisible by 4, the number of floats that can be loaded into a 128-bit vector register.
[[File:Memory-alignment-padding.png]]
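The padding arithmetic above can be sketched in C++ as follows. This is a minimal illustration, not code from the original page: the names <code>paddedCols</code>, <code>sumRow</code>, and the <code>k...</code> constants are hypothetical, and it assumes a 4-byte <code>float</code> and that the padding elements are zero-initialized so they do not affect the result.

```cpp
#include <cassert>
#include <cstddef>

// Assumption: 128-bit vector registers and a 4-byte float, giving 4 floats per register.
constexpr std::size_t kRegisterBits = 128;
constexpr std::size_t kLanes = kRegisterBits / (8 * sizeof(float));

// Round cols up to the next multiple of lanes (hypothetical helper).
constexpr std::size_t paddedCols(std::size_t cols, std::size_t lanes) {
    return (cols + lanes - 1) / lanes * lanes;
}

constexpr std::size_t kRows = 4;
constexpr std::size_t kCols = 19;                               // logical width
constexpr std::size_t kPaddedCols = paddedCols(kCols, kLanes);  // stored width

// Sum one row over the full padded width. Because the padded width is a
// multiple of kLanes, the loop covers a whole number of vector-sized chunks
// with no remainder. The padding must be zero for the sum to be correct.
float sumRow(const float (&a)[kRows][kPaddedCols], std::size_t row) {
    assert(row < kRows);
    float sum = 0.0f;
    for (std::size_t j = 0; j < kPaddedCols; ++j) {
        sum += a[row][j];
    }
    return sum;
}
```

A caller would declare the array with the padded width and a 16-byte alignment, for example <code>alignas(16) float a[kRows][kPaddedCols] = {};</code>, and treat only the first <code>kCols</code> columns as real data.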