Open main menu

CDOT Wiki β

Changes

GPU621/Intel Parallel Studio VTune Amplifier

293 bytes added, 20:48, 8 December 2021
Performance
</source>
====Performance====
As expected, the serail version CPU utilization is considered poor due to the fact that only one thread is used for data utilization and Prefix Scan Algorithm. As can be seen from the Hotspot report the main function took 2.297 under Intel compiler with no optimization.
 
[[File:Serial.png]]
== Sources ==
70
edits