A-Team

62 bytes removed, 00:26, 8 April 2019
Initial implementation
During assignment 2, we tried a simple kernel in the shape of a dot product. It achieved nothing special: as predicted at the end of assignment 1, repeatedly calling cudaMalloc and cudaMemcpy severely hurt run time.
====Initial implementation====
[[File:kernel_ms1_call.jpg]]
//version 1 dot product
__global__ void kdot(const float* d_a, const float* d_b, float* d_p, int ni, int nj, int nk) {
}
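The kernel body is not shown above. As a rough illustration only, here is one way a kernel with this signature could be filled in, together with a host-side wrapper that allocates device memory once and reuses it, which avoids the repeated cudaMalloc cost mentioned earlier. Everything beyond the `kdot` signature is an assumption: the row-major layout (`a` is ni x nk, `b` is nk x nj, `p` is ni x nj), the 16 x 16 block size, and the hypothetical `DotBuffers` helper are not taken from our source code.

```cuda
#include <cuda_runtime.h>

// Hypothetical body: one thread computes one element of p = a * b.
// Assumes row-major storage: a is ni x nk, b is nk x nj, p is ni x nj.
__global__ void kdot(const float* d_a, const float* d_b, float* d_p,
                     int ni, int nj, int nk) {
    int i = blockIdx.y * blockDim.y + threadIdx.y;  // row of p
    int j = blockIdx.x * blockDim.x + threadIdx.x;  // column of p
    if (i < ni && j < nj) {
        float sum = 0.0f;
        for (int k = 0; k < nk; ++k)
            sum += d_a[i * nk + k] * d_b[k * nj + j];
        d_p[i * nj + j] = sum;
    }
}

// Hypothetical helper: device buffers are allocated once in the
// constructor and reused by every call to multiply().
struct DotBuffers {
    float* d_a = nullptr;
    float* d_b = nullptr;
    float* d_p = nullptr;
    int ni, nj, nk;

    DotBuffers(int ni_, int nj_, int nk_) : ni(ni_), nj(nj_), nk(nk_) {
        cudaMalloc(&d_a, sizeof(float) * ni * nk);
        cudaMalloc(&d_b, sizeof(float) * nk * nj);
        cudaMalloc(&d_p, sizeof(float) * ni * nj);
    }
    DotBuffers(const DotBuffers&) = delete;             // buffers are owned,
    DotBuffers& operator=(const DotBuffers&) = delete;  // so forbid copies
    ~DotBuffers() {
        cudaFree(d_a);
        cudaFree(d_b);
        cudaFree(d_p);
    }

    // Copies the inputs to the device, launches kdot, copies the result back.
    // Returns true if no CUDA error was reported.
    bool multiply(const float* h_a, const float* h_b, float* h_p) {
        cudaMemcpy(d_a, h_a, sizeof(float) * ni * nk, cudaMemcpyHostToDevice);
        cudaMemcpy(d_b, h_b, sizeof(float) * nk * nj, cudaMemcpyHostToDevice);
        dim3 block(16, 16);
        dim3 grid((nj + block.x - 1) / block.x, (ni + block.y - 1) / block.y);
        kdot<<<grid, block>>>(d_a, d_b, d_p, ni, nj, nk);
        if (cudaGetLastError() != cudaSuccess) return false;
        // This blocking copy also waits for the kernel to finish.
        return cudaMemcpy(h_p, d_p, sizeof(float) * ni * nj,
                          cudaMemcpyDeviceToHost) == cudaSuccess;
    }
};
```

A caller would construct one `DotBuffers` per matrix shape before the training loop and call `multiply` inside it, so the allocation cost is paid once rather than on every product. The host-to-device copies still happen on every call; removing those as well would require keeping the data resident on the device between kernels.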
[[Source Code ms3 neural net]]
This is the final iteration; we outline the takeaways below.