A-Team
=====Initial Profile=====
[[File:neuralnet_chart.jpg]]
 
After the initial profile it is obvious that the dot product function consumes 97.94% of our run time. The transpose function accounts for a further 1.45%, which seems minor; however, transpose is also called repeatedly during back propagation, as the excerpt below shows (sketches of both hot functions follow it):
 
// Back propagation
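// Note: operator- and the element-wise operator* on vector<float> used here
// are overloads assumed to be defined elsewhere in the project.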
vector<float> dyhat = (yhat - b_y);
// dW3 = a2.T * dyhat
vector<float> dW3 = dot(transpose( &a2[0], BATCH_SIZE, 64 ), dyhat, 64, BATCH_SIZE, 10);
// dz2 = dyhat * W3.T * relu'(a2)
vector<float> dz2 = dot(dyhat, transpose( &W3[0], 64, 10 ), BATCH_SIZE, 10, 64) * reluPrime(a2);
// dW2 = a1.T * dz2
vector<float> dW2 = dot(transpose( &a1[0], BATCH_SIZE, 128 ), dz2, 128, BATCH_SIZE, 64);
// dz1 = dz2 * W2.T * relu'(a1)
vector<float> dz1 = dot(dz2, transpose( &W2[0], 128, 64 ), BATCH_SIZE, 64, 128) * reluPrime(a1);
// dW1 = X.T * dz1
vector<float> dW1 = dot(transpose( &b_X[0], BATCH_SIZE, 784 ), dz1, 784, BATCH_SIZE, 128);
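For context, the two functions at the top of the profile are plain loop-nest implementations over flat, row-major vector<float> matrices. The sketch below is an assumption reconstructed from the call sites above (argument order dot(m1, m2, m1_rows, m1_columns, m2_columns) and transpose(m, rows, cols)), not necessarily the project's exact code:

#include <vector>
using std::vector;

// dot: multiply m1 (m1_rows x m1_columns) by m2 (m1_columns x m2_columns),
// both stored row-major in flat vectors. This O(n^3) triple loop is the
// 97.94% hot spot in the profile.
vector<float> dot(const vector<float>& m1, const vector<float>& m2,
                  const int m1_rows, const int m1_columns, const int m2_columns) {
    vector<float> output(m1_rows * m2_columns, 0.0f);
    for (int row = 0; row < m1_rows; ++row) {
        for (int col = 0; col < m2_columns; ++col) {
            float sum = 0.0f;
            for (int k = 0; k < m1_columns; ++k) {
                sum += m1[row * m1_columns + k] * m2[k * m2_columns + col];
            }
            output[row * m2_columns + col] = sum;
        }
    }
    return output;
}

// transpose: return the cols x rows transpose of the rows x cols matrix m.
vector<float> transpose(const float* m, const int rows, const int cols) {
    vector<float> output(rows * cols);
    for (int r = 0; r < rows; ++r) {
        for (int c = 0; c < cols; ++c) {
            output[c * rows + r] = m[r * cols + c];
        }
    }
    return output;
}

Since every forward pass and every back-propagation step above funnels through dot, it is expected that its inner loop dominates the profile, and it is the obvious target for optimization.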
=====Ray Tracing=====