Weight initialization - impact on layer distribution
This post covers some experiments that demonstrate how weight initialization affects the distribution of activations in each layer of a neural network, especially in the very last layers.
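To make the idea concrete before the experiments, here is a minimal sketch of this kind of measurement. It is not the post's actual code: the tanh activation, the layer width, the depth, and the three weight scales are all assumptions chosen for illustration, and `activation_stds` is a hypothetical helper name. It pushes random inputs through a stack of fully connected layers and records the standard deviation of the activations after each one.

```python
import numpy as np

def activation_stds(init_std_fn, n_layers=10, width=500, n_samples=1000, seed=0):
    """Return the std of the activations after each layer of a tanh MLP.

    init_std_fn maps fan_in to the std of the zero-mean Gaussian used to
    draw the weights. The architecture here is illustrative only.
    """
    rng = np.random.default_rng(seed)
    x = rng.standard_normal((n_samples, width))  # unit-variance random inputs
    stds = []
    for _ in range(n_layers):
        W = rng.standard_normal((width, width)) * init_std_fn(width)
        x = np.tanh(x @ W)                       # no bias term, for simplicity
        stds.append(float(x.std()))
    return stds

# Three illustrative weight scales (assumed, not taken from the post):
small = activation_stds(lambda fan_in: 0.01)                    # too small
scaled = activation_stds(lambda fan_in: 1.0 / np.sqrt(fan_in))  # Xavier/Glorot-style
large = activation_stds(lambda fan_in: 1.0)                     # too large

for name, stds in [("small", small), ("scaled", scaled), ("large", large)]:
    print(name, " ".join(f"{s:.3g}" for s in stds))
```

Under these assumptions, the small scale makes the activations shrink toward zero layer by layer, the large scale pushes tanh into saturation near ±1, and the fan-in-scaled variant keeps the spread in a usable range even in the last layers. Plotting a histogram of `x` per layer instead of just its standard deviation would show the full distributions.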