Sparse Networks from Scratch: Faster Training without Losing Performance


I like the overview of sparse neural networks here:
https://youtu.be/H7-p3OWPpEI

And this video that is trying to tell you something:
https://youtu.be/QEWe-aRBUAs

Anyway, a single neuron in a ReLU layer connects forward to n weights in the next layer. In the positive activation state (x >= 0), ReLU is f(x) = x, and the vector pattern defined by those n forward weights is projected onto the next layer with intensity x. In the negative activation state (x < 0), ReLU is f(x) = 0 and nothing is projected onto the next layer.
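
To make that concrete, here is a tiny PyTorch sketch of a single unit's contribution to the next layer under that reading of ReLU (the names and numbers are just mine for illustration):

```python
import torch

# One ReLU unit with n outgoing weights into the next layer.
n = 4
w_forward = torch.randn(n)  # the unit's forward weight vector

def contribution(x):
    """What this unit adds to the next layer's pre-activations."""
    return torch.relu(torch.tensor(x)) * w_forward

print(contribution(2.0))   # x >= 0: the weight pattern, scaled by x
print(contribution(-3.0))  # x < 0: the zero vector, nothing is projected
```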
That doesn’t seem great to me. Perhaps you should keep the positive activation state behavior and change the negative activation state behavior: give the neuron an alternative set of forward-connected weights into the next layer, and when x is negative, project that alternative vector pattern onto the next layer with intensity x. That x is negative doesn’t really matter, since the alternative weights can absorb the sign during training. The activation function is then always f(x) = x, with the caveat that the neuron switches between 2 sets of forward-connected weights in the next layer depending on whether x >= 0 or not. A rough sketch of what I mean is below.
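
This is only a sketch of the idea, not anything from the paper; the layer name and shapes are mine. Each input unit gets two outgoing weight matrices, and its signed value is routed through one or the other depending on its sign, so f(x) = x everywhere:

```python
import torch
import torch.nn as nn

class SignSwitchedLinear(nn.Module):
    """Linear layer where each input unit has two sets of outgoing weights:
    one used when its value is non-negative, the other when it is negative.
    The identity f(x) = x is applied in both cases, so no information is
    discarded the way ReLU discards negative activations."""

    def __init__(self, in_features, out_features):
        super().__init__()
        self.w_pos = nn.Linear(in_features, out_features, bias=False)
        self.w_neg = nn.Linear(in_features, out_features, bias=False)
        self.bias = nn.Parameter(torch.zeros(out_features))

    def forward(self, x):
        pos_mask = (x >= 0).to(x.dtype)  # 1 where x_i >= 0, else 0
        # Each input unit routes its (signed) value through exactly one weight set.
        return self.w_pos(x * pos_mask) + self.w_neg(x * (1 - pos_mask)) + self.bias

# Usage: drop it in where a Linear + ReLU pair would normally go.
layer = SignSwitchedLinear(128, 64)
y = layer(torch.randn(32, 128))  # shape (32, 64)
```

The cost is doubling the forward weights per layer, which is why it may pair naturally with the sparse-training setting in the thread title: the extra weight set could come out of the sparsity budget rather than adding dense parameters.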