A Review of Learning Rules in Machine Learning - March 8, 2021

In this research meeting, our research intern Alex Cuozzo reviews some notable papers and explains high-level concepts related to learning rules in machine learning. Moving away from backpropagation with gradient descent, he talks about various attempts at biologically plausible learning regimes which avoid the weight transport problem and use only local information at the neuron level. He then moves on to discuss work that infers a learning rule from weight updates, and further work using machine learning to create novel optimizers and local learning rules.
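As one concrete illustration of avoiding the weight transport problem with a local rule (not necessarily one of the exact methods covered in the talk), feedback alignment replaces the transposed forward weights in the backward pass with a fixed random feedback matrix. A minimal numpy sketch, with all sizes and names chosen purely for illustration:

```python
# Sketch of feedback alignment: the backward pass uses a fixed random
# matrix B instead of W2.T, so no weight transport is required.
# Generic illustrative code, not code from the talk.
import numpy as np

rng = np.random.default_rng(0)
n_in, n_hidden, n_out = 20, 64, 5

W1 = rng.normal(scale=0.1, size=(n_in, n_hidden))
W2 = rng.normal(scale=0.1, size=(n_hidden, n_out))
B = rng.normal(scale=0.1, size=(n_out, n_hidden))   # fixed random feedback

def relu(x):
    return np.maximum(x, 0.0)

x = rng.random((32, n_in))      # a random batch of inputs
y = rng.random((32, n_out))     # random regression targets
lr = 0.01

for step in range(200):
    h = relu(x @ W1)            # forward pass
    y_hat = h @ W2
    e = y_hat - y               # output error

    # Backprop would propagate e @ W2.T here; feedback alignment sends the
    # error through the fixed random matrix B instead.
    dh = (e @ B) * (h > 0)

    W2 -= lr * h.T @ e / len(x)
    W1 -= lr * x.T @ dh / len(x)
```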

Papers / Talks mentioned (in order of presentation):


You can evolve even large nets. E.g., a simple mutation-based system like Continuous Gray Code optimization will work.
Why bother when there is backpropagation?
There are two answers: generalization and federated learning.
You get much better generalization because the full data set is used at each training step, not mini-batches as with backpropagation.
If you had 1 million images and 1,000 CPU cores (250 Raspberry Pi 4s, for example), then each core gets 1,000 images and the full neural model. Each core is sent the same short sparse list of mutations to make to the neural model and returns the cost for its part of the training data.
The costs are summed; if there is an improvement, an "accept mutations" message is sent to each core, otherwise a "reject" message.
The hardware cost is less than 2 high-end GPUs and the power consumption is about 5 kW, which is about $1 an hour in most places ($75 in Texas).
Of course it would help to use fast networks like sparse ones or other special kinds.
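A minimal single-process sketch of that accept/reject loop might look like the following. The mutation distribution, the toy "model", and helper names such as make_sparse_mutations and shard_cost are illustrative assumptions, not the actual Continuous Gray Code scheme or any real distributed setup:

```python
import numpy as np

rng = np.random.default_rng(0)

def make_sparse_mutations(n_weights, n_mutations, magnitude=0.01):
    # Pick a few weight indices and give each an exponentially scaled delta.
    # The exponential scaling loosely mimics mixing large and small step
    # sizes; the exact distribution here is an assumption, not the original
    # Continuous Gray Code scheme.
    idx = rng.choice(n_weights, size=n_mutations, replace=False)
    deltas = magnitude * np.exp(-10.0 * rng.random(n_mutations))
    deltas *= rng.choice([-1.0, 1.0], size=n_mutations)
    return idx, deltas

def shard_cost(weights, images, labels):
    # Cost of a stand-in linear "model" on one core's shard of the data.
    preds = images @ weights
    return float(np.mean((preds - labels) ** 2))

# Toy scale: the post describes 1,000,000 images on 1,000 cores; here we
# simulate 100 "cores" with 1,000 images each in a single process.
n_cores, shard_size, n_features = 100, 1000, 32
shards = [(rng.random((shard_size, n_features)), rng.random(shard_size))
          for _ in range(n_cores)]
weights = rng.normal(scale=0.1, size=n_features)
best_cost = sum(shard_cost(weights, x, y) for x, y in shards)

for step in range(200):
    # Broadcast the same short sparse mutation list to every core.
    idx, deltas = make_sparse_mutations(n_features, n_mutations=4)
    weights[idx] += deltas
    # Each core returns the cost for its part of the training data; sum them.
    total_cost = sum(shard_cost(weights, x, y) for x, y in shards)
    if total_cost < best_cost:      # improvement -> "accept mutations"
        best_cost = total_cost
    else:                           # no improvement -> "reject", undo mutations
        weights[idx] -= deltas
```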

What about apical dendrites that receive backward-propagating information? Maybe that is a good signal to tell distal dendrites that now is a good time to form a coincidence detector.

The question of "now" is not clear to me. How narrow a window do we have to learn what is good? My guess is that biology allows an approximate "now".

Hello! We started answering HTM forum questions at the beginning of our weekly research meetings. You can find Alex and Subutai’s response here: https://youtu.be/UbxcyLqLfc0?t=34 (start at 0:34)

Here are some suggested papers that might be helpful:
• Towards Deep Learning with Segregated Dendrites
• Dendritic solutions to the credit assignment problem
