Differentiable plasticity: training plastic neural networks with backpropagation

michaelklachko · April 10, 2018, 6:24pm

How can we build agents that keep learning from experience, quickly and efficiently, after their initial training? Here we take inspiration from the main mechanism of learning in biological brains: synaptic plasticity, carefully tuned by evolution to produce efficient lifelong learning. We show that plasticity, just like connection weights, can be optimized by gradient descent in large (millions of parameters) recurrent networks with Hebbian plastic connections. First, recurrent plastic networks with more than two million parameters can be trained to memorize and reconstruct sets of novel, high-dimensional 1000+ pixels natural images not seen during training. Crucially, traditional non-plastic recurrent networks fail to solve this task. Furthermore, trained plastic networks can also solve generic meta-learning tasks such as the Omniglot task, with competitive results and little parameter overhead. Finally, in reinforcement learning settings, plastic networks outperform a non-plastic equivalent in a maze exploration task. We conclude that differentiable plasticity may provide a powerful novel approach to the learning-to-learn problem.

https://arxiv.org/abs/1804.02464

Bitking · April 10, 2018, 8:18pm

May I add that in the biological model the plasticity is modified by the limbic system.

Memory modulation
“The amygdala is also involved in the modulation of memory consolidation. Following any learning event, the long-term memory for the event is not formed instantaneously. Rather, information regarding the event is slowly assimilated into long-term (potentially lifelong) storage over time, possibly via long-term potentiation. Recent studies suggest that the amygdala regulates memory consolidation in other brain regions. Also, fear conditioning, a type of memory that is impaired following amygdala damage, is mediated in part by long-term potentiation”

vpuente · April 11, 2018, 11:48am

That means that LTP/LTD are modified by limbic system?

cogmission · April 11, 2018, 1:53pm

Seems to me we first need to have them have ongoing/constant interaction with novel stimuli. Secondly, we need a universal encoder that can take input from many different kinds of sensors. And if the prowess of an AGI is a function of and who’s sophistication grows with time - we’re all set? (HTMs already have plasticity)

vpuente · April 11, 2018, 2:25pm

Perhaps the input (to the cortex) is not only encoder dependent. Part of the elements that are in the brainstem are mixing sensory input with cortex prediction to produce new cortex inputs. Seems like the proper way to deal not only with noise but to “converge” the new stimuli to the already known (and save valuable resources).

Some times this prediction mixing might affecting to the input in a funny way:

Topic		Replies	Views
Two Papers on Differentiable plasticity Current Research journal-club	4	900	September 23, 2019
Uninformative memories will prevail: the storage of correlated representations and its consequences General Neuroscience	0	376	February 10, 2020
Efficient dendritic learning as an alternative to synaptic plasticity hypothesis Current Research	19	978	May 5, 2022
Adaptive nodes enrich nonlinear cooperative learning beyond traditional adaptation by links General Neuroscience	5	771	May 4, 2018
Brains@Bay Meetup: Hebbian Learning in Neural Networks Talks and Events live , community	6	1317	October 31, 2019

Differentiable plasticity: training plastic neural networks with backpropagation

Related topics