Proof of concept Basa Ganglia

dmac · November 30, 2017, 3:14pm

Hello,

I created, tested, and analysed a proof of concept for the Basal Ganglia. The model does not include any sort of motor control so it is not useful but I think it’s an interesting stepping stone. This work is based on the work of (Sungar, 2017). The biggest difference between their model and mine is that mine models the Globus Palidus in detail.

Link:
https://github.com/ctrl-z-9000-times/HTM_experiments/blob/master/bg_writeup.pdf

Thank you for reading,
Dmac

rhyolight · November 30, 2017, 4:55pm

I have some simple questions. I don’t know much about the Basal Ganglia or reinforcement learning, so bear with me.

Why is it important to have two pathways (stiatums)?
Is the reward some function of how well the HTM is performing?
Are you using resets in the TM between sequences during training?

dmac · November 30, 2017, 6:11pm

This model would probably work just as well if it had a single pathway through the Striatum because the cortex in this model is very small.
My hypothesis is that the Striatum does not represent the same information as the cortex does. The Striatum only represents cortical information which is relevant to the basal ganglia. The purpose of the Striatum is to reduce the number of neurons which the Globus Palidus (G.P.) must consider without losing information. The D1 and D2 pathways through the Striatum specialize in representing positive and negative events (respectively). Not included in this model is a third type of Striatum Neuron which contains both D1 and D2 receptors, and I hypothesize that these neurons participate in both of the Striatum pathways.
The reward is what the basal ganglia is trying to predict, and the expected value is the basal ganglia’s prediction of the sum of the rewards it will receive in the near future. This model has no control over the reward. The TD error is a function of how well the basal ganglia predicts the rewards.
I do not use resets in the TM. Instead I show the TM 2-4 random inputs between sequences which I think has the effect of preventing predictions from persisting between sequences. Also the order that the sequences are show is random so any predictions from one sequence to the next should be meaningless and should not occur often.

Bitking · December 4, 2017, 6:49am

There are some odd interpretation of the two channels; things like go/no-go and such.
Once you place the channels in a larger framework of motor control most of this falls away and it starts to look like the true purpose is something more mundane like extend & retract.
Please see this paper, particularly section 3, with attention to the bit about posture - a mouse balancing against external forces.
Henry H. Yin, How Basal Ganglia Outputs Generate Behavior

I hope this will give you some background to make an informed evaluation of this proposed model.

rhyolight · December 6, 2017, 5:45pm

Then where to the rewards come from?

Topic		Replies	Views
Proposing a Model for the Basal Ganglia and Reinforcement Learning in HTM Tangential Theories theory , basal-ganglia , reinforcement	16	2895	August 12, 2017
Is there a Basal Ganglia theory equivalent to HTM? Numenta Theory	15	2640	December 5, 2017
HTM and Negative Reinforcement Tangential Theories	17	2355	November 13, 2016
Exploring Reinforcement Learning in HTM Tangential Theories	19	4404	July 4, 2018
Reinforcement Learning at the Synapse Level Tangential Theories	5	1505	August 10, 2017

Proof of concept Basa Ganglia

Related topics