Is it possible for a spatial pooler to learn a Markov process?

hwangtamu · April 12, 2017, 4:27pm

I’m running some experiments with HTM for reinforcement learning, where the Markov decision process (MDP) is a fundamental type of task.

My answer to the question is yes, because spatial poolers are able to represent mappings from the current state to an action, or from the current state to the next state. I have managed to solve some of the simplest MDP tasks with a method similar to swarming: finding a winner out of a swarm. But this method doesn’t change the connections in the spatial poolers much, so it lacks a sense of “learning”.

But how can a spatial pooler learn a Markov process efficiently? I know the spatial pooler’s learning rule is based on Hebbian learning, which is unsupervised. Is it possible to add some kind of guidance to the learning? My current idea is to see whether evolutionary algorithms will work.

ycui · April 12, 2017, 5:17pm

@hwangtamu I think you need to use temporal memory, not the spatial pooler, to learn a Markov process. The spatial pooler converts binary input patterns to SDRs. Temporal memory takes SDRs as input and predicts future inputs. Unlike a first-order Markov model, temporal memory can maintain long-term sequence context and learn very high-order Markov sequences.
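To make the first-order versus high-order distinction concrete, here is a minimal sketch. It uses plain n-gram counting, not temporal memory (which does not work with a fixed order), and the function names `train` and `predict` are made up for this illustration. The two toy sequences share the middle part "BC", so a model that only sees the current symbol cannot tell what follows "C".

```python
from collections import Counter, defaultdict

def train(sequences, order):
    """Count next-symbol frequencies given the previous `order` symbols."""
    table = defaultdict(Counter)
    for seq in sequences:
        for i in range(order, len(seq)):
            context = tuple(seq[i - order:i])
            table[context][seq[i]] += 1
    return table

def predict(table, context):
    """Return the set of symbols that were seen after `context`."""
    return set(table.get(tuple(context), {}))

# Two sequences with a shared subsequence "BC".
sequences = ["ABCD", "XBCY"]

first_order = train(sequences, order=1)   # context = current symbol only
third_order = train(sequences, order=3)   # context = last three symbols

print(predict(first_order, "C"))     # ambiguous: both D and Y follow C
print(predict(third_order, "ABC"))   # context from the start resolves it
print(predict(third_order, "XBC"))
```

With only the current symbol as context, "C" predicts both endings; with the longer context, each sequence gets its own prediction. That longer context is what a sequence memory has to carry.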

I suggest taking a look at our papers on the spatial pooler and temporal memory to learn more.

hwangtamu · April 12, 2017, 7:26pm

@ycui Thanks!

I’ve read some of the papers. Temporal memory may help to evaluate the current state, but Markovian tasks do not require temporal memory. I’ll see what I can get from the temporal pooler.

ycui · April 12, 2017, 8:41pm

Hmm, I don’t understand your comment “Markovian tasks do not require temporal memory”. Maybe I am missing something here. How could you use a spatial pooler to represent a mapping between the current state and a future state?

scott · April 12, 2017, 9:16pm

You could put the SDRClassifier on top of the SP to get first-order predictions, which are sufficient for the problem. The classifier also gives multiple predictions with different likelihoods.

In this case, though, it is the classifier that is solving the prediction problem, not the SP.
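As a rough illustration of "multiple predictions with different likelihoods", here is a small single-layer softmax classifier over SDR bits. It is a simplified stand-in written for this thread, not NuPIC's SDRClassifier API: the names `likelihoods` and `learn`, the sizes, the learning rate, and the fixed random SDRs standing in for SP output are all assumptions.

```python
import math
import random

N_BITS, N_ACTIVE, N_STATES = 64, 8, 3
rng = random.Random(0)

# Assumption: one fixed random SDR per state stands in for real SP output.
sdr = {s: rng.sample(range(N_BITS), N_ACTIVE) for s in range(N_STATES)}

# One weight per (input bit, next-state bucket), trained online.
weights = [[0.0] * N_STATES for _ in range(N_BITS)]

def likelihoods(active_bits):
    """Softmax over the summed weights of the active bits."""
    scores = [sum(weights[b][k] for b in active_bits) for k in range(N_STATES)]
    top = max(scores)
    exps = [math.exp(x - top) for x in scores]
    total = sum(exps)
    return [e / total for e in exps]

def learn(active_bits, next_state, alpha=0.002):
    """Move the weights of the active bits toward the observed next state."""
    probs = likelihoods(active_bits)
    for k in range(N_STATES):
        error = (1.0 if k == next_state else 0.0) - probs[k]
        for b in active_bits:
            weights[b][k] += alpha * error

# Toy stochastic transition: from state 0, go to 1 (p=0.7) or 2 (p=0.3).
for _ in range(20000):
    nxt = 1 if rng.random() < 0.7 else 2
    learn(sdr[0], nxt)

# Likelihoods for next states 0, 1, 2; states 1 and 2 should land
# near 0.7 and 0.3, with some noise from the online updates.
print([round(p, 2) for p in likelihoods(sdr[0])])
```

All of the learning happens in `weights`; the SDRs never change, which matches the point that the classifier, not the SP, is solving the prediction problem.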

hwangtamu · April 12, 2017, 10:25pm

Scott cleared up my confusion exactly.
The architecture of the system could be something like:

Raw Input -> Binary Encoder -> Spatial Pooler -> Layer of SDR Classifier -> Decoder to output

My previous assumption was incorrect. The classifier plays the key role.
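The pipeline above could be sketched end to end roughly as follows. Everything here is a toy stand-in under stated assumptions, not NuPIC code: `encode` is a simple block encoder, `spatial_pool` picks the columns with the highest overlap from fixed random potential pools (no permanence learning), and the classifier is a per-column vote count rather than the real SDRClassifier. The MDP is a deterministic four-state cycle chosen only for illustration.

```python
import random
from collections import Counter, defaultdict

rng = random.Random(42)
N_INPUT, N_COLUMNS, N_ACTIVE = 32, 128, 6

def encode(state, n_states=4, width=8):
    """Binary encoder: one contiguous block of `width` on-bits per state."""
    bits = [0] * (n_states * width)
    for i in range(state * width, (state + 1) * width):
        bits[i] = 1
    return bits

# SP stand-in: each column samples a fixed random subset of input bits
# (its potential pool). No permanence learning is modeled here.
pools = [rng.sample(range(N_INPUT), 12) for _ in range(N_COLUMNS)]

def spatial_pool(bits):
    """Return the N_ACTIVE columns with the highest overlap (ties by index)."""
    overlaps = [sum(bits[i] for i in pool) for pool in pools]
    ranked = sorted(range(N_COLUMNS), key=lambda c: (-overlaps[c], c))
    return frozenset(ranked[:N_ACTIVE])

# Classifier stand-in: per-column vote counts for each observed next state.
votes = defaultdict(Counter)

def learn(columns, next_state):
    for c in columns:
        votes[c][next_state] += 1

def decode(columns):
    """Decoder: sum the votes of the active columns, return the top state."""
    tally = Counter()
    for c in columns:
        tally.update(votes[c])
    return tally.most_common(1)[0][0] if tally else None

# Toy deterministic dynamics: 0 -> 1 -> 2 -> 3 -> 0.
for step in range(40):
    s = step % 4
    learn(spatial_pool(encode(s)), (s + 1) % 4)

# Predicted next state for each current state.
print([decode(spatial_pool(encode(s))) for s in range(4)])
```

In this sketch the SP stage only produces a stable sparse code for each state; the state-to-next-state mapping lives entirely in `votes`, which is the classifier's role.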
