It’s like why we don’t just use FullyConnected/Dense layers in neural networks. It’s:
- using too much memory
- too slow - O(N^2)
Also, a 2D connection list isn’t enough; you’ll need a 3D one to make sequence learning work. Otherwise the algorithm can’t learn any concept of time and acts like a simple a->b mapping algorithm. A toy sketch of the difference follows below.
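Here is a minimal toy sketch (hypothetical names, not the actual TM data structures) of what I mean: with column-to-column links only, a set of columns can only ever predict the union of everything that followed it, while per-cell links let the same columns carry different temporal contexts.

```python
# "2D": transitions keyed by active columns only.
transitions_2d = {}     # frozenset(columns) -> set of possible next columns

def learn_2d(prev_cols, next_cols):
    transitions_2d.setdefault(frozenset(prev_cols), set()).update(next_cols)

# After learning B->A->C and D->A->E, the columns of A predict {C, E}:
# the context (B vs D) is lost, so this is just an a->b mapping.
learn_2d({"A"}, {"C"})
learn_2d({"A"}, {"E"})

# "3D": each column owns several cells, and which cell fires depends on the
# previous input, so "A after B" and "A after D" form different keys and
# can predict different successors.
transitions_3d = {}     # frozenset((column, cell)) -> set of next columns

def learn_3d(prev_cells, next_cols):
    transitions_3d.setdefault(frozenset(prev_cells), set()).update(next_cols)

learn_3d({("A", 0)}, {"C"})   # A in the context of B
learn_3d({("A", 7)}, {"E"})   # A in the context of D
```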
One reason is that in most applications the inputs are not all semantically dissimilar. For example, if the TM algorithm learned the transition A -> B', and later an input came in which shared 25% of the same bits as A, I would expect a weak prediction (depending on the configuration) whereby some (but not all) of the cells representing B' become predictive.
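As a hedged toy example (made-up sizes and an artificially low threshold, not real TM code), this is roughly the behaviour I would expect from a 25%-overlapping input:

```python
import random
random.seed(0)

A_bits = set(range(40))          # the 40 ON bits of pattern A
threshold = 3                    # assumed (artificially low) segment threshold

# Each cell representing B' learned one segment sampling 10 of A's bits.
segments_for_B = {cell: set(random.sample(sorted(A_bits), 10))
                  for cell in range(100, 140)}

def predictive_B_cells(input_bits):
    # A cell turns predictive only if enough of its synapses land on ON bits.
    return [c for c, seg in segments_for_B.items()
            if len(seg & input_bits) >= threshold]

len(predictive_B_cells(A_bits))           # 40: every B' cell predicts
len(predictive_B_cells(set(range(10))))   # fewer: a weak, partial prediction
```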
Your 3-step TM algorithm looks simple, which is good, but I honestly do not follow how it would satisfy the TM computational/programming requirements. Could you explain more?
My understanding is that the TM inputs are the active columns set up by the SP; the TM doesn’t directly care that much about SDRs.
I think you meant ON bits of the input space. Not all columns that contain a subset of the SDR bits in their receptive fields become active; if they did, the TM algorithm would be simpler. I guess I know how you got your 3-step algorithm.
I think the TM algorithm is not complicated if one compares it to other learning algorithms out there. It is straightforward; however, it is a bit hard to form a mental picture of it because the sequence learning is done in a distributed manner.
Implementation-wise, the SDR is just the concept of how HTM prefers to represent its inputs; inputs are encoded as SDRs using specific SDR encoders. The SP, I believe, is yet another concept, but implementation-wise it maps columns to input bits in the input space in groups called receptive fields. An input bit may or may not fall into a column’s receptive field. This also means that if a column becomes active, it is because it has “seen a set of bits” that overlaps the input (represented as an SDR). In practice this set is almost always a subset of the input’s bits, not a superset. So an active column is really carrying the information “I activate when I see this pattern within my receptive field”.
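To illustrate that point, here is a minimal sketch of column activation by overlap. The names and sizes are assumptions, and it ignores permanences, learning and boosting entirely:

```python
import numpy as np

rng = np.random.default_rng(0)

n_inputs, n_columns = 1024, 256
receptive_field_size = 64       # assumed: each column samples 64 input bits
n_active_columns = 10           # assumed sparsity target (~4% of columns)

# Each column "sees" only a random subset of the input space.
receptive_fields = [rng.choice(n_inputs, receptive_field_size, replace=False)
                    for _ in range(n_columns)]

def active_columns(input_sdr):
    """input_sdr: boolean array of length n_inputs (the encoded input)."""
    # A column's overlap is how many ON bits fall inside its receptive field.
    overlaps = np.array([input_sdr[rf].sum() for rf in receptive_fields])
    # Only the best-overlapping columns activate, not every column that
    # happens to contain some subset of the input's bits.
    return np.argsort(overlaps)[-n_active_columns:]

# Usage: encode an input with ~2% ON bits and pick the winning columns.
input_sdr = np.zeros(n_inputs, dtype=bool)
input_sdr[rng.choice(n_inputs, 20, replace=False)] = True
winners = active_columns(input_sdr)
```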
Going back to your 3-step algo, how does sequence learning work in that simple algo?
@OhMyHTM This looks to me like @Jose_Cueto is trying to understand your points and asking you counter questions. I also don’t quite understand the point you are trying to make. You’re saying the input space doesn’t have to be sparse, so may not technically be an SDR, and I can see from Jose’s response he also understands that, too. But you are claiming (it seems to me) that the TM has too many steps, and could be simplified. Correct? If so, that’s big news.
Maybe this is the confusion. It’s not really two vectors, but a vector of vectors. From one neuron’s standpoint, it has X dendritic segments, each having a unique number of synapses. This is more complex than just two vectors.
My apologies for the confusion. I was trying to clarify and test my understanding of HTM from the implementation point of view. Most importantly, simplifying the TM implementation would, I believe, be a big code improvement.
@rhyolight
If you are confused, I can write a demo later. I think these two vectors can store the transition of the pattern and can be used for prediction.
What does each of those bit strings represent? It seems quite a feat to get the TM down to two vectors, since there are so many layers of data structures:
Each SP mini-column contains a list of neurons, each of which contains a list of dendrite segments, each of which contains a list of synapses to specific cells (each synapse with a continually updating permanence value). I’m curious how simple a data structure can be while maintaining all of this info.
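For reference, a direct (and deliberately naive) transcription of that nesting might look like the sketch below. The names are hypothetical, and real implementations flatten these lists into arrays for performance:

```python
from dataclasses import dataclass, field
from typing import List

@dataclass
class Synapse:
    presynaptic_cell: int   # index of the cell this synapse connects to
    permanence: float       # continually updated during learning

@dataclass
class Segment:
    synapses: List[Synapse] = field(default_factory=list)

@dataclass
class Cell:
    segments: List[Segment] = field(default_factory=list)

@dataclass
class MiniColumn:
    cells: List[Cell] = field(default_factory=list)

# e.g. 2048 mini-columns x 32 cells; segments and synapses grow with learning.
layer = [MiniColumn(cells=[Cell() for _ in range(32)]) for _ in range(2048)]
```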
How would the SP algorithm reach anything close to O(n^2)?
There is a limited number of dendrites with a limited number of synapses. The dendrites are evaluated once per exposure (unless something very different has been introduced into the theory). Training (actual training and boosting) is done once per dendrite per exposure.
This gives O(2n) -> O(n) since there is nothing but linear referencing of the dendrites.
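A rough sketch of that argument, with assumed thresholds: each segment is visited once per exposure to count its active connected synapses (and at most once more to adapt permanences), so the cost is linear in the total number of synapses.

```python
CONNECTED_PERM = 0.5        # assumed connection threshold
ACTIVATION_THRESHOLD = 13   # assumed segment activation threshold

def evaluate(segments, active_cells):
    """segments: iterable of lists of (presynaptic_cell, permanence) pairs.
    active_cells: set of currently active cell indices."""
    active_segments = []
    for seg in segments:                        # one pass over all segments
        n = sum(1 for cell, perm in seg
                if perm >= CONNECTED_PERM and cell in active_cells)
        if n >= ACTIVATION_THRESHOLD:
            active_segments.append(seg)
    return active_segments
```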