Measuring/Testing TM Sequence storage capacity?

mraptor · June 2, 2020, 7:20pm

I know how to measure TM Transitions capacity :

 TCapacity = (nseg * pat-per-seg) / sparsity

My question is how to measure, probably impossible … then how to test the SeqCapacity of specific implementation of TM.

My conundrum is because there are many possible sequence “types/forms” f.e. :

    aaaaaaaa........
    ababab...........
    ab***ab****ab****.........
    ab*c***ab*c***ab*c.........
    and so on .........

OR what about of sequence that combines all of the above ?

what sort tests do you devise and what sort of statistics or other measures do you use ?

sheiser1 · June 2, 2020, 11:18pm

Yes, this makes the testing of TM inherently subjective to some extent.

When I think of sequence “types/forms”, I imagine two basic parameters:

degree of noise/randomness present
length & complexity of patterns present

So you could test sequences which are:

low noise & low complexity (which would consume the least resources)
high noise & high complexity (which would consume the most)
low noise & high complexity (somewhere in between)
high noise & low complexity (somewhere in between)

Then there’s the question of how to measure the level of resources consumed. I think the number of TM distal segments is a good basic one.

mraptor · June 2, 2020, 11:23pm

how do you represent and calculate complexity in the sequence ?

Then there’s the question of how to measure the level of resources consumed. I think the number of TM distal segments is a good basic one.

yeah , I would test using different number of segments.

sheiser1 · June 3, 2020, 5:43am

I basically mean how much overlap there is between the sequence elements.

So take two repeating sequences of equal length (seq.1 & seq.2) , say:

A, B, C, D, X, B, C, Y, …
and
A, B, C, D, E, F, G, H, …

Both are 8 time steps long, but seq.1 has repeating elements (B and C).
This means that to fully learn seq,1 a TM has to learn to distinguish B after A from B after X. and C before D from C before Y. The 2 learned B’s and C’s mean there are 2 winner cells (and thus 2 segments) in all B-columns and all C columns.

Seq.2 however has no repeating elements, so just 1 segment in the all A-H columns.
This means TM has fewer segments, and also that it learn the precise sequence faster.

I’d measure seq complexity by how long it takes the TM to stabilize. Meaning how long before the anomaly score settles at 0 and the prediction count settles at 1. This should theoretically happen at some point in any noiseless sequence, assuming it repeats enough times.

Topic		Replies	Views
Testing TM implementations? Numenta Theory sequence-memory	7	934	June 16, 2016
Exploring htm.core and the TM parameters NuPIC Community Fork	11	904	January 23, 2023
Raw TM Test (no SP) NuPIC encoders , temporal-memory , category-encoding	30	1371	June 10, 2018
Quck question? Lounge	2	617	April 4, 2021
My analysis on why Temporal Memory prediction doesn't work on sequential data Numenta Theory sequence-memory	58	7428	February 2, 2020

Measuring/Testing TM Sequence storage capacity?

Related topics