I’m wondering what thoughts everyone has on dealing with different timescales in the inputs to an HTM. The brain doesn’t know ahead of time what timescale events occur on, and yet it does a good job of figuring it out for various events (e.g. speech or reading < 1 s, walking ~1 s, etc.). This is despite the fact that inputs are processed roughly every few milliseconds. But if I were to run a temporal memory program that sees a thousand SDRs a second and needs to understand events that happen on a timescale of seconds, all context would probably be lost between events. So, my question is: how do you preserve context across very long gaps? (Very long on the timescale of individual inputs.)
My first thought is perhaps a second, ‘slow’ TM that reads multiple concatenated inputs from the ‘fast’ TM. But is there any evidence that the brain has multiple systems for working on different scales like this?
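To make the idea concrete, here’s a toy sketch of what I mean (all names and numbers here are made up for illustration, not from any HTM library): a “fast” layer emits an SDR every tick, and a “slow” layer only samples a pooled summary of the last few fast outputs.

```python
# Toy sketch of a two-timescale arrangement (illustrative only):
# a "fast" layer emits an SDR every tick, and a "slow" layer only
# sees one pooled input per WINDOW fast ticks.

WINDOW = 4  # how many fast ticks make one slow tick (assumed)

def fast_layer(t):
    """Stand-in for a fast TM: returns a small SDR (set of active bits)."""
    return {t % 8, (t * 3) % 8}

def pool(sdrs):
    """Summarize several fast SDRs into one input for the slow layer.
    Here: a simple union; a real system might use temporal pooling."""
    out = set()
    for s in sdrs:
        out |= s
    return out

buffer, slow_inputs = [], []
for t in range(12):
    buffer.append(fast_layer(t))
    if len(buffer) == WINDOW:
        slow_inputs.append(pool(buffer))  # slow layer sees 1 input per 4 ticks
        buffer = []

print(len(slow_inputs))  # 3 slow inputs from 12 fast ticks
```

The slow layer would then run its own TM over `slow_inputs`, giving it context that spans WINDOW-times-longer events.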
From an engineering perspective that makes sense, but I’m not sure that’s compatible with a brain-inspired implementation. There must be some other way to preserve context over various timescales while still maintaining a single sampling frequency. I could be wrong, though. Maybe the brain does do subsampling as part of the upstream data flow?
This sounds to me like Temporal Pooling (TP).
In case you’re not familiar, a TP region (as I understand it) is one that monitors another region and stabilizes when that region sees a familiar pattern.
That monitored region (maybe Layer 4) is the one that processes the raw sensory data, as is typically done in applications.
So if a sensory region sees the familiar sequence “A,B,C,…X,Y,Z” it will precisely predict each next element, taking on a different activation state for each letter (a different specific set of active cells).
The TP region monitoring the sensory region, however, would theoretically maintain a single activation state throughout the entire familiar sequence. That activation would basically represent “alphabet sequence” rather than the constituent letters.
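Here’s a toy illustration of that stability property (this is just my own sketch, not Numenta’s algorithm): the sensory layer takes a different random SDR for each letter, while the pooling layer holds one fixed SDR for the whole sequence, so step-to-step overlap is low in the sensory layer but total in the pooled layer.

```python
# Toy illustration (not Numenta's algorithm): the sensory layer takes a
# different activation state for each element of a learned sequence,
# while the pooling layer keeps one stable state for the whole sequence.

import random
random.seed(0)

def sdr(n_bits=64, n_active=8):
    """A random SDR: n_active bits chosen from n_bits."""
    return frozenset(random.sample(range(n_bits), n_active))

sequence = list("ABCDE")
sensory_states = {ch: sdr() for ch in sequence}   # distinct per letter
alphabet_state = sdr()                            # one pooled SDR for the sequence

def overlap(a, b):
    return len(a & b)

# Step through the sequence: the sensory activation changes each step,
# the pooled activation does not.
prev_sensory = prev_pooled = None
for ch in sequence:
    sensory, pooled = sensory_states[ch], alphabet_state
    if prev_sensory is not None:
        print(ch, overlap(sensory, prev_sensory), overlap(pooled, prev_pooled))
    prev_sensory, prev_pooled = sensory, pooled
```

The sensory overlaps come out small (random SDRs share few bits), while the pooled overlap is the full 8 bits every step, i.e. “alphabet sequence” rather than the constituent letters.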
I don’t know what the neuroscience evidence is, but I believe it does exist. I believe this TP-style region is encapsulated in Layer 2/3 of Numenta’s model of the macro cortical column. The different layers use the same core mechanisms (TM-style distal learning & SP-style activation), but differ in where their inputs come from and where their outputs go (which other regions).
This kind of multi-layer system is enacted with the Network API here:
To those familiar with the macro column model, please correct any faults/gaps here!
Ahhhhh, that makes a lot of sense. So you have something akin to an SP, but its minicolumn analogues respond similarly to any of many unique SDRs in the TM that provides the input, as long as they’re part of the same sequence. Is that right?
Yes (as far as I understand), and the TP region cells can depolarize cells in the sensory region too through apical feedback. That is enacted here:
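To sketch what that feedback does (the connectivity below is entirely made up for illustration): cells in the pooled region have learned apical connections onto sensory-region cells, so an active pooled state puts those sensory cells into a depolarized, predictive state before the next input arrives.

```python
# Sketch of apical feedback (illustrative only; connections are made up):
# active cells in the pooled region depolarize sensory-region cells
# through learned apical connections, biasing them to become active.

# Hypothetical learned apical connections: pooled cell -> sensory cells
apical = {
    10: {3, 7},    # pooled cell 10 projects to sensory cells 3 and 7
    11: {7, 12},
}

def depolarized(pooled_active):
    """Sensory cells receiving apical input from active pooled cells."""
    out = set()
    for cell in pooled_active:
        out |= apical.get(cell, set())
    return out

print(depolarized({10, 11}))  # {3, 7, 12}
```

So even if distal (sequence) context in the sensory region were lost, the stable pooled state keeps the right sensory cells predicted.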
In its repo (https://github.com/numenta/htmresearch/tree/master/htmresearch/algorithms) there are also temporal pooling scripts; I believe the most current one is the “union_temporal_pooler”. I haven’t delved into it yet, though it’s where I’d look to drill down on the TP functionality as currently implemented.
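My reading of the union-pooling idea, sketched as a toy (this is not the actual `union_temporal_pooler` code; DECAY and BOOST are assumed constants): each pooled cell carries a “pooling activation” that is boosted while its input is active and decays otherwise, so the pooled SDR is a persistent union of recently active cells.

```python
# Conceptual sketch of union pooling (my reading of the idea, not the
# actual union_temporal_pooler implementation): each pooled cell has a
# "pooling activation" that is boosted when driven by input and decays
# otherwise, so the union persists across several timesteps.

DECAY = 1        # activation lost per step (assumed)
BOOST = 3        # activation gained when driven by input (assumed)

activation = {}  # cell -> current pooling activation

def step(active_input):
    """Decay all activations, boost the driven cells, return the union SDR."""
    for cell in list(activation):
        activation[cell] = max(0, activation[cell] - DECAY)
        if activation[cell] == 0:
            del activation[cell]
    for cell in active_input:
        activation[cell] = activation.get(cell, 0) + BOOST
    return set(activation)

print(step({1, 2}))  # {1, 2}
print(step({3}))     # {1, 2, 3}  -- earlier cells persist
print(step(set()))   # {1, 2, 3}  -- still decaying, union holds
```

The persistence window is set by BOOST/DECAY, which is where the “slow” timescale would come from.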
Great! I’m looking forward to adding TPs to my library. I haven’t had this much fun coding in like 6 years.
Sounds great, I’d be curious how your pooling algorithm works compared to current implementations.
Glad to hear it
You might find this interesting:
If I understand this correctly, the distribution over multiple areas is important for encoding multiple timescales. In HTM, this goes to the H of HTM.