Basic Spatial Pooler Questions

sheiser1 · December 16, 2019, 9:12pm

In this example there are 3 spatial features per timestep (‘timeOfDayBits’, ‘weekendBits’, ‘consumptionBits’), and they are combined, since the SP & TM ultimately see one vector per timestep – however many spatial features that vector contains.
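
As a rough sketch of that combination step (the bit widths and bit patterns below are made up for illustration, and `concat_encodings` is a hypothetical helper, not a NuPIC function), the per-field encodings are simply concatenated into one vector:

```python
def concat_encodings(*fields):
    """Join per-field bit lists into the single vector the SP sees."""
    combined = []
    for bits in fields:
        combined.extend(bits)
    return combined


# Hypothetical encoder outputs for one timestep (illustrative values only).
time_of_day_bits = [0, 1, 1, 0, 0, 0]
weekend_bits = [1, 0]
consumption_bits = [0, 0, 1, 1, 0, 0, 0, 0]

# One input vector per timestep, regardless of how many fields it contains.
sp_input = concat_encodings(time_of_day_bits, weekend_bits, consumption_bits)
print(len(sp_input))
```

The SP has no notion of where one field ends and the next begins; it only sees the combined bit vector.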

So if your sequences are composed of words, there’d be one encoding, SP vector and TM output per word (one after the other). The system will learn the transitions between words. There’s a big limitation to this though! Since there is no encoder for words in NuPIC, each different word would be treated as a distinct category by default – so the system sees no similarity between words.
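
To make the limitation concrete, here is a minimal sketch of a category-style encoding (`encode_category`, `overlap` and the tiny vocabulary are hypothetical, not NuPIC's actual encoder). Each word gets its own block of bits, so two different words never share an active bit, however related they are:

```python
def encode_category(index, num_categories, bits_per_category=4):
    """Give each category its own non-overlapping block of active bits."""
    vec = [0] * (num_categories * bits_per_category)
    start = index * bits_per_category
    for i in range(start, start + bits_per_category):
        vec[i] = 1
    return vec


def overlap(a, b):
    """Count the positions where both vectors have an active bit."""
    return sum(x & y for x, y in zip(a, b))


# Hypothetical vocabulary: each word is just an arbitrary category index.
vocab = {"cat": 0, "kitten": 1, "car": 2}
encoded = {w: encode_category(i, len(vocab)) for w, i in vocab.items()}

print(overlap(encoded["cat"], encoded["cat"]))     # same word
print(overlap(encoded["cat"], encoded["kitten"]))  # related words
print(overlap(encoded["cat"], encoded["car"]))     # unrelated words
```

Under these assumptions "cat" overlaps with "kitten" exactly as much as with "car" (not at all), which is the missing-similarity problem described above.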

To my knowledge this issue has been most robustly handled by Cortical.io – which has basically created a method for generating SDRs from words and even groupings of words. You feed in a word, sentence, paragraph or whatever and get back an SDR – as you normally would from the SP. Then these SDRs are fed into the TM just like any other case.

If you’re processing words as your raw data type, I’d highly recommend looking into Cortical.io and their work on this.

David_Keeney · December 17, 2019, 4:45pm

Also check out the SimHashDocumentEncoder, which is part of the htm-community repository htm.core. This is basically an encoder for words.

github.com

htm-community/htm.core/blob/master/src/htm/encoders/SimHashDocumentEncoder.README.md

# SimHash Document Encoder Algorithm Details

SimHash is a Locality-Sensitive Hashing (LSH) algorithm from the world of
nearest-neighbor document similarity search. It is used by the GoogleBot Web
Crawler to find near-duplicate web pages.

"Similarity" here refers to bitwise similarity (small hamming distance, high
overlap), not semantic similarity (encodings for "apple" and "computer" will
have no relation here). Internally, hamming distances are never considered or
adjusted -- they're always the result of a kind of dynamic statistical
distribution.
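
As a rough illustration of the general SimHash idea (this is a sketch of the classic algorithm, not htm.core's implementation; the function names, the use of SHA-256 and the 64-bit size are all assumptions made here), each token is hashed to a fixed number of bits, the bits vote per position, and the sign of each vote becomes one output bit:

```python
import hashlib


def token_hash_bits(token, size):
    """Hash a token and return its lowest `size` bits as a list of 0/1."""
    digest = hashlib.sha256(token.encode("utf-8")).digest()
    value = int.from_bytes(digest, "big")
    return [(value >> i) & 1 for i in range(size)]


def simhash(tokens, size=64):
    """Classic SimHash: per-position vote over all token hashes."""
    sums = [0] * size
    for token in tokens:
        for i, bit in enumerate(token_hash_bits(token, size)):
            sums[i] += 1 if bit else -1  # a 1 bit votes up, a 0 bit votes down
    return [1 if s > 0 else 0 for s in sums]


def hamming(a, b):
    """Number of positions where two bit lists differ."""
    return sum(x != y for x, y in zip(a, b))


doc_a = "the quick brown fox jumps over the lazy dog".split()
doc_b = "the quick brown fox jumps over the lazy cat".split()
print(hamming(simhash(doc_a), simhash(doc_b)))
```

Documents that share most of their tokens share most of their votes, so their hashes tend to differ in few bits. Nothing here knows what the tokens mean, which is why the similarity is bitwise rather than semantic.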


## Code

| What | Where |
| ---- | ----- |
| Source | `./SimHashDocumentEncoder.hpp` |
| Example | `py/htm/examples/encoders/simhash_document_encoder.py` |
| Tests | `bindings/py/tests/encoders/simhash_document_encoder_test.py` |


kkaraoglan · January 8, 2020, 10:20am

Hello Sir, @subutai

I am working on a simple SP application.
I’m trying to understand the block diagram in the article. Are the output values ([8 2 … 10 0]) assigned randomly? I don’t understand how those values were reached.

For example, there is one input vector, and the input and output vectors each have 28 elements. So is the overlap with the input vector computed 28 times?

Thank you…

rhyolight · January 8, 2020, 3:23pm

Which article? I don’t use this diagram to explain Spatial Pooling.

kkaraoglan · January 8, 2020, 5:45pm

"Properties of Sparse Distributed Representations and their Application to Hierarchical Temporal Memory"

rhyolight · January 9, 2020, 10:53pm

In the diagram, I believe the output values are just an example of what the overlap scores might be. It is trying to show the minicolumn competition: the k minicolumns with the highest overlap scores are activated. This is also called a k-winners-take-all competition.
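
A minimal sketch of that competition, assuming a simplified Spatial Pooler with no boosting, no stimulus threshold and global inhibition (the connection lists and helper names are made up for illustration): each minicolumn's overlap score is the number of its connected input bits that are active, and the k highest-scoring minicolumns win.

```python
def overlap_scores(input_bits, connections):
    """Score each minicolumn by how many of its connected inputs are active."""
    return [sum(input_bits[i] for i in connected) for connected in connections]


def k_winners(scores, k):
    """Return the indices of the k highest scores (ties go to the lower index)."""
    ranked = sorted(range(len(scores)), key=lambda c: (-scores[c], c))
    return sorted(ranked[:k])


input_bits = [1, 0, 1, 1, 0, 1]

# Hypothetical connected synapses: for each minicolumn, the input indices it reads.
connections = [
    [0, 2, 3],  # minicolumn 0
    [1, 4],     # minicolumn 1
    [0, 5],     # minicolumn 2
    [1, 2, 4],  # minicolumn 3
]

scores = overlap_scores(input_bits, connections)
print(scores)                # [3, 0, 2, 1]
print(k_winners(scores, 2))  # [0, 2]
```

So the scores are not random: there is one per minicolumn, and each follows from the input and that minicolumn's connections.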
