Nupic-master vs htm-core

shiva · June 28, 2021, 2:02pm

Hello
Are the two algorithms ‘nupic-master’ and ‘htm-core’ different in terms of implementation and performance? Which of them works better?
I have a few questions about these two algorithms. I was very confused and the results I get from the run of these codes contradicts what I have read before.Please help me understand the answers to these questions:
1)I give a specific input to the Spatial pooler algorithm from nupic-master and during several runs the output of the algorithm is always the same( the SDR is fixed).But when I give a specific input to the Spatial pooler algorithm from htm-core and run the algorithm several times the SDR output is different each time. What is the problem?

2)In htm-core I defined the parameters as follows:
default_parameters = {
'numActiveColumnsPerInhArea ’ :15.0,
‘potentialRadius’: 16,
‘boostStrength’: 0.0,
‘columnDimensions’: (28, 28),
‘dutyCyclePeriod’: 1000,
‘localAreaDensity’: 0.1,
‘minPctOverlapDutyCycle’: 0.2,
‘potentialPct’: 0.1,
‘stimulusThreshold’: 10,
‘synPermActiveInc’: 0.01,
‘synPermConnected’: 0.5,
‘synPermInactiveDec’: 0.008
}
But the SDR_output has 41 bits (instead of 15 bits) As shown below. Why? How do I set it up?

sp_out.nonzero() (array([ 76, 100, 123, 127, 146, 160, 216, 230, 267, 280, 294, 304,
310,325, 329, 343, 347, 350, 351, 352, 373, 381, 382, 401, 414,
416,428, 456, 497, 552, 554, 570, 576, 592, 593, 619, 652, 692,
695, 728, 748]),)

3)In the ‘nupic-master’ algorithm in both cases “sp.compute(inputArray,True, activeArray)” and “sp.compute(inputArray,False, activeArray)” the output of a particular input is the same and as follows. Shouldn’t the outputs of these two modes be different from each other?

(‘inputArray.nonzero()’, (array([157, 184, 185, 186, 212, 213, 214, 239, 240, 241, 267, 268, 269,
294, 295, 296, 297, 322, 323, 324, 350, 351, 352, 378, 379, 380,
405, 406, 407, 433, 434, 435, 461, 462, 463, 488, 489, 490, 516,
517, 518, 544, 545, 546, 572, 573, 599, 600, 601, 627, 628, 629,
655, 656, 657, 684]),))

(‘activeArray.nonzero()’, (array([ 12, 13, 71, 99, 149, 159, 176, 219, 270, 335, 504, 525, 653, 677, 713]),))

dmac · June 29, 2021, 1:44am

I would recommend using htm-core.
Full disclosure, I wrote parts of htm-core.

Did you remember to set the random seed? If the random seed is zero (the default case) then it uses the current time as the random seed. All of the htm-core algorithms which use random numbers should also accept a keyword argument “seed”.

I think that your Spatial Pooler is configured to have multiple inhibition areas.
Try replacing the potentialRadius parameter with globalInhibition=True.
Also you should probably only specify one of numActiveColumnsPerInhArea and localAreaDensity since they do the same thing and one of them overrides the other.

I hope this helps

shiva · July 15, 2021, 1:15pm

Thanks for your good tips; it was solved. Now I have some new questions. Would you please guide me as always?
First, I set the parameters of the two codes ‘nupic-master’ and ‘htm.core’ in spatial pooler mode, the same and set the parameter ‘localAreaDensity=0.02’ and ‘numActiveColumnsPerInhArea=15’. Then I gave one of the MNIST dataset images as input to these two codes and noticed that when ‘globalInhibition=True’ , the output of Spatial pooler in both cases has 15 bits but is different (shown below)

Htm.core _output =[209,214,219,230,231,346,349,432,468,556,581,595,597,609,627]

Nupic-master _output =[12,13,71,99,149,159,176,219,270,335,504,525,653, 677, 713]

And when ‘globalInhibition=False’ , both the number of active bits and their locations are different.

I did the same for other images from the MNIST database, and the same thing happened again (shown below).

Htm.core _output = [158,174,241,287,291,313,356,367,428,434,571,609,658,689]

Nupic-master _output =[4,10,13,99,149,176,219,270,335,504,525,546,549,653,677]

Why are the active bits in the two algorithms’ nupic-master’ and ‘htm.core’ different?
Is it possible to distinguish which of these two algorithms is better and has a better answer?
What is the reason for the difference in the number of active bits in the ‘globalInhibition=False’ mode?
Are these ‘nupic-master’ and ‘htm.core’ different in algorithm logic, or is their implementation different?
Algorithm’ htm. core’ with variable ‘globalInhibition=False’ always has less active bits. Does this indicate the superiority of the algorithm in sparsing data over other methods?
How can these two algorithms be compared?
I did the same tests for the mHTM algorithm related to the article “A Mathematical Formalization of Hierarchical Temporal Memory’s Spatial Pooler for use in Machine Learning”. The active output bits were different from the above two methods. Do you have information about this algorithm and why it differs?

Stephen_Grossberg · July 15, 2021, 1:29pm

Please ask these questions of the supervisors where you study or work.

shiva · July 15, 2021, 2:14pm

Unfortunately, my supervisor is not very familiar with the HTM algorithm. If anyone knows the answers to my questions, please help me understand. The answers to these questions are probably very simple for those who wrote these codes. I would like to know more about this algorithm and why it happened. Thank you all

dmac · July 15, 2021, 4:34pm

1 The Spatial Pooler algorithm starts with random synapses.
Both implementations use a random seed, so the indexes of the active bits won’t change every time you run it (unless you forget to set the seed or set it to a random number).
However, different implementations use different random number generators so the results will never be identical, even if you use the same random seed.

4 There might be some slight differences, but on the whole they should perform the same algorithm.

5 I can not speak about globalInhibition=False because I’m not very familiar with that part of the code.

6 There are two ways to compare the algorithms:

Speed: how fast does it run?
htm-core runs faster than nupic.

Quality of results: does it do what a spatial pooler is supposed to do?
Both implementations should be about equal on this point.
The properties which SP are supposed to have are described in this paper: https://www.frontiersin.org/articles/10.3389/fncom.2017.00111/full

sheiser1 · July 15, 2021, 6:38pm

First thing I’d do is swap out MNIST for the old reliable Hotgym dataset, since it’s so well studied.

You’re using images right, a much less common data type for HTM, so the deviations could maybe stem from different image encoding processes.

I’d be curious if the deviations remain with Hotgym data.

katrin · July 16, 2021, 2:03pm

That was interesting. Until now, I thought the two algorithms were different. Is there an article on how these two algorithms work?

shiva · July 16, 2021, 2:18pm

Thanks a lot, dear @dmac

One of my goals is to be able to compare the output of different spatial pooler algorithms and analyze the results.Based on the explanation you gave, I realized that the two algorithms Nupic-master and htm.core are the same. But I’m still looking for different results and finding the difference in the output of the algorithms that implement HTM.

I reviewed the article you introduced. It is referenced in this article "Mnatzaganian, J., Fokoué, E., and Kudithipudi, D. (2017). A mathematical formalization of hierarchical temporal memory’s spatial pooler "which is the mHTM code I wrote about before.

Would you mind telling me the differences and similarities between this code and Nupic-master regarding the algorithm?

Another algorithm called Etaler has been written to implement HTM.

2)Do you expect the spatial pooler output of this code to look like a nupic-master or not?

3)Can you explain the differences and similarities between Etaler and Nupic Master?

4)Is there another algorithm that implements HTM, and its logic is different from Nupic-master so that I can check the difference between its output and Nupic-master?

shiva · July 16, 2021, 2:26pm

Thanks for your suggestion

The htm .core code uses the Hotgym dataset, but I did not see the Hotgym dataset as input in the nupic -master algorithm. Do you have this code?

sheiser1 · July 16, 2021, 6:48pm

Yep, its well documented here:

github.com

numenta/nupic/blob/master/examples/opf/clients/hotgym/anomaly/one_gym/README.md

# One Hot Gym Anomaly Tutorial

The program in this folder is the complete source code for the "One Hot Gym Anomaly" Tutorial. You can follow along with the construction of this tutorial's source code in the screencast below.

[![One Hot Gym Anomaly Tutorial Screencast](http://img.youtube.com/vi/1fU2Mw_l7ro/hqdefault.jpg)](http://www.youtube.com/watch?v=1fU2Mw_l7ro)

## Premise

The "hot gym" sample application has been around for a long time, and was one of the first real-world applications of NuPIC that actually worked. The data used is real energy consumption data from a gym in Australia. It is aggregated hourly already, so the input file at [rec-center-hourly.csv](rec-center-hourly.csv) simply contains a timestamp and float value for energy consumption during that hour.

This tutorial picks up where the [One Hot Gym Prediction Tutorial](../../prediction/one_gym/README.md) left off, and shows users how to convert their prediction model into an anomaly detection model.

## How to Run

To run and output data to a local file:

    ./run.py

To run and output data to a **matplotlib** graph:

This file has been truncated. show original

shiva · July 17, 2021, 8:57am

thanks a lot @sheiser1

David_Keeney · July 18, 2021, 3:34pm

As another developer of htm.core, all of the modules implement the same HTM algorithms and we started with the code from nupic.master. Some of the modules were ported from Python to C++, others we used the existing C++ code and re-implemented parts of it to improve the speed and replaced some of the third party dependencies for ones that are more robust or easier to use and maintain. We also replaced many of the flat arrays with the SDR object.

I would not expect htm.core to produce exactly the same bit patterns in the resulting SDR as Nupic, partly due to the randomization that @dmac mentioned and partly do to minor compromises we made to improve speed. There were also some minor differences in the parameters used and the actual bit patterns in the SDR’s is very sensitive to the parameters. However, the job they are performing should be exactly the same.

katrin · July 19, 2021, 1:17am

Thanks for your good explanation @David_Keeney

Topic		Replies	Views
What is the difference in implementation? NuPIC spatial-pooling	14	1243	January 14, 2020
Developing a better NuPIC. Need suggestions Implementations htm-implementations , community , projects	17	1533	July 4, 2018
The output of SP NuPIC	2	728	November 25, 2019
How much of the nupic python implementation use matrix math NuPIC	3	513	October 18, 2018
Is there a maximum length for the inputVector of the Spatial Pooler? NuPIC spatial-pooling , question	8	718	December 4, 2017

Nupic-master vs htm-core

Related topics