Question about serialization behaviour

Hello,
I am currently working on a BA thesis with the topic “Hierarchical temporal memory for in-car network anomaly detection”. I am nearing the end of my work and have noticed a few things so far, some of which I can’t answer by simply investigating the issue myself. So I hope someone here can help me out.

I am using HTM-core 2.1.15 with Python 3.7 and based my program on the hotgym example. I am analyzing 4 metrics (jitter, average gap between packets, bandwidth, and frequency over 10 ms timeframes) encoded into the same SDR, and it works pretty well on our TSSDN network.
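To give you an idea of the setup, the encoding step looks roughly like this (a minimal sketch using htm.core's RDSE; the sizes, sparsities, and resolutions below are placeholder values, not my actual parameters):

from htm.bindings.sdr import SDR
from htm.encoders.rdse import RDSE, RDSE_Parameters

def make_encoder(resolution, seed):
    # One RDSE per metric. The size/sparsity/resolution values here
    # are placeholders, not the parameters from my thesis.
    p = RDSE_Parameters()
    p.size       = 400
    p.sparsity   = 0.02
    p.resolution = resolution
    p.seed       = seed
    return RDSE(p)

jitter_enc    = make_encoder(0.01, seed=1)
gap_enc       = make_encoder(0.05, seed=2)
bandwidth_enc = make_encoder(10.0, seed=3)
frequency_enc = make_encoder(1.0,  seed=4)

def encode_window(jitter, gap, bandwidth, frequency):
    # Encode each metric separately, then concatenate the four small
    # SDRs into the single SDR that feeds the spatial pooler.
    parts = [jitter_enc.encode(jitter),
             gap_enc.encode(gap),
             bandwidth_enc.encode(bandwidth),
             frequency_enc.encode(frequency)]
    encoding = SDR(sum(part.size for part in parts))
    encoding.concatenate(parts)
    return encoding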

One of my main confusions, though, is that the de-/serialization functions for storing and loading the TM/SP (saveToFile/loadFromFile) produce very different results depending on whether the model is trained, stored, and loaded again (with learning turned off afterwards), or simply runs in online unsupervised learning mode the whole time.
After many tests, it seems like the TM is the main culprit: it needs a few hundred iterations after loading before the anomaly score stops oscillating. Another observation is that the anomaly score is far less sensitive to the data after loading. Let me show you a few pictures to explain.

First, an image of the anomaly score oscillating when the TM is reloaded but does not get enough learning iterations before learn=False is set (forgive the many different values shadowing each other, but you should get the gist):

Now for the live vs. reloaded comparison:
This is normal online learning, with 3 anomalies (DoS attacks) in the data.

This is the reloaded TM/SP fed the same data, where the TM gets 900 startup iterations with learn=True.

I have done other tests where I trained on a clean dataset with no anomalies, then reloaded and afterwards analyzed the dataset including anomalies. That run used different parameters and showed a lot of noise on the training set, while showing adequate results after reloading. The same thing happens here: the model somehow gets desensitized after loading:

Learning on the clean dataset (the green values are the raw anomaly score, which I used for testing and comparing against tm.anomaly)

Analyzing anomalous data after reloading

I am not sure the parameters would be of any help, as it always seems to be the same difference in sensitivity. My main question is: why is there any difference at all? Is the serialization working properly, or is something going missing in the process?

Of course, for completeness, here are the lines that store/load the TM/SP:

storing:
self.sp.saveToFile(_TEST_DIR + '/sp_' + timestring + '.tmp')
self.tm.saveToFile(_TEST_DIR + '/tm_' + timestring + '.tmp')

loading:

sp_tmp = SpatialPooler()
SpatialPooler.loadFromFile(sp_tmp, _TEST_DIR + '/sp_' + self.parameters["application"]["model"] + '.tmp')
tm_tmp = TemporalMemory()
TemporalMemory.loadFromFile(tm_tmp, _TEST_DIR + '/tm_' + self.parameters["application"]["model"] + '.tmp')
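
And for completeness, this is roughly how the reloaded models are then driven with learning off (again a sketch, not my exact code; `encoding` is the concatenated input SDR for the current timestep):

from htm.bindings.sdr import SDR

# One inference step on the reloaded models, with learning disabled.
active_columns = SDR(sp_tmp.getColumnDimensions())
sp_tmp.compute(encoding, False, active_columns)  # SP with learn=False
tm_tmp.compute(active_columns, learn=False)      # TM with learn=False
score = tm_tmp.anomaly                           # raw anomaly score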

Hi Finn,

A few things:

  1. This sounds like a bug; serialization is probably not working correctly. All things being equal, saving and then loading the model should not change the results. To be clear: you’ve tested it with NO saving/loading, and it works correctly?

  2. Have you tried installing the HTM.Core library from the source code?

Thanks for reporting this issue!

Hey dmac, thanks for the reply.

To answer your questions:

This sounds like a bug; serialization is probably not working correctly. All things being equal, saving and then loading the model should not change the results. To be clear: you’ve tested it with NO saving/loading, and it works correctly?

Yes, I have tested it without any storing/loading. The results of such a test can be found in the 2nd picture of my 1st post. I am willing to use online learning only without storing, but it’s a matter I would have liked to discuss in my work since both options have their advantages/disadvantages.

Have you tried installing the HTM.Core library from the source code?

No, I haven’t. I installed the library from the 2.1.15 .whl on the GitHub release page and have just checked whether there were any substantial changes to the TM files since then. It seems like there weren’t any changes that would impact this behaviour, but I could have overlooked something.


I cannot reproduce this bug.

Starting from the latest source code on the GitHub page, I modified the hotgym example:

  1. I added the seed parameter to all algorithms so that it should be 100% deterministic (see the seeding sketch after this list).
  2. I added the following snippet of code to save & load the TM to/from file halfway through the test:
    if count == 1000:
      tm.saveToFile("tm_save.temp")
      tm2 = TemporalMemory()
      tm2.loadFromFile("tm_save.temp")
      tm = tm2
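
For reference, the seeding from step 1 looks roughly like this (an abbreviated sketch, not the full hotgym parameter set; omitted constructor arguments keep their defaults and the values shown are illustrative):

from htm.bindings.algorithms import TemporalMemory
from htm.encoders.rdse import RDSE, RDSE_Parameters

# The RDSE chooses its bit patterns randomly, so it needs a seed too.
params = RDSE_Parameters()
params.size       = 1000
params.sparsity   = 0.02
params.resolution = 0.88
params.seed       = 42
encoder = RDSE(params)

# A fixed seed makes the TM's random initialization reproducible across
# runs; the spatial pooler accepts a seed argument in the same way.
tm = TemporalMemory(columnDimensions=[2048], seed=42)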

And this yields identical results both with and without saving/loading the TM, down to all of the floating-point numbers printed at the end of the program:

Predictive Error (RMS) 1 steps ahead: 8.141706441154033
Predictive Error (RMS) 5 steps ahead: 8.945421905050706
Anomaly Mean 0.03885156511594449
Anomaly Std 0.1481963840587836

BTW, the TM algorithm is split between the TemporalMemory class and the Connections class.

Again, I would recommend trying to install the HTM.Core library from the source code. If that fixes your issue, let us know, and we will do another release with the latest sources.

I forgot to mention it, but I restart the application between runs in my example.
I have also tried reloading it once during a single run. When I did that, it didn’t make any difference to the result either, as if it had never been reloaded. Which is weird, because you’d think Python properly gets rid of any stored TM data when it is reassigned the way you did.

I have found a recent issue on GitHub where another user describes a similar problem, but it’s hard to understand how he tested it:

Yes, now I am able to reproduce the bug!

EDIT: Actually, this bug is a bit more complicated…


To reproduce this bug, I followed this procedure:

  1. Create a full HTM system (encoder, spatial pooler, temporal memory, classifier, anomaly tracker).
  2. Run the first half of the hotgym dataset through the HTM.
  3. Save the HTM to file. ← Problem is here!
  4. Exit the currently running process and restart python.
  5. Load the HTM from file.
  6. Run the second half of the hotgym dataset through the HTM. Observe that the HTM does not perform as well during the second half of the dataset as it did for the first half.

The problem here is that the HTM system cannot be 100% saved to file. The Predictor and AnomalyLikelihood cannot be saved to file despite the fact that they contain learned data. My guess is that you’re running a fully trained HTM with a brand new (untrained) classifier and AnomalyLikelihood instance.

Those classes are capable of being serialized in C++, but there are no Python bindings for saving/loading them. This is an oversight, and it should be relatively straightforward to fix.
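
In other words, after a restart the reload probably amounts to something like this (a sketch; the file names and constructor arguments are placeholders):

from htm.bindings.algorithms import SpatialPooler, TemporalMemory, Predictor

# The SP and TM come back fully trained...
sp = SpatialPooler()
sp.loadFromFile("sp.tmp")
tm = TemporalMemory()
tm.loadFromFile("tm.tmp")

# ...but the classifier starts over with no learned state, because it
# was never written to file. The anomaly-likelihood tracker gets
# recreated untrained in the same way.
predictor = Predictor(steps=[1, 5], alpha=0.1)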


@dmac I remember that we had these serialization problems 2 years ago, but we did not solve them.
I found it by testing MNIST applications: if I run inference directly after training, I get higher scores than if I save the model after learning and then load it for inference.


I’d forgotten that most of the htm.core library implements the pickle protocol, so you can serialize all of those classes (see “pickle — Python object serialization” in the Python 3.7 documentation). I was able to do this and show that the bug is fixed (at least for me).

import pickle

# Save the entire HTM system to file.
with open("save.tmp", "wb") as f:
    htm_system = (sp, tm, predictor, anomaly_history)
    pickle.dump(htm_system, f)

# Load the entire HTM system from file.
with open("save.tmp", "rb") as f:
    (sp, tm, predictor, anomaly_history) = pickle.load(f)
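
Note that pickling the whole tuple also covers the SP and TM, so the separate saveToFile/loadFromFile calls are no longer needed, and the learned state of the Predictor and the anomaly history is restored along with everything else.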

I hope this fixes your problem!


Wow, I didn’t expect such a simple solution :sweat_smile:

I have already decided to use online learning mode going forward, but I will add this for completeness. Thank you very much!
