Misunderstood behavior of HTM

I am trying to use HTM to solve an anomaly detection problem.
I have a system that produces logs during its activation process. The system works in a cyclic manner: one process ends and another instance of the same process begins.
Each log contains 1000 samples, and each sample contains 10 different categorical features that represent the current system mode.
I train the HTM model on 40 different valid logs (first I train the SP, then the HTM itself). The training is done log after log (first with the first log, then with the second, etc.).
In the evaluation phase I take another 10 different valid logs and apply the trained HTM model to them.
I received strange results (working with the raw anomaly score, without a sliding window):

  1. In all 10 logs, the first sample received a high anomaly score.
  2. I received a high anomaly score in all 10 evaluation logs **at the same samples**, even though the logs differed from one another in most of the features.

Could you post the first few rows of the data you’re passing in, along with the model parameters file?


I think you mean you trained the SP first, then the TM, right? (“HTM” is more of an umbrella term for the full technology.)

This is not unexpected. The anomaly scores should go down overall as patterns are learned.
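To see why the very first sample always scores high: the raw anomaly score is the fraction of currently active columns that were not predicted from the previous step. A minimal sketch of just that formula (independent of NuPIC):

```python
def anomaly_score(active_columns, predicted_columns):
    """Fraction of active columns that were not predicted at the previous step."""
    active = set(active_columns)
    if not active:
        return 0.0
    return len(active - set(predicted_columns)) / float(len(active))

# At the first sample of a sequence nothing was predicted yet,
# so every active column counts as unexpected:
print(anomaly_score({2, 5, 9}, set()))      # 1.0
print(anomaly_score({2, 5, 9}, {2, 5, 9}))  # 0.0 when fully predicted
```

This is why observation 1 above is expected: at the start of each log there is no prior context, so nothing is predicted and the score is 1.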


I think you mean you trained the SP first, then the TM, right?

Yes, you are right, you caught me…

The anomaly scores should go down overall as patterns are learned.

The anomaly scores appear in the validation phase. I mean: I finished training the SP and the TM, and then, after training was complete, I received the high anomaly scores at the same positions during validation, on valid logs.

I don’t understand the phases of your training. With temporal models like HTM, you don’t have different training vs testing vs validation data. You simply have one data stream representing reality, and you can attach the HTM to it and turn learning on to learn. The model may be trained on any part of the data stream, but must be sequential.

I suggest you choose a number of data points to train HTM models on, and always start evaluating a model only after it has seen that many points.


I first want to examine HTM’s capability offline.
So I take 40 different logs representing 40 different valid runs of the process (each containing 1000 records). I train the SP and the TM on this dataset (learning=true).
Then I run 10 different valid streams with learning=false through the SP & TM that I trained above…
And I receive the same high anomaly score every time at the same position (different scores, but at the same positions).
I created this validation process to characterize the error on valid data.
So what am I doing wrong? Why can’t I train the SP & TM offline, with different sequences of the same process (with different variations)?

What do you mean by “offline”? This is not a term we use for HTM. It is always an “online learning system”, meaning it learns with each new piece of temporal data, like brains.

When you set learning=false, you prevent learning; is that what you mean? In that case the way HTM works does not change, except that synapse permanence values are never updated. Everything else works the same, but the HTM will no longer train.
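As a toy illustration of that last point (this is not NuPIC’s internal code, just the idea): inference always runs, and the learning flag only gates permanence updates.

```python
class ToySynapse:
    """Toy model of one synapse: the learn flag only gates permanence updates."""
    def __init__(self, permanence=0.5, threshold=0.5, increment=0.1):
        self.permanence = permanence
        self.threshold = threshold
        self.increment = increment

    def step(self, active, learn):
        connected = self.permanence >= self.threshold  # inference always happens
        if learn and active:                           # update only when learning
            self.permanence = min(1.0, self.permanence + self.increment)
        return connected

syn = ToySynapse()
syn.step(active=True, learn=False)
print(syn.permanence)   # unchanged: 0.5
syn.step(active=True, learn=True)
print(syn.permanence)   # reinforced toward 0.6
```

With learn=False the synapse still participates in inference exactly as before; its state simply stops moving.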

Could you post the first few lines of the data you’re passing in? There could be a mismatch between the way you imagine the experiment and what’s actually happening in NuPIC.


By “offline” I mean that I have logs containing features that describe the system mode and behavior.
The system works in cycles of 5 minutes (each log describes one cycle).
Each cycle describes the same process (you can think of each cycle, for example, as producing an item, playing a card game, or even a chess game).
In each cycle the same process is performed, but the system behavior and the internal system mode can differ at each step.
So I want to train the HTM model with all the available logs, so that the HTM is exposed to as much information as possible and becomes familiar with all the system patterns.
I want to check the ability of the system to recognize anomalies in the internal system mode by analyzing the anomaly score.
So I don’t understand: where is the problem in my methodology?


These statements contradict each other, don’t they?

If there is one process, and you have lots of data representing this process occurring over and over, this is a good dataset for training an HTM model. The model will learn the structure of the temporal process, and as you give it new process data, it should be able to return useful anomaly indications. But you must reset the temporal memory algorithm at the end of each process.
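Sketched as a training loop (the `Model` class here is a hypothetical stand-in, not the NuPIC API; it only shows where the resets go):

```python
class Model:
    """Hypothetical stand-in for an HTM model, counting what happens to it."""
    def __init__(self):
        self.resets = 0
        self.records_seen = 0
    def run(self, record, learn):
        self.records_seen += 1      # feed one record through SP + TM
    def reset(self):
        self.resets += 1            # clear active/predictive cell state

def train(model, logs):
    for log in logs:                # e.g. 40 valid logs, one process each
        for record in log:
            model.run(record, learn=True)
        model.reset()               # mark the end of the process

model = Model()
train(model, [[1, 2, 3], [1, 2, 4]])
print(model.resets)   # one reset per log
```

The same structure applies in the evaluation phase: run each valid log through the trained model, and reset at the end of each one.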

If you have many different processes that each need to be learned, you need to train one HTM model on each process.


Sounds interesting and I don’t know if there are any problems in your methodology, however I can’t thoroughly check without seeing at least 1 line of data and the accompanying model parameters. It could be that your research concept is perfectly valid, though something in the implementation details is messing up the results.


These statements contradict each other, don’t they?

No, it is the same process (for example a poker game or a chess game), but with different values in each feature.

But you must reset the temporal memory algorithm

Thanks for your recommendation; I will add this step and report back on the results.
P.S.: Can you please explain what happens in the TM’s internal state when we issue the reset command?

It basically knocks the TM out of predicting the current sequence by emptying its sets of predictive and active cells. Calling a reset means that all SP columns will burst on the next time step and the anomaly score will be 1. This is used in cases where the total sequence is composed of sub-sequences with clear ending points, so we know how long these sequences are. Take the following sequence for instance:

A → B → C → D
X → B → C → Y
A → E → F → G
X → E → F → G

If we want to frame this as a set of separate sequences, we’d call reset after ‘D’, ‘Y’, and both ‘G’s. This way the TM doesn’t learn the transitions from the last element of each subsequence to the first element of the next. It also means that the first element of any subsequence will always burst. @rhyolight shows a great example using resets on simple melodic sequences in HTM School (about 4:20 in):
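The effect can be sketched with a toy first-order sequence memory (not the real TM, just the reset logic): without the resets the model would also learn boundary transitions like D → X, and with them it does not, so the first element after a reset always arrives unpredicted.

```python
from collections import defaultdict

class ToySequenceMemory:
    """Toy first-order sequence memory illustrating why resets matter."""
    def __init__(self):
        self.transitions = defaultdict(set)  # symbol -> symbols seen after it
        self.prev = None
    def step(self, symbol, learn=True):
        # "Predicted" means the previous symbol has led here before.
        predicted = self.prev is not None and symbol in self.transitions[self.prev]
        if learn and self.prev is not None:
            self.transitions[self.prev].add(symbol)
        self.prev = symbol
        return predicted
    def reset(self):
        self.prev = None   # next symbol cannot be predicted -> it "bursts"

tm = ToySequenceMemory()
for seq in ["ABCD", "XBCY", "AEFG", "XEFG"]:
    for s in seq:
        tm.step(s)
    tm.reset()             # called after 'D', 'Y', and both 'G's

# The boundary transition D -> X was never learned, but B -> C was:
print('X' in tm.transitions['D'])   # False
print('C' in tm.transitions['B'])   # True
```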

The reset function:

  def reset(self):
    """
    Reset the state of all cells.

    This is normally used between sequences while training. All internal states
    are reset to 0.
    """
    if self.verbosity >= 3:
      print "\n==== RESET ====="

    # Flush the segment update queue
    self.segmentUpdates = {}

    self._internalStats['nInfersSinceReset'] = 0

    #To be removed
    self._internalStats['curPredictionScore'] = 0
    #New prediction score
    self._internalStats['curPredictionScore2']   = 0
    self._internalStats['curFalseNegativeScore'] = 0
    self._internalStats['curFalsePositiveScore'] = 0

    self._internalStats['curMissing'] = 0
    self._internalStats['curExtra'] = 0

    # When a reset occurs, set prevSequenceSignature to the signature of the
    # just-completed sequence and start accumulating histogram for the next
    # sequence.
    self._internalStats['prevSequenceSignature'] = None
    if self.collectSequenceStats:
      if self._internalStats['confHistogram'].sum() > 0:
        sig = self._internalStats['confHistogram'].copy()
        sig.reshape(self.numberOfCols * self.cellsPerColumn)
        self._internalStats['prevSequenceSignature'] = sig

    self.resetCalled = True

    # Clear out input history
    self._prevInfPatterns = []
    self._prevLrnPatterns = []