Anomaly Detection: Optimizing Parameters for Telemetry Data

userProfessor · July 9, 2021, 4:53pm

I am trying to run the HTM anomaly detection algorithm from the NuPIC library in Python. I have datasets spanning 22 days that contain telemetry information about devices. I found a hotgym implementation of HTM online, but I can't work out which parameters to tweak to fit my data.
Additionally, is the anomaly score the same as the anomaly likelihood? If not, is there an example (sample implementation) of anomaly likelihood that I could refer to?

Params I am running anomaly detection with:
'anomalyCacheRecords': None,
'autoDetectThreshold': None,
'autoDetectWaitRecords': 2184

trainSPNetOnlyIfRequested: False

dmac · July 10, 2021, 12:31pm

Usually the anomaly score refers to the raw metric, as reported by the temporal-memory code.

The anomaly likelihood function takes the raw anomaly scores and applies some statistics to them to determine where the anomalies actually are. It fits a normal distribution to the past anomaly scores and measures how far above that distribution each new anomaly score falls.
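
As a rough illustration of that idea, here is a minimal sketch in plain Python. It is not the NuPIC or htm.core implementation: the class name, the window sizes, and the warm-up value of 0.5 are all assumptions made for the example. It fits a Gaussian to a long window of raw scores and asks how far into the upper tail the average of the most recent scores falls.

```python
import math
from collections import deque


class SimpleAnomalyLikelihood:
    """Illustrative only: not the NuPIC or htm.core implementation."""

    def __init__(self, long_window=200, short_window=10, min_std=1e-3):
        self.history = deque(maxlen=long_window)  # scores used for the Gaussian fit
        self.recent = deque(maxlen=short_window)  # scores used for the moving average
        self.min_std = min_std                    # floor to avoid division by zero

    def update(self, raw_score):
        """Add one raw anomaly score in [0, 1] and return a likelihood in [0, 1]."""
        self.history.append(raw_score)
        self.recent.append(raw_score)
        if len(self.history) < self.history.maxlen:
            return 0.5  # assumed neutral value until the long window is full

        mean = sum(self.history) / len(self.history)
        variance = sum((s - mean) ** 2 for s in self.history) / len(self.history)
        std = max(math.sqrt(variance), self.min_std)

        recent_mean = sum(self.recent) / len(self.recent)
        z = (recent_mean - mean) / std
        # Gaussian CDF: share of the fitted distribution lying below recent_mean.
        return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))
```

Feeding it a long run of low scores followed by a burst of high scores should push the returned value close to 1, while a steady stream stays near 0.5.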

sheiser1 · July 11, 2021, 9:38pm

Hey @userProfessor,

Welcome!

One great thing about HTM is that there is usually no need to tweak hyperparams! I would leave those alone for now, because in my experience the relevant tweaking is around:

- Which feature(s) to include in the model
- What the encoder param values should be (min/max for the Scalar encoder, resolution for the RDSE)
- What the anomaly likelihood window sizes should be

The anomaly likelihood comes from comparing a smaller recent window of anomaly scores to a larger window spanning further back. If the distribution of recent anomaly scores deviates more from the larger distribution, the anomaly likelihood is higher, as the system's predictability appears to be changing.

So higher anomaly likelihoods signify recent changes in predictability, whether getting more or less predictable. The anomaly score, however, is just a measure of how predictable one time step was (0 being perfectly predictable and 1 perfectly unpredictable).

If you could show your current implementation and maybe a small snippet of data, that would help too.

userProfessor · July 12, 2021, 9:49pm

This is a sample of the data I am using:

Model params: Since our dataset resembles the CPU example data, the model params are the same as: nupic/model_params.py at master · numenta/nupic · GitHub
Questions: Additionally, is there documentation available on the parameter details? It would help to understand them in case I need to tweak values.

And, is there an existing implementation of Anomaly likelihood that I could refer to?

dmac · July 16, 2021, 5:40pm

Hi,

I probably should have shared this sooner…

I re-implemented the anomaly likelihood class for htm.core because the original NuPIC implementation was a mess. My new version is much shorter and easier to comprehend.

Anyway, it's in the htm.core GitHub repo on a branch named anomaly_likelihood_rewrite:

github.com

htm-community/htm.core/blob/anomaly_likelihood_rewrite/py/htm/algorithms/anomaly_likelihood.py

# ----------------------------------------------------------------------
# HTM Community Edition of NuPIC
# Copyright (C) 2014-2016, Numenta, Inc.
#               2021, David McDougall
#
# This program is free software: you can redistribute it and/or modify
# it under the terms of the GNU Affero Public License version 3 as
# published by the Free Software Foundation.
#
# This program is distributed in the hope that it will be useful,
# but WITHOUT ANY WARRANTY; without even the implied warranty of
# MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.
# See the GNU Affero Public License for more details.
#
# You should have received a copy of the GNU Affero Public License
# along with this program.  If not, see http://www.gnu.org/licenses.
# ----------------------------------------------------------------------

import math


I kind of abandoned this work though…
The original NuPIC implementation contained a number of workarounds (hacks) to compensate for known issues with HTM / NuPIC. The score on NAB went down when I tried this re-implementation, and I think it's because I removed those hacks.

dmac · July 16, 2021, 6:27pm

Here is my advice for getting your thing to work.

- If you are using either NuPIC's or htm.core's anomaly likelihood code, then try the file I just posted.
- You will want to use an automatic parameter search.
  - This will only help if your program fundamentally works. It won't fix bugs.
  - Doing a parameter search manually, with pen and paper, can teach you a lot about what the parameters do, but it is time-consuming.
  - Therefore, you will need a program to search automatically in order to find the best parameters.
  - Something to be aware of: there is an element of randomness in HTMs. So when you measure their performance, you won't get one single score; rather, there is a distribution of scores that you're sampling from.
- There are ways to measure the quality of an SDR, so you can check whether each piece of the system is working. If you are using htm.core, then look for the class htm.Metrics.
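
The parameter-search advice above could look something like the following sketch. It is a plain random search, not a tool from NuPIC or htm.core; `random_search`, `toy_evaluate`, and the `resolution` parameter are hypothetical names chosen for illustration. Each candidate is scored under several seeds so that you compare the mean and spread of the score distribution rather than a single sample.

```python
import random
import statistics


def random_search(evaluate, space, n_candidates=20, n_seeds=5, rng_seed=0):
    """Sample parameter sets uniformly and score each one under several seeds.

    evaluate(params, seed) must return a score where higher is better.
    space maps a parameter name to a (low, high) range.
    """
    rng = random.Random(rng_seed)
    results = []
    for _ in range(n_candidates):
        params = {name: rng.uniform(lo, hi) for name, (lo, hi) in space.items()}
        scores = [evaluate(params, seed) for seed in range(n_seeds)]
        results.append((statistics.mean(scores), statistics.stdev(scores), params))
    # Best mean score first; the standard deviation shows how noisy each result is.
    results.sort(key=lambda r: r[0], reverse=True)
    return results


def toy_evaluate(params, seed):
    """Stand-in for 'build the model, run it on the data, return a score'."""
    noise = random.Random(f"{seed}-{params['resolution']}").gauss(0.0, 0.05)
    return -(params["resolution"] - 0.5) ** 2 + noise


if __name__ == "__main__":
    ranked = random_search(toy_evaluate, {"resolution": (0.0, 1.0)})
    for mean, std, params in ranked[:3]:
        print(f"mean={mean:.3f} std={std:.3f} params={params}")
```

In a real run, `toy_evaluate` would be replaced by a function that trains the HTM model with the given parameters and seed and returns whatever score you care about (a NAB score, for instance). If two candidates differ by less than their standard deviations, the difference is probably noise.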

The Metrics class will measure an SDR and print out the following table of info about the SDR:

    Sparsity Min/Mean/Std/Max 0.2 / 0.199993 / 1.70916e-05 / 0.2
    Activation Frequency Min/Mean/Std/Max 0 / 0.2 / 0.149421 / 0.714286
    Entropy 0.83246
    Overlap Min/Mean/Std/Max 0.175 / 0.205824 / 0.0262107 / 0.245

The Activation Frequency is measured for each bit of the SDR. Use this to see if some bits are stuck off (active-freq == 0) or stuck on (active-freq == 1).
The entropy is the binary entropy of the activation frequencies, normalized into the range [0, 1], where 1 is the maximum possible value.
A higher entropy means that the bits of the SDR are being utilized more equally.
The overlap is the fraction of 1's that stay the same between consecutive assignments to the SDR.
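
To make those definitions concrete, here is a small pure-Python sketch that computes the same kinds of statistics from a list of binary vectors. It does not call htm.core, and two details are assumptions on my part: the entropy is normalized by the entropy every bit would have at the mean activation frequency, and the overlap is divided by the larger of the two active-bit counts.

```python
import math


def binary_entropy(p):
    """Entropy in bits of a coin that lands heads with probability p."""
    if p <= 0.0 or p >= 1.0:
        return 0.0
    return -p * math.log2(p) - (1.0 - p) * math.log2(1.0 - p)


def sdr_metrics(sdrs):
    """sdrs: list of equal-length lists of 0/1 values, one list per time step."""
    n_steps, n_bits = len(sdrs), len(sdrs[0])
    sparsity = [sum(s) / n_bits for s in sdrs]
    # How often each bit was active across all time steps.
    act_freq = [sum(s[i] for s in sdrs) / n_steps for i in range(n_bits)]

    # Assumed normalization: best case is every bit firing at the mean frequency.
    max_entropy = binary_entropy(sum(act_freq) / n_bits)
    mean_entropy = sum(binary_entropy(f) for f in act_freq) / n_bits
    entropy = mean_entropy / max_entropy if max_entropy > 0 else 0.0

    overlaps = []
    for prev, cur in zip(sdrs, sdrs[1:]):
        shared = sum(a & b for a, b in zip(prev, cur))
        denom = max(sum(prev), sum(cur))
        overlaps.append(shared / denom if denom else 0.0)

    return {
        "sparsity": sparsity,
        "activation_frequency": act_freq,
        "entropy": entropy,
        "overlap": overlaps,
        "stuck_off": [i for i, f in enumerate(act_freq) if f == 0.0],
        "stuck_on": [i for i, f in enumerate(act_freq) if f == 1.0],
    }
```

Passing in the spatial pooler's output for a few hundred time steps and looking at `stuck_off`, `stuck_on`, and `entropy` gives a quick health check similar in spirit to the table above.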

sheiser1 · July 16, 2021, 7:06pm

Hey @userProfessor ,

github.com

numenta/nupic-legacy/blob/master/examples/opf/clients/hotgym/anomaly/one_gym/README.md

# One Hot Gym Anomaly Tutorial

The program in this folder is the complete source code for the "One Hot Gym Anomaly" Tutorial. You can follow along with the construction of this tutorial's source code in the screencast below.

[![One Hot Gym Anomaly Tutorial Screencast](http://img.youtube.com/vi/1fU2Mw_l7ro/hqdefault.jpg)](http://www.youtube.com/watch?v=1fU2Mw_l7ro)

## Premise

The "hot gym" sample application has been around for a long time, and was one of the first real-world applications of NuPIC that actually worked. The data used is real energy consumption data from a gym in Australia. It is aggregated hourly already, so the input file at [rec-center-hourly.csv](rec-center-hourly.csv) simply contains a timestamp and float value for energy consumption during that hour.

This tutorial picks up where the [One Hot Gym Prediction Tutorial](../../prediction/one_gym/README.md) left off, and shows users how to convert their prediction model into an anomaly detection model.

## How to Run

To run and output data to a local file:

    ./run.py

To run and output data to a **matplotlib** graph:


That model params file shows a Scalar encoder with min/max = 0/100. I would definitely recommend checking your data against those – like a histogram of the data w/vertical lines at 0 & 100.

By default I set the min/max using percentiles of a data sample (maybe 1st/99th or 5th/95th depending on the distribution). You want the bulk of your data comfortably between the min and max, so hopefully the distribution is normal-ish or uniform-ish (to make that easy).

The Scalar encoder sees all values above the max as the max, and all values below the min as the min. So if your data falls outside the bounds, you're losing tons of information on the way from raw data to SDR.
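
Here is one way the percentile approach could be sketched in plain Python. The function names are hypothetical and nothing here comes from NuPIC; the idea is just to derive candidate min/max values from a data sample and to check how much of the data a given pair of bounds would clip.

```python
def percentile(sorted_values, q):
    """Percentile q (0-100) of an already sorted list, with linear interpolation."""
    if not sorted_values:
        raise ValueError("need at least one value")
    pos = (len(sorted_values) - 1) * q / 100.0
    lo_idx = int(pos)
    hi_idx = min(lo_idx + 1, len(sorted_values) - 1)
    frac = pos - lo_idx
    return sorted_values[lo_idx] * (1.0 - frac) + sorted_values[hi_idx] * frac


def encoder_bounds(values, low_pct=1.0, high_pct=99.0):
    """Suggest (min, max) for a scalar encoder from a sample of the data."""
    ordered = sorted(values)
    return percentile(ordered, low_pct), percentile(ordered, high_pct)


def clipped_fraction(values, min_val, max_val):
    """Fraction of values the encoder would squash onto min_val or max_val."""
    outside = sum(1 for v in values if v < min_val or v > max_val)
    return outside / len(values)
```

For example, `clipped_fraction(my_values, 0, 100)` tells you how much of your telemetry would be flattened by the 0/100 bounds in the CPU example params, and `encoder_bounds(my_values)` suggests replacements if that fraction is large.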

There is some description of the TM parameters at the top of this file, where the Temporal Memory class is defined:

github.com

numenta/nupic-legacy/blob/master/src/nupic/algorithms/temporal_memory.py

# ----------------------------------------------------------------------
# Numenta Platform for Intelligent Computing (NuPIC)
# Copyright (C) 2014-2016, Numenta, Inc.  Unless you have an agreement
# with Numenta, Inc., for a separate license for this software code, the
# following terms and conditions apply:
#
# This program is free software: you can redistribute it and/or modify
# it under the terms of the GNU Affero Public License version 3 as
# published by the Free Software Foundation.
#
# This program is distributed in the hope that it will be useful,
# but WITHOUT ANY WARRANTY; without even the implied warranty of
# MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.
# See the GNU Affero Public License for more details.
#
# You should have received a copy of the GNU Affero Public License
# along with this program.  If not, see http://www.gnu.org/licenses.
#
# http://numenta.org/licenses/
# ----------------------------------------------------------------------


Hope this helps!

userProfessor · July 21, 2021, 7:06am

How much do the timestamp params impact the performance of the model?
For example, the hotgym example specifies weekdays and weekends in the model_params. I was wondering whether that drastically affects how well the model performs.

dmac · July 21, 2021, 1:31pm

For the hotgym example the timestamps are important. The hotgym has events which happen at the same time every day/week and I think that some of the anomalies are when the gym opens early/late. Without supplying the timestamps it would be very difficult to detect such anomalies.
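
To illustrate what timestamp information means here, the sketch below derives a time-of-day value and a weekend flag from a timestamp. These are the kinds of features a date encoder would turn into bits alongside the metric value; the function and key names are made up for this example and are not NuPIC model_params keys.

```python
from datetime import datetime


def time_features(timestamp):
    """Derive simple calendar features from a datetime (illustrative names)."""
    return {
        # Hour of the day as a float, e.g. 06:30 -> 6.5
        "time_of_day": timestamp.hour + timestamp.minute / 60.0,
        # Monday is 0 and Sunday is 6
        "day_of_week": timestamp.weekday(),
        "is_weekend": timestamp.weekday() >= 5,
    }


if __name__ == "__main__":
    print(time_features(datetime(2021, 7, 10, 6, 30)))
```

If your telemetry has no daily or weekly rhythm, these features may add little, and it could be worth comparing runs with and without them.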
