Supervised multivarient Anomaly Detaction by using HTM

Aliraza · July 24, 2020, 10:45am

Hi, I am new to HTM and start learning HTM, I have a multivariant labeled dataset and want to apply HTM by using SP, TM. I saw many examples like HOTGYM.py, but I didn’t find any example which can help for multivariant supervised anomaly detection.

I need help to design an HTM model for my dataset and find an F1-score.

(Note: In hotgym.py the tm.anomaly function returns anomaly score in floating-point 0 to 1. How can I get an anomaly score in the form of binary?, I already applied threshold function. but it gives bad results)
Snippet of data
Quick reply will be higly apprciated

sheiser1 · July 24, 2020, 5:56pm

The HotGym data set is technically multivariate, if you include the datetime encoder. This is all set in the model params, in the “encoders” dict. That’s where you’ll see “consumption” with a ScalarEncoder, along with “timestamp_timeOfDay” with a DateEncoder. To add more metric fields like “consumption” just add another sub-dictionary within “encoders”.

github.com

numenta/nupic-legacy/blob/master/examples/opf/clients/hotgym/anomaly/model_params.py

# ----------------------------------------------------------------------
# Numenta Platform for Intelligent Computing (NuPIC)
# Copyright (C) 2013, Numenta, Inc.  Unless you have an agreement
# with Numenta, Inc., for a separate license for this software code, the
# following terms and conditions apply:
#
# This program is free software: you can redistribute it and/or modify
# it under the terms of the GNU Affero Public License version 3 as
# published by the Free Software Foundation.
#
# This program is distributed in the hope that it will be useful,
# but WITHOUT ANY WARRANTY; without even the implied warranty of
# MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.
# See the GNU Affero Public License for more details.
#
# You should have received a copy of the GNU Affero Public License
# along with this program.  If not, see http://www.gnu.org/licenses.
#
# http://numenta.org/licenses/
# ----------------------------------------------------------------------

This file has been truncated. show original

This is what the Anomaly Likelihood is for. It’s a post processing that look for shifts in the distribution of anomaly scores. This value is what is recommended to threshold, not the raw anomaly score.

I’d highly recommend looking into NAB (Numenta Anomaly Benchmark). It’s sort of like an F1 score but designed for real time anomaly detection, rewarding early detection and punishing false positives based on labeled ground-truth anomalies. Numenta applied this scoring system to a bunch of algorithms on like 60 datasets, so its well tested and the code is also open source.

Aliraza · July 25, 2020, 4:22pm

Thanks for reply. I will check and update about my experiments.

Topic		Replies	Views
How to approach anomaly detection in htm.core with multivariate data NuPIC Community Fork question	6	652	June 26, 2021
Anomaly Detection for Multivariate TimeSeries Data NAB question	2	1697	December 31, 2018
Can HTM be used on multivarient time series problems? Engineering	43	4433	February 27, 2021
Anamoly detection with HTM NuPIC	2	819	January 22, 2018
Question about htm.core anomaly detection NuPIC Community Fork anomaly-detection , question , htm-core	6	942	June 25, 2021

Supervised multivarient Anomaly Detaction by using HTM

Related topics