I think removing the trend would be good if possible, because HTM does well with periodic data; if detrending makes the signal less drifty and more clearly periodic, that's a win.
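As a sketch of what I mean, first-order differencing is one simple way to remove a linear trend while keeping the periodic part of the signal (the toy series below is a made-up example, not your data):

```python
import numpy as np

# Toy series: linear upward trend plus a daily-ish periodic component
t = np.arange(200)
series = 0.5 * t + 10 * np.sin(2 * np.pi * t / 24)

# First-order differencing: each step becomes (value - previous value),
# which turns a linear trend into a constant offset and keeps the cycle
detrended = np.diff(series)
```

The differenced series hovers around the trend's slope (0.5 here) instead of drifting upward, which is the shape HTM tends to handle better.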
You certainly can’t fit 2000+ sensor values into a single model, so yes, one model per sensor is a better way to scale. If there are clusters of sensors that are highly related to each other, you could make fewer models that each contain several sensor values (for instance, 400 models with 5 sensors each). I think one model per sensor is the easiest way.
Also, if certain groups of sensors are measuring very similar things, there may be a lot of shared signal among them. If that’s true, you could drop whichever sensors are redundant, but that means some exploratory analysis beforehand.
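One quick way to do that exploratory pass is a pairwise correlation check over a sample of readings, flagging any sensor that is nearly a duplicate of an earlier one (the sensor names and the 0.95 threshold below are just illustrative choices):

```python
import numpy as np

rng = np.random.default_rng(0)
base = rng.normal(size=500)

# Hypothetical sample: s0 and s1 are near-duplicates, s2 is independent
sensors = {
    "s0": base,
    "s1": base + rng.normal(scale=0.01, size=500),
    "s2": rng.normal(size=500),
}

names = list(sensors)
data = np.column_stack([sensors[n] for n in names])
corr = np.corrcoef(data, rowvar=False)

# Flag any sensor highly correlated (|r| > 0.95) with an earlier one
redundant = set()
for i in range(len(names)):
    for j in range(i + 1, len(names)):
        if abs(corr[i, j]) > 0.95:
            redundant.add(names[j])

print(sorted(redundant))  # → ['s1']
```

Anything flagged here is a candidate to drop before you start spinning up models.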
I don’t have a precise answer for this one – it depends on how your environment is set up. Obviously, if you can run all these models in parallel it’ll be much, much faster.
There’s also the issue of setting encoder params for each sensor. Assuming they’re not all the same unit of measurement following the same distribution, each sensor should have custom encoder settings matched to its distribution.
I use a simple rule of thumb: sample the first x values for each sensor, calculate a custom min/max for each, and generate the encoder dict from that.
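Here's roughly what that rule of thumb looks like in code. The dict keys (`minval`/`maxval`/`resolution`) follow the general shape of NuPIC-style scalar encoder params, but treat them as a sketch rather than the exact API of any particular HTM library, and the padding and bucket count are assumptions you'd tune:

```python
import numpy as np


def encoder_params_from_sample(values, padding=0.2, buckets=130):
    """Derive per-sensor encoder settings from an initial sample.

    padding: fraction of the observed range added as headroom on each
    side, so values slightly outside the sample don't get clipped.
    buckets: rough number of distinct value ranges the encoder resolves.
    """
    lo, hi = float(np.min(values)), float(np.max(values))
    pad = (hi - lo) * padding
    minval, maxval = lo - pad, hi + pad
    return {
        "minval": minval,
        "maxval": maxval,
        "resolution": max((maxval - minval) / buckets, 1e-9),
    }


# Hypothetical: the first 1000 readings from one sensor
sample = np.random.default_rng(1).normal(loc=50, scale=5, size=1000)
params = encoder_params_from_sample(sample)
```

With 2000+ sensors you'd run this once per sensor over its initial sample and keep the resulting dicts keyed by sensor name.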
Happy anomaly detecting!