What should I do about gaps of varius interval in the timestamps of my CSV?

rhyolight · April 30, 2018, 3:42pm

Thanks for the kind words.

Probably not. Remember that time is part of the encoding, so interval won’t really matter. There is only one dimension of time for HTM, same as us… the dimension of each moment coming after the previous one. But this dimension is not necessarily linear. As you have noted, sensors may not have regular sampling intervals when gather data. Chemicals in the brain may also affect how much energy is dedicated to processing certain important moments (fight or flight).

For extracting predictions, however, random intervals makes the problem much harder, because have to think about these moments and predict what comes next. To do this, we have to understand the function of time in the equation. If it is linear (all data points are equally spaced), this makes the problem easy. But if there is any randomness, the problem becomes much, much harder.

Don’t use swarming if you are doing scalar anomaly detection. Swarming is tuned to a predicted field, and getting the best predicted values for that particular field. It doesn’t translate well to anomaly detection, IMO. We have a set of model params we use for scalar anomaly detection, I would start there.

You can pre-process your data by aggregating it into a regular interval. There are many ways to do this, which are outside the scope of this conversation. (If someone has advice, please break off a thread from this post.)

But if you just want anomalies, it won’t be a problem. I just ran your data through HTM Studio, and it seems to work:

At least I assumed this was the anomaly you were expecting it to find? It saw it both in volume and close.

Topic		Replies	Views
HTM for fast moving dataset Getting Started question	2	571	April 6, 2020
Don't swarm for Anomaly models NuPIC swarming , anomaly-detection	16	5335	October 17, 2019
Anomaly detection - params and questions / definition of terms HTM.Java	3	980	May 13, 2017
Anomaly detection with HTM.core model Machine Learning question	1	1000	October 1, 2022
Anomaly detection with HTM.core model on sine Machine Learning question	14	1902	December 8, 2021

What should I do about gaps of varius interval in the timestamps of my CSV?

Related topics