HTM for fast moving dataset

khan · April 6, 2020, 2:40pm

I have a question related to applying HTM on real data. What if my data is not exactly regularly spaced, ie.e the measurements are coming after every 30sec approximately, sometimes (29sec,30sec,31 sec)) and comes from a real process and has an obvious daily seasonal pattern. Is it required to be aggregated up to certain extend? Thanks

sheiser1 · April 6, 2020, 4:47pm

Hey @khan welcome!

It shouldn’t be a problem as long as the intervals vary only slightly, but it may be a good idea to aggregate to a larger time interval anyway. The system will learn pattens faster the shorter they are, so if there’s a daily pattern that can observed by sampling every 30 minutes it will be learned much faster than if sampled every 30 seconds. If there are different patterns only visible minute by minute thats why you’d keep the 30 second sample rate, but if larger time chunks are still viable I’d do that.

Here’s the source for the datetime encoder. It’s not set by default to handle sub-minute sampling, so if do keep the 30-second rate it would need some tweaking (in the ScalarEncoder it calls on line 226), or even dropping that encoder entirely and feeding in only the raw data.

github.com

numenta/nupic/blob/master/src/nupic/encoders/date.py

# ----------------------------------------------------------------------
# Numenta Platform for Intelligent Computing (NuPIC)
# Copyright (C) 2013, Numenta, Inc.  Unless you have an agreement
# with Numenta, Inc., for a separate license for this software code, the
# following terms and conditions apply:
#
# This program is free software: you can redistribute it and/or modify
# it under the terms of the GNU Affero Public License version 3 as
# published by the Free Software Foundation.
#
# This program is distributed in the hope that it will be useful,
# but WITHOUT ANY WARRANTY; without even the implied warranty of
# MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.
# See the GNU Affero Public License for more details.
#
# You should have received a copy of the GNU Affero Public License
# along with this program.  If not, see http://www.gnu.org/licenses.
#
# http://numenta.org/licenses/
# ----------------------------------------------------------------------

This file has been truncated. show original

rhyolight · April 6, 2020, 9:14pm

Here is a relevant discussion:

Topic		Replies	Views
Why can't I encode time in very small increments with DateEncoder? NuPIC encoders , question , data	9	1790	October 3, 2016
Data sampling frequency/interval questions NuPIC	2	365	April 8, 2019
Would HTM be good for anomaly detection in a sensor network? Getting Started anomaly-detection , question	4	1118	February 19, 2020
Data aggregation NuPIC	11	1254	March 15, 2018
Getting started, looking for tips Getting Started question	2	804	May 8, 2020

HTM for fast moving dataset

Related topics