Why maxBoost is always 2.0?

Setus · May 25, 2016, 9:28am

Hi, I’ve talked about this before on gitter, I’ve used swarming on a good deal of test data and the swarm’s generated model_params.py file always assigned maxBoost the value 2.0. Once the model_params.py file was fed into the OPF and run on that same test data, the predicted and anomaly results were always worse than when I manually reassigned maxBoost to the value 1.0. Any word on what that might be? And could someone please explain what maxBoost does? I struggle to find any documentation about that parameter.

breznak · May 25, 2016, 9:41am

maxBoost controls “boosting”, an artificial process of supporting columns that are becoming irrelevant to help achieving distributed-ness etc. And the process is broken, you will be able to find Issues on Github from me about it.

Setting it to 1 effectively disables boosting (the computation is: newValue = columnValue*boostFactor; where boostFactor >=1 & <= maxBoost). Your swarming is “strange”, many common settings (eg NAB benchmarks) use boosting disabled.

OT: boosting should be either fixed or deprecated.

Setus · May 25, 2016, 9:48am

Ah, ok, thank you very much. Glad to hear that the problem is not some setup error on my part.

alavin · May 25, 2016, 1:49pm

Boosting is utilized in the Spatial Pooler (SP) to increase the activity of unused columns; the SP tries to use all of its columns. The boost value for a column is dynamically determined by how often a column is active relative to its neighbors, where the values fall between 1.0 and maxBoost for any given column. This boost value is multiplied by the column’s overlap with the inputs.

There are two boosting mechanism in SP learning: (1) if a column doesn’t become active enough, its boost value is increased, and (2) if a column’s connected synapses don’t overlap well with the input often, the synapse permanences are boosted. Both of these help columns learn connections to the input space, increasing the overlap for inactive columns.

maxBoost is usually set to 1.0. Scenarios with complex inputs, like vision, may call for higher values.

Hope this helps!

rhyolight · May 25, 2016, 2:52pm

So this still demands an answer… why does swarming result in the wrong boosting value? Is there anything we should do about it?

alavin · May 25, 2016, 3:42pm

Before we decide to “do something about it” we should determine if this is indeed true and if this is a problem.

subutai · May 25, 2016, 4:06pm

maxBoost is described in the Spatial Pooler pseudocode [1]. As @alavin mentions at the end of his post, boosting is, by definition, only useful for really long and complex data sources where you need to optimally allocate the SP columns. For most simple streaming tasks it can be set to 1.0.

To my knowledge, swarming does not explore different values of boosting, just uses a default value of 2.0. I think that default should be set to 1.0. There is an existing NuPIC ticket [2] to improve the various swarm parameters, and this should be one of them. There are other parameters that could be improved in swarming.

[1] http://numenta.com/assets/pdf/biological-and-machine-intelligence/0.4/BaMI-Spatial-Pooler.pdf
[2] https://github.com/numenta/nupic/issues/2829

rhyolight · May 25, 2016, 4:14pm

Thanks Subutai… I just created this subtask:

github.com/numenta/nupic-legacy

Always set default value for maxBoost to 1.0

opened 04:12PM - 25 May 16 UTC

closed 11:43PM - 09 Jun 16 UTC

rhyolight

type:bug subject:swarming priority:2

As discussed by @subutai on [HTM Forum](https://discourse.numenta.org/t/why-maxb…oost-is-always-2-0/560/7?u=rhyolight): > maxBoost is described in the Spatial Pooler pseudocode [1]. As Alex Lavin mentions at the end of his post, boosting is, by definition, only useful for really long and complex data sources where you need to optimally allocate the SP columns. For most simple streaming tasks it can be set to 1.0. > > To my knowledge, swarming does not explore different values of boosting, just uses a default value of 2.0. I think that default should be set to 1.0. There is an existing NuPIC ticket [2] to improve the various swarm parameters, and this should be one of them. There are other parameters that could be improved in swarming. > > [1] http://numenta.com/assets/pdf/biological-and-machine-intelligence/0.4/BaMI-Spatial-Pooler.pdf > [2] https://github.com/numenta/nupic/issues/2829

And updated:

github.com/numenta/nupic-legacy

Improve default swarm parameters

opened 12:00AM - 15 Dec 15 UTC

subutai

subject:algorithms subject:swarming

We have learned quite a bit about parameters and their useful ranges. We should …update the swarm parameter ranges to incorporate the latest knowledge from the past year. This issue involves reviewing the existing swarm parameter setting logic, updating them as necessary, and testing on existing examples. --- - [x] [Always set default value for maxBoost to 1.0](https://github.com/numenta/nupic/issues/3144) - [ ] [Test specific parameter combinations for swarming](https://github.com/numenta/nupic/issues/3163)

This would be a super easy newbie issue for someone to contribute…

subutai · May 25, 2016, 4:16pm

Maybe. They need to be guided as to what the actual parameter settings should be and then they need to test it thoroughly on various datasets. I can try updating the issue with specific suggested ranges, but I won’t be able to do it right away.

ddobric · February 2, 2020, 9:22am

Hi all,

I was experimenting with boosting last days and really wander how it can be useful in the real life scenarios. In fact, when boost happen, the SP briefly forgets its learned state or a number cycles.
Here you can see two examples. First one uses maxBoost = 10.0 and second one maxBoost = 1.0. It is obvious that higher boosting makes SP more unstable than lower boost value. But, even maxBoost=1 brings SP to oscillate. It changes completely the state of active columns for few cycles and then comes back to the previous state.

Figure shows continuous training of SP with encoded digit ‘3’ (scalar encoder) 25000 cycles. y axis shows overlap in percent between cycles t and t-1. x axis shows the cycle. All peaks (i.e.: y<60) are cycles when SP enters unstable state and changes the majority of active columns learned in previous cycles.

I understand the concept and idea behind boosting. But changing of learned state (entering unstable state) in a real world scenario is in my opinion No-Go. Even suggested image recognition with large data-sets is not really an useful option. Whatever the size of data-set is, SP will under these conditions always enter unstable state.
It would love to see some comments on this and to learn how other deal with this issue.

Thanks

rhyolight · February 3, 2020, 6:50pm

You should try different max boost values. I seem to remember some scenarios where even as low as 0.2 can be all you need.

Topic		Replies	Views
Problems with boosting? NuPIC boosting	6	891	May 27, 2016
Introduce new spatial pooler boosting rule (breaking change) NuPIC	4	1206	December 10, 2016
Understanding Boosting in Spatial Pooler Engineering	3	1134	August 8, 2016
How boosting changes the SP representation NuPIC spatial-pooling , boosting	23	2454	July 28, 2019
Boostfunction location, and other questions Engineering	12	986	July 3, 2017

Why maxBoost is always 2.0?

Related topics