How should tie-breaking be done when finding winning columns

rhyolight · September 6, 2016, 1:04am

Please review:

I have two questions:

How should these ties be broken?
How are our implementations actually doing it today?

alavin · September 6, 2016, 3:13pm

I’ve answered these questions in the issue here (at least for the Python imp.).

rhyolight · September 6, 2016, 3:21pm

My confusion stems from @cogmission’s comment that the C++ version does tie-breaking differently. I know there have been changes to tie-breaking logic in the not-so-distant past. I just want to ensure that we agree what the method should be, and that it is implemented in that way for all implementations we control.

Seems like one method is for winning columns to be inserted in the front of the tie queue and a line is drawn in the queue based on how many winning columns are needed. Is that the method we want to use everywhere? Is that what is happening in python and java as well?

Thanks for helping me clarify. This may be a non-issue, but I just want to make sure that we are using the same logic in all places. I think my confusion may just be that https://github.com/numenta/nupic/issues/3245 is a duplicate of https://github.com/numenta/nupic.core/issues/702.

cogmission · September 6, 2016, 6:39pm

It looks like while they go about it very differently, they are in fact doing the same thing, which is sorting the indexes of where the overlaps are found, in descending order (indexes of highest to lowest overlap scores). The Java version (with the new PR), does exactly the same thing - although yet again differently - which is not a big deal due to language best practices.

The questions that still remain are:

I think everyone agrees on this, but it is still hanging around - and that is, should the old unused tie-breaker code be removed?
Should there be some ancillary code to make tie-breaking more determinate? @alavin made the comment that Python’s argsort may not handle ties gracefully (i.e. indeterminate unpredictable tie resolution?)
If the consensus for #2 is “Yes”, then how should the ancillary tie code be implemented? Should we use @alavin’s suggested code here?
Alex, I have some questions about the code you proposed, but that’s another topic for discussion once we get this far.

rhyolight · September 6, 2016, 6:47pm

Yes it should. Should self._tieBreaker be removed? · Issue #3245 · numenta/nupic-legacy · GitHub is ready for work, and I’m going to mark Remove obsoleted tie breaker data from SP · Issue #702 · numenta/nupic.core-legacy · GitHub as a duplicate.

alavin · September 6, 2016, 8:10pm

It’s determinate, always preferring the later indices, e.g.:

>>> import numpy as np
>>> values = np.array([3, 0, 0, 1, 3, 7, 0, 3])
>>> np.argsort(values)[::-1]
array([5, 7, 4, 0, 3, 6, 2, 1])  # ties between the values 3 are handled by using the reverse order

The mechanism I suggested breaks ties randomly:

>>> randomValues = numpy.random.random(len(overlaps))
>>> np.lexsort((randomValues, values))[::-1]
array([5, 4, 7, 0, 3, 2, 1, 6])  # compare to the argsort example

cogmission · September 8, 2016, 9:39am

@alavin

Even if we don’t use this solution due to Subutai’s comments on the PR - this was still a very nice suggestion and very good “teaching moment” for me being a Python noob! Thank you, Alex!

(…and yes, I’ve upgraded my status from Python despiser, to Python noob, I’m actually starting to like it a bit!)

alavin · September 8, 2016, 3:57pm

I’m glad you enjoy my pythonic preaching @cogmission.
For more great tips and tricks I highly recommend “Effective Python”.

rhyolight · September 8, 2016, 4:15pm

generate.png500×700 154 KB

alavin · September 8, 2016, 4:58pm

Topic		Replies	Views
Can someone help me figure out how to debug my tic-tac-toe attempt? Engineering usage-help	9	2212	December 15, 2016
How is Overlap Score Implemented for the Spatial Pooler? Engineering spatial-pooling , nupic , overlap	2	1212	February 3, 2017
Algorithm API notebook for new-ish interested people NuPIC	15	2228	April 16, 2019
K-Winner take all computation in spatial pooling Implementations	5	558	September 4, 2020
Survey: Features & API-Compatibility NuPIC Community Fork question , community	22	1526	March 28, 2019

How should tie-breaking be done when finding winning columns

Related topics