Number of combinations?

mraptor · November 4, 2016, 6:57pm

If we have a TM with 5 rows and 100 columns, the possible number of 100bit patterns (by cols) are 5^100.
How many are the possible states if only 20 of those 100 bits can be ON, instead the full 100 ?

thanks

dorinclisu · November 5, 2016, 2:29pm

First, there are n choose w column combinations.
Then, for every column combination there are (nCellsPerColumn)^w cell combinations.
Therefore the total number of states is 100! / (100-20)! * 5^20.

edit: I meant combinations but I used permutations, therefore the formula above is wrong. See below for correct formula.

mraptor · November 12, 2016, 9:43pm

You meant ‘permutations’ not ‘combinations’ , right ?

or isnt it :

100!/ ((100-20)! * 20! )

dorinclisu · November 13, 2016, 9:34am

Oops, I actually meant combinations but I used the permutations formula instead!
It’s indeed 100! / ((100-20)! * 20!) * 5^20
Or more generic: n! / (w!(n-w)!) * (nCellsPerColumn)^w

cogmission · November 13, 2016, 12:38pm

Please excuse if I’ve made a mistake, but when you have 5 rows and 100 columns, your total number of columns == 500. Would you not use the formula over the whole column/cell matrix rather than calculate for one row?

Thus making the formula:

500! / ((500-100)! * 100!) * 5^100

Just curious?

dorinclisu · November 13, 2016, 4:55pm

That is true, but i think that @mraptor meant 100 columns and 5 cells per column, not 5 rows as input topology, so the total number of columns is just 100. Because by having w = n the formula reduces to 5^100, which was the starting point (also mentioned in the TM paper).

Side note: why does TM accept multidimensional inputs and not exclusively flat vectors? I can understand this feature in SP making use of input topology, but as far as I know the TM computation does not care about topology, so the interface is just confusing.

cogmission · November 14, 2016, 1:00pm

Ok, if he meant columns/cells then that makes sense now, thanks. Topology (> 1) dimensions is possible with the SP in certain terms. The dimensions variable is expressed as a 2D entity but it isn’t “committed” in terms of how it relates to columnar dimensionality - meaning it will treat the dimensionality as a flat array at times, and internally I don’t believe the distinction is that important except when computing wrap-around locations for neighborhood computations.

If you look here and here, it seems it in fact treats the activeColumns input as a flat vector? Which interface were you looking at?

dorinclisu · November 14, 2016, 1:26pm

I am not sure what you mean, but I assume it’s the fact the input is always passed as a flat vector (of course, any N-dimensional array is flat in the low-level memory) but in the case of SP with local inhibition, elements on different rows in the input matrix can be treated as neighbors despite being far in the flattened format. No problem with this.

I don’t know Java but at least in Python and C++, the problem is in the constructors / init, that columnDimensions is a vector instead of a simple scalar, suggesting that it matters if you input columnDimensions = (m, n) instead of columnDimensions = (m*n) as in the case of SP, when in the case of TM it doesn’t.

cogmission · November 14, 2016, 1:54pm

I’m going to attempt an explanation here, and you can tell me if it makes sense or you would like it explained in other terms, ok?

The dimensionality of the input field and the SP dimensionality must match: meaning that the same number of dimensions are needed however they don’t have to match when looking at the flat input vector width and the flat SP column vector width. This is the case as far as I remember due to the outer-vector calculation of the infamous rightVecSumAtNZ() method. This was a constraint I remember having to get enormous clarity around while writing the Java version of the SP.

Whether or not this constraint is (still?) the case with the Python and C++ versions or not, I believe the reason why it is specified the same way in the SP and TM is just for the sake of uniformity… If you look here in the C++ TM code, the only value which is pertinent is the number_of_columns, so you are correct in that the dimensionality isn’t made use of in that algorithm.

Python offers a convenience syntax which allows one to get around the constraint of specifying two dimensions, but the Java and C++ versions of course don’t have this kind of syntax candy.

Long answer short. Yes, the (multi) dimensionality is not really used in the TM, but whether it is specified in multiple dimensions or as a flat dimension, the number of columns must be the same, I believe.

mraptor · November 14, 2016, 11:23pm

Sorry for the stupid question. I have always had a problem of converting the word-problem to combinatorics.

In this case you pick the combination formula n-choose-w, because … ?
It is easy to understand that with 2 types (0,1) and 100 positions you get 2^100 permutations.
Then it is easy to imagine 5-rows as 5 types and then the possible permutation of X positions then 5^X permutation.

But why combinations for 20 out of 100 bits of 2 types?

I think I got it … along this description: http://www.math.ucsd.edu/~jverstra/154-part1.pdf

Permutations are : sequences built from sets
Combinations are : sets built from set.

So … we have 100 bits, so we create set S = {1,2,3 … 100 }
So sets of 20 elements of S is the answer.

Set: unique elements
Sequence : ordered elements, may repeat

Topic		Replies	Views
Some doubts on TM algorithm Numenta Theory	1	400	November 9, 2018
Sequence in temporal memory Applications	7	727	December 2, 2019
Quck question? Lounge	2	617	April 4, 2021
How are the cells per columns and length of sequence related? Numenta Theory sequence-memory	33	2509	May 9, 2021
Exploring htm.core and the TM parameters NuPIC Community Fork	11	904	January 23, 2023

Number of combinations?

Related topics