Columns > 2048 when swarming


I was recently watching some of the videos on multivariate swarming. One of the suggestions for using multiple inputs is to use more than the default of 2048 columns, along with larger data sets. My data set is on the order of millions of rows, and I would like to swarm over 4 or 5 different fields. I can see where the number of columns is specified, but I cannot figure out how to change it for swarming purposes. I looked at the schema for the swarm_config.json file and didn’t see that variable anywhere.

Can anyone help shed some light on how to manipulate the number of columns during the swarming process?


Phil Elsasser


@pelsasser Sorry for the late response. The lower-level swarming interface described by @scott here will probably give you the flexibility you need, but running those swarms is going to take serious time and computing power. Correct me if I’m wrong, @alavin, but the SP should be able to handle an input space that is larger than the number of columns.
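One practical consequence: the column count lives in the OPF model parameters (under `spParams`/`tmParams`), not in the swarm description, so you can take the parameters a swarm produces and raise the column count before building the model. A minimal sketch, assuming the usual model-params layout (the exact key names should be checked against your NuPIC version; the dict below is a stand-in, not real swarm output):

```python
import copy

def set_column_count(model_params, column_count):
    """Return a copy of OPF model params with a new SP/TM column count."""
    params = copy.deepcopy(model_params)
    mp = params["modelParams"]
    mp["spParams"]["columnCount"] = column_count
    # The temporal memory must agree with the SP on column count.
    mp["tmParams"]["columnCount"] = column_count
    return params

# Minimal stand-in for swarm-generated params (structure assumed).
swarmed_params = {
    "modelParams": {
        "spParams": {"columnCount": 2048, "numActiveColumnsPerInhArea": 40},
        "tmParams": {"columnCount": 2048, "cellsPerColumn": 32},
    }
}

bigger_params = set_column_count(swarmed_params, 4096)
```

Deep-copying keeps the original swarm output intact, so you can build several models with different column counts from the same swarm run.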

Since you mention that you have multiple fields that might contribute to the result, I should point out that I've often thought the same thing in the past, only to find that none of those fields were chosen by the swarming algorithm to be encoded, meaning the swarming process did not think including them led to a better prediction.
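For reference, this is roughly what declaring several candidate fields looks like in a swarm description. Listing a field under `includedFields` only makes it available to the swarm; the swarm may still leave it out of the final model if it does not improve the prediction. All field names and the file path here are placeholders:

```python
# Hypothetical swarm description with several candidate fields.
swarm_description = {
    "includedFields": [
        {"fieldName": "timestamp", "fieldType": "datetime"},
        {"fieldName": "consumption", "fieldType": "float"},
        {"fieldName": "temperature", "fieldType": "float"},
        {"fieldName": "humidity", "fieldType": "float"},
    ],
    "streamDef": {
        "info": "my_data",
        "version": 1,
        "streams": [
            {
                "info": "my_data.csv",
                "source": "file://my_data.csv",
                "columns": ["*"],
            }
        ],
    },
    "inferenceType": "TemporalMultiStep",
    "inferenceArgs": {
        "predictionSteps": [1],
        "predictedField": "consumption",
    },
    "swarmSize": "medium",
}
```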

This discussion might need some details about the data you are analyzing for us to better understand how to help.

Yes, I’d suggest using a representative subset of your data to swarm over.

You are correct :slight_smile:

It’s very common (in ML in general) to find parameters of your dataset to be insignificant. Swarming can help identify if this is the case.


You can limit the number of records in the file.
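One simple way to do that: copy the header rows plus the first N data rows into a smaller file and point the swarm's stream definition at it. The sketch below assumes the NuPIC CSV layout with three header rows (field names, types, flags); adjust `header_rows` if your file differs:

```python
import csv

def write_subset(src_path, dst_path, n_rows, header_rows=3):
    """Copy header_rows plus the first n_rows data rows of a CSV."""
    with open(src_path, "r") as src, open(dst_path, "w", newline="") as dst:
        reader = csv.reader(src)
        writer = csv.writer(dst)
        for i, row in enumerate(reader):
            if i >= header_rows + n_rows:
                break
            writer.writerow(row)
```

If your data has daily or weekly patterns, make sure the subset still spans a few full cycles rather than just the first chunk of rows, so the swarm sees representative behavior.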