Decode SDR to Word

Gelo123321 · September 21, 2018, 8:39am

Hello guys! I’m trying to create a semantic encoder using HTM and Word2Vec with 50 dimensions. So, i put 50 scalar encoders in multi-encoder and convert dimensions into SDR. Then I get a prediction SDR. The question is: how i can convert this SDR into the word? Do I need using a classifier or not?

Matheus_Araujo · September 21, 2018, 1:35pm

You have an encoder take takes a word as input and gives you a SDR output.
When you have an output SDR and want to know wchich word in your domain it represents I think you should calculate the SDR that most overlaps the corresponding SDRs of your encoder.

rhyolight · September 21, 2018, 4:05pm

How large is the end encoding? Seems like too much data IMO. You should really look into Cortical.IO’s product, which can convert words into SDRS and back (and much, much more).

cogmission · September 21, 2018, 6:38pm

Here’s a link to Cortical.io’s web api demo: http://api.cortical.io

You can test out Fingerprint (SDR) production for several text formats…

Gelo123321 · September 24, 2018, 6:07am

I trying a several different configuration for input/output size. Encoder input (n) = 625, encoders count = 50, total output = 31250 with 2% active columns. I also tried set encoder input to 325.

Gelo123321 · September 24, 2018, 6:11am

Thanks! I will try to calculate overlaps from my SDR.

rhyolight · September 24, 2018, 4:16pm

With an input this large, your spatial pooling size is also going to need to be very large. I don’t think this approach is going to work because compute cost increases quickly as you increase the SP size (that’s a lot of potential connections!)

I suggest you find a way to decrease the size of your input space. I am not sure how you can do this and also represent so many distinct semantic features in the input.

Paul_Lamb · September 25, 2018, 11:24am

One way to reduce size of the input space would be to leverage topology, where semantically similar bits are physically closer to each other in the encoded representation than dissimilar ones. This would allow you to perform a simple scaling algorithm to reduce the size of the input space. This of course would require changing your encoding strategy.

If you decide to write a new encoder anyway, I would recommend exploring other strategies than stacking scalar encoders. One idea that comes to mind might be modding the SP algorithm to work in high dimensional space, so you can round the vectors and use them directly.

Topic		Replies	Views
GloVe Encoder Engineering	7	793	July 28, 2020
Scalar encoder to SDRs Numenta Theory	1	1198	April 11, 2017
Multiencoder and density of SDRs NuPIC	2	555	March 15, 2018
How can I encode data with large number of categories? HTM.Java encoders , category-encoding	8	912	January 4, 2020
Encoder and Spatial Pooler Confusion Getting Started	17	897	April 5, 2019

Decode SDR to Word

Related topics