Repo for merging various Encoders

breznak · February 9, 2019, 9:09pm

Hey all,

for the community repo we’re in the need for more c++ encoders.

I was thinking about that and I’d like to stratch that project and create a shared base of encoders + HTM system. A universal environment for running HTM systems and experiments.

First of all, I’ll be developing, merging and looking for help, ideas of specialized encoders (various modalities- vision, hearing, touch, language, chess positions, numbers, …) So if you have a specialized HTM project, please do hit me up and we can join efforts.

Second, for the basic/common encoders, such as expressing cathegories, integeres, real numbers, …even images, video -> there exists tons of encoders for other ML frameworks (DNN).
Is there a common (c++, open-source, active) collection of such encoders? So we could just reuse them. As encoder “just” provides a binary vector from the world, rest is up to HTM.

marty1885 · February 10, 2019, 1:09am

Cool idea!
What are the requirements for a encoder to be accepted? Are very experimental encoders good (ex: GridCell, Bitking’s Hex grid, vision encoder). Are handy but trivial to implement encoders allowed (Ex: Frequency encoder (FFT + scalar encoder))

dmac · February 10, 2019, 2:38am

I experimented with encoding vision and here is what I came up with:

Use openCV’s retina algorithms. It’s well researched and it works well.
- It does log-polar transforms
- It does transforms from RGB to Parvocellular & Magnocellular images.
  - https://docs.opencv.org/3.1.0/d2/d94/bioinspired_retina.html
- openCV provides a lot of utilities for working with images and movies. It reads & writes video from file. It can display videos too.
I came up with a good method of encoding the resulting image into an SDR. I describe it here: Encoding vision for HTM. I think that this encoder could also work for audio data, but I haven’t tested audio.

breznak · March 1, 2019, 5:43am

definitely! We do support some experimental(research) features in Connections, SP, etc. So encoders for doing active HTM research are welcome.

yes, probably to some degree (we won’t want to end up with 1000 encoders), but so far there are none, so any reasonable code is good.

First, I’d like to make the encoders part of nupic.cpp, as are part of python repo. If the project takes up, I was thinking of making a separate repo “nn.encoders”, so even non-HTM projects can join on the encoders…

breznak · March 1, 2019, 5:45am

This is good!
Now that we have a working vision (MNIST) example MNIST example by breznak · Pull Request #242 · htm-community/htm.core · GitHub , once I get to merging the “image encoder” from nupic.vision, I’d like to compare that with your VisionEncoder!

marty1885 · March 1, 2019, 6:40am

What do you mean for the nn.encoder repo? I think HTM and SNN are the two only models that needs a encoder. (But they encode data differently tho)

breznak · March 1, 2019, 8:01am

well, the initial idea was for htm community, so each project does not have to implement encoders. But then nupic.cpp (with bindings) might be just the right fit.
Then I thought other networks (not only HTM, Spiking…) need encoding, for all DNNs (autoencoders, …) but then the architecture/requirement for encoding will be different, so that idea (shared encoders for many NNs) probably won’t kick.

breznak · March 1, 2019, 7:44pm

I’m happy to announce a number of (most of the numenta’s py) encoders in c++ added in

The encoders are working and with tests, we’re looking for volunteers to help them integrate within the repository (may need some fixes regarding multiplatform support, modern c++11/17 etc)

brev · March 3, 2019, 10:15am

@breznak long time coming, thanks!

breznak · March 3, 2019, 10:28am

We’re now discussing how to implement “decode” operation.
If it makes sense to have decode in encoders (but not rest of HTM pipeline supports it), if classifier is enough, and where decode must exist.

Please join with ideas & help in https://github.com/htm-community/nupic.cpp/issues/297
Biological way to decode SDR back to Objects

Topic		Replies	Views
Encoders created by the community Lounge encoders , community	2	1098	July 26, 2018
How to encode images and other visual data for HTM system Engineering	5	945	July 14, 2020
Future development of fork + how 'far behind' are we? NuPIC Community Fork question , community , development	9	1330	September 2, 2020
Community CategoryEncoder in python exist? NuPIC	8	858	April 12, 2019
Know nothing about encoders? Want to learn? Engineering encoders , education , newbie	6	880	June 30, 2018

Repo for merging various Encoders

Related topics