I experimented with encoding vision and here is what I came up with:
-
Use openCV’s retina algorithms. It’s well researched and it works well.
- It does log-polar transforms
- It does transforms from RGB to Parvocellular & Magnocellular images.
- openCV provides a lot of utilities for working with images and movies. It reads & writes video from file. It can display videos too.
-
I came up with a good method of encoding the resulting image into an SDR. I describe it here: Encoding vision for HTM. I think that this encoder could also work for audio data, but I haven’t tested audio.