Saccades key to vision?

sebjwallace · May 2, 2018, 4:49pm

A naive approach I had thought of in the past is similar to the quad-tree demonstrated above. Imagine all the ‘pixels’ of the retina feeding into the cortex as a 16x16 layer. Parallel to that layer, the same retina feed goes into a layer of 8x8 pixels. Each pixel in the 8x8 layer is on or off depending on the number of on pixels within its ‘receptive field’ in the 16x16 layer below it. If over half the children are on, then the parent is on.
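To make the majority rule concrete, here is a rough sketch in Python. The function name `reduce_layer`, the 2x2 receptive field, and the test image are my own assumptions for illustration, not something from an existing implementation.

```python
def reduce_layer(layer, field=2):
    """Downsample a square binary grid by majority vote.

    A parent pixel is on only when MORE than half of the pixels in its
    field x field receptive field are on (exactly half stays off).
    """
    size = len(layer) // field
    parent = []
    for r in range(size):
        row = []
        for c in range(size):
            on = sum(
                layer[r * field + dr][c * field + dc]
                for dr in range(field)
                for dc in range(field)
            )
            row.append(1 if 2 * on > field * field else 0)
        parent.append(row)
    return parent


# Hypothetical 16x16 retina: a filled 6x6 square at rows/cols 4..9.
retina = [
    [1 if 4 <= r < 10 and 4 <= c < 10 else 0 for c in range(16)]
    for r in range(16)
]
layer8 = reduce_layer(retina)  # the parallel 8x8 layer
```

With a 2x2 field, "over half" means at least 3 of the 4 children, so thin lines one pixel wide would vanish at the next level. A real version would probably want a tunable threshold.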

Putting it together into a 2D hierarchy (or quad-tree), you then have a representation of the image at various levels.
(the below drawing is not exact, but good enough for illustration)
[Image: visual_scales]

The purpose of this is to control saccade movements top-down through the hierarchy/tree. In the 2x2 layer there are two on pixels, representing two areas of interest. Down at 4x4 the form becomes a clearer representation, but it still only serves to focus attention on the relevant objects/corners/edges. The control then feeds further down until you get to an exact edge or corner in 16x16. A jump (say, from corner to corner) can easily be done by feeding the target ‘features’ down from 4x4 to target corners in 16x16.
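A minimal sketch of that top-down control, under some loud assumptions: the names `build_pyramid` and `descend` are hypothetical, the receptive field is 2x2, and "feeding down" is simplified to following the first on child in row-major order at each level. Nothing here detects real edges or corners; it only shows how a coarse area of interest narrows to one pixel in the 16x16 layer.

```python
def reduce_layer(layer):
    """2x2 majority vote: parent is on when at least 3 of 4 children are on."""
    n = len(layer) // 2
    return [
        [
            1 if (layer[2 * r][2 * c] + layer[2 * r][2 * c + 1]
                  + layer[2 * r + 1][2 * c] + layer[2 * r + 1][2 * c + 1]) > 2
            else 0
            for c in range(n)
        ]
        for r in range(n)
    ]


def build_pyramid(retina, coarsest=2):
    """Return all layers, coarsest first (2x2, 4x4, 8x8, 16x16)."""
    levels = [retina]
    while len(levels[-1]) > coarsest:
        levels.append(reduce_layer(levels[-1]))
    return levels[::-1]


def descend(pyramid, start):
    """Follow on pixels from a coarse pixel down to the finest layer.

    Returns the path as (layer_size, row, col) tuples. Stops early if
    none of the current pixel's children are on.
    """
    r, c = start
    path = [(len(pyramid[0]), r, c)]
    for layer in pyramid[1:]:
        children = [(2 * r + dr, 2 * c + dc) for dr in (0, 1) for dc in (0, 1)]
        on = [(cr, cc) for cr, cc in children if layer[cr][cc]]
        if not on:
            break
        r, c = on[0]  # simplification: take the first on child
        path.append((len(layer), r, c))
    return path


# Hypothetical retina: two filled 8x8 blobs (top-left and bottom-right),
# which gives two on pixels in the 2x2 layer.
retina = [
    [1 if (r < 8) == (c < 8) else 0 for c in range(16)]
    for r in range(16)
]
pyramid = build_pyramid(retina)
path_a = descend(pyramid, (0, 0))
path_b = descend(pyramid, (1, 1))
```

Here `path_b` ends at pixel (8, 8) of the 16x16 layer, which happens to be the top-left corner of the second blob. A jump from one blob to the other is then just a jump between the end points of the two paths.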

As you can see above, the movement from one point to another is smaller (in pixels) as you go up the hierarchy. The saccades still occur on the 16x16 layer, but the control works on all levels in a coordinated fashion.
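The shrinking movement is just arithmetic. A quick sketch, where `project` and `jump_length` are made-up helper names and the start/target points are arbitrary:

```python
def project(point, fine_size, coarse_size):
    """Map a (row, col) pixel in the fine layer to its ancestor pixel
    in a coarser layer."""
    scale = fine_size // coarse_size
    return (point[0] // scale, point[1] // scale)


def jump_length(a, b):
    """Manhattan distance between two pixels, in pixels of that layer."""
    return abs(a[0] - b[0]) + abs(a[1] - b[1])


# Hypothetical saccade on the 16x16 layer.
start, target = (0, 0), (8, 8)

lengths = {}
for size in (16, 8, 4, 2):
    a = project(start, 16, size)
    b = project(target, 16, size)
    lengths[size] = jump_length(a, b)
# lengths == {16: 16, 8: 8, 4: 4, 2: 2}
```

The same saccade halves in length at every level up, so the top of the hierarchy only has to choose between a handful of neighbouring cells.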

This could also help with scale invariance. If features were detected at each level, then a feature seen close-up or far away would be captured as the same feature, just at a different level.
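A toy illustration of the scale idea, again only a sketch: the "feature" is a plain filled 2x2 block, matching is done by cropping each layer to the bounding box of its on pixels, and `crop`, `matching_levels`, and `square` are hypothetical helpers.

```python
def reduce_layer(layer):
    """2x2 majority vote: parent is on when at least 3 of 4 children are on."""
    n = len(layer) // 2
    return [
        [
            1 if (layer[2 * r][2 * c] + layer[2 * r][2 * c + 1]
                  + layer[2 * r + 1][2 * c] + layer[2 * r + 1][2 * c + 1]) > 2
            else 0
            for c in range(n)
        ]
        for r in range(n)
    ]


def crop(layer):
    """Cut a layer down to the bounding box of its on pixels."""
    rows = [r for r, row in enumerate(layer) if any(row)]
    cols = [c for c in range(len(layer)) if any(row[c] for row in layer)]
    if not rows:
        return []
    return [row[cols[0]:cols[-1] + 1] for row in layer[rows[0]:rows[-1] + 1]]


def matching_levels(retina, template):
    """Return the layer sizes at which the cropped image equals the template."""
    sizes = []
    layer = retina
    while True:
        if crop(layer) == template:
            sizes.append(len(layer))
        if len(layer) <= 2:
            break
        layer = reduce_layer(layer)
    return sizes


def square(top, side, size=16):
    """Hypothetical test image: a filled square on an empty retina."""
    return [
        [1 if top <= r < top + side and top <= c < top + side else 0
         for c in range(size)]
        for r in range(size)
    ]


template = [[1, 1], [1, 1]]
far = matching_levels(square(4, 4), template)   # small (far-away) square
near = matching_levels(square(4, 8), template)  # large (close-up) square
```

The small square matches the template in the 8x8 layer and the large one matches in the 4x4 layer, so the same detector fires for both and only the level differs. This only works cleanly when the shapes line up with the grid, which a real retina would not guarantee.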

But of course, pure theory.
