Thanks for an amazing talk, Jeff! I also have a few questions:
- What is the purpose of the L3? Is this the pooling layer that makes L4 more stable?
- Would this be a somewhat correct story? I’m ignoring different regions trying to negotiate the agreement here.
- We get input to L4 from some sensor.
- L6a provides the location of that sensor in the object space. (Where does L6a receives that input from? Just a wild guess?)
- L4 feeds to L3, where a more stable conceptual representation of the signal forms. Does it do temporal pooling to form this stable representation?
- Then L3 projects to L5 (at this point the signal is stable in terms of - layer know the “thing” it perceives) That get’s combined with L6b to understand what is that “thing” in terms of the broader knowledge. I.e. if it’s a handle of a cup or a handle of something else that is similar to touch.
Is this a remotely correct intuition?