How does HTM recognize thousands of real-world objects?

As I now understand it, the model of a single real-world object is stored across thousands of columns in our brains, and the same object is modeled through different perceptual processes such as vision and audition.

@Paul_Lamb and @Bitking have made the concepts of “level” and “hierarchy” clear, as shown below.

But in our neocortex, connections like those in the picture above do not exist explicitly; there are just columns that all share the same circuit structure. When we recognize a cat, there are many levels of features: 1) edges and textures, 2) basic geometric shapes, 3) feet, body, head, eyes, etc., 4) the cat itself. How, then, does our cortex represent these features hierarchically?
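
To make the question concrete, here is a minimal sketch of the kind of explicit part-whole hierarchy I have in mind for the cat example. It is only an illustration: the `PARTS` table and the `level` function are hypothetical names of my own, not part of HTM or any library, and the sketch says nothing about how the cortex actually does it.

```python
# Hypothetical illustration only: an explicit part-whole hierarchy for "cat".
# None of these names come from HTM or from any library.
PARTS = {
    "cat": ["feet", "body", "head", "eyes"],
    "feet": ["basic shapes"],
    "body": ["basic shapes"],
    "head": ["basic shapes"],
    "eyes": ["basic shapes"],
    "basic shapes": ["edges and textures"],
    "edges and textures": [],
}


def level(feature):
    """Return 1 for a feature with no parts, else 1 + the deepest part's level."""
    children = PARTS[feature]
    if not children:
        return 1
    return 1 + max(level(child) for child in children)


if __name__ == "__main__":
    # Print each feature with the level it would occupy in this explicit tree.
    for name in PARTS:
        print(level(name), name)
```

In this sketch the levels are wired in by hand, which is exactly what the columns in the cortex do not seem to have, since they all share the same circuitry.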