I don’t think that anyone here will argue that this in not true. The HC/EC is situated in the correct relationship with the temporal lobe and the outputs of the WHAT & WHERE streams as a processor of digested high level representations. A good chunk of this is spatial information.
The same basic computational structures are distributed throughout the cortex and used in different way based on the representation of data at that level of processing.
Let me shortcircuit the question about what I mean by levels - I mean the distance or number of maps between the map in question and the primary sensory modality that is being processed.
Speaking of “processing,” I feel that there is a big concept lurking here. I have been pondering what is being done in the cortex for a long time. I think that there is something like Shannon’s seminal paper on the theory of what information is being conveyed and the amount of transformation that is done in each map waiting to be discovered.