Deep Predictive Learning: A Comprehensive Model of Three Visual Streams

I was about to ask this as a question. Which is much more likely, a messy propagation of error signal or a much more organized described in this paper? My intuition tells me it’s the former and the system can likely diverge. What do you think?