Three Tiers Transformers Brains

Maybe this paper is interesting.
https://arxiv.org/abs/2503.04848
The limitations about learning grammers might be due to artifical neural networks being heirachical associative memory, I kind of would imagine.

2 Likes