You can just use simple associative memory with a random intermediate vector assigned to each training pair (vector a recalls vector b).
Then you have (vector a recalls vector r) and (vector r recalls vector b). That gives the pulling apart spoken of in the blog. Since random vectors in higher dimensional space are almost orthogonal. And really that to introduce an error correcting code into the system which you might do explictly if you knew polar codes for example.