Reward hacking in simple HTM agents (using OpenAI Gym)

In the subcortex there are multiple nodes, each evaluating current needs and asserting a goal state based on which one screams the loudest. For example thirst and hunger both drive action but you might ignore food if you are really thirsty. After the thirst is satisfied the hunger wins and you turn to finding food.
Maslow describe a pyramid of needs and as you satisfy the lower needs the higher level goals are able to be activated.In the post on processing in the old brain, I build to a global goal state, driven by inputs from local sub-processors.

2 Likes