Reward hacking in simple HTM agents (using OpenAI Gym)

marty1885 · May 18, 2020, 2:21am

I didn’t add a TM to the agent because the world state is static and nothing is hidden from the agent, so there’s no need for memory. Hm… how could HTM perform reward tracking? I’ve been trying, but it seems very difficult with the standard SP/TM and local learning rules. (Tracking rewards as a real number is also tricky.)
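
To make the "reward as a real number" problem concrete, here is a rough sketch of one possible workaround: turning the scalar reward into a sparse binary vector, in the style of an HTM scalar encoder, so it could be fed to an SP like any other input. This is plain Python and not the API of any HTM library; `encode_reward`, `overlap`, and every parameter (the reward range, `size`, `active_bits`) are made up for illustration, and it does not solve the learning-rule side of the question.

```python
def encode_reward(reward, min_val=-1.0, max_val=1.0, size=64, active_bits=8):
    """Encode a real-valued reward as a sparse binary list.

    Hypothetical helper: a contiguous block of `active_bits` ones whose
    position depends on where `reward` falls in [min_val, max_val].
    """
    if size < active_bits:
        raise ValueError("size must be at least active_bits")
    if max_val <= min_val:
        raise ValueError("max_val must be greater than min_val")

    # Clamp so out-of-range rewards land in the first or last bucket.
    clamped = max(min_val, min(max_val, reward))

    # Number of distinct start positions for the block of active bits.
    n_buckets = size - active_bits + 1
    fraction = (clamped - min_val) / (max_val - min_val)
    start = int(round(fraction * (n_buckets - 1)))

    sdr = [0] * size
    for i in range(start, start + active_bits):
        sdr[i] = 1
    return sdr


def overlap(a, b):
    """Count the bits that are active in both encodings."""
    return sum(x & y for x, y in zip(a, b))


if __name__ == "__main__":
    low = encode_reward(0.0)
    near = encode_reward(0.1)
    far = encode_reward(1.0)
    # Similar rewards share active bits; distant rewards share none.
    print(overlap(low, near), overlap(low, far))
```

The point of the block layout is that nearby rewards share bits, so overlap gives a crude similarity measure. What the agent then does with that encoding is the open part of the question.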

That’s a very good point!
