I’m still working on this, but I’m too excited to hold back and want to share the results right now.
It’s my graduation project: I’m building RL agents using HTM algorithms. To my surprise, HTM works quite well (compared to DQN and A2C) in environments with sufficiently dense rewards. HTM can learn how to act in just a few episodes. But the learning seems to collapse after, say, 200 training loops; after that the HTM agent just doesn’t know how to act anymore.
(Figure: An HTM agent in the CartPole-v1 environment, reaching high rewards very early.)
I’ll release the source code once it’s ready.