Is billions of parameters (or more) the only solution?

A PID controller can also solve the cart-pole problem. The parameters of the controller need to be tuned ahead of time and it does not learn at run time.

Interestingly, the spinal cord and brain stem also use feedback-based controllers to enact muscle control and balance. For more info see How The Spinal Cord Generates Behavior .

3 Likes