Crafting a Reward Function with a Heir-achy of Needs for Complex Goal Formation

Tofara_Moyo · October 26, 2024, 12:16am

author tofara moyo
abstract

We show a simple method for an agent to learn levels of abstractions ordered by priority that ultimately increase the global expected reward. Each level is associated with a separate scalar output of the neural network at each time step t which is fed back to the agent as part of the state at time t+1. The agent then correlates them with features of the state initially randomly. It however learns the correct assignment by doing it in such a way that it increases the global reward. We describe an equation meant to order these scalar values and the global reward in order of priority and hence induce a heir-achy of needs for the agent. This then forms the basis of goal formation for it.

https://www.researchgate.net/publication/385249902_Crafting_a_Reward_Function_with_a_Heir-achy_of_Needs_for_Complex_Goal_Formation

Topic		Replies	Views
Multilevel development of cognitive abilities in an artificial neural network General Neuroscience gnw , reinforcement-learni	0	362	November 14, 2022
Exploring Reinforcement Learning in HTM Tangential Theories	19	4402	July 4, 2018
Exploiting symmetry in reality Tangential Theories agi	22	1250	December 5, 2021
Geoff Hinton and the Thousand Brains Theory Tangential Theories research	2	1044	July 31, 2023
Reinforcenment Learning & Planning without NUMBERS? Tangential Theories	6	570	April 20, 2020

Crafting a Reward Function with a Heir-achy of Needs for Complex Goal Formation

author tofara moyo abstract

Related topics

author tofara moyo
abstract