I would recommend an even more trivial task to start with. Can you design a thermostat that learns to recognize the effects of its own actions as distinct from changes in the environment due to external causes? Can you then engineer a feedback mechanism that allows the system to discover an optimal operating point with minimal guidance? I think there is more than enough challenge in just that task to be worthy of study.
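To make the first question concrete, here is a minimal sketch of one possible approach; it is an illustration under stated assumptions, not a proposed solution. It assumes a toy room model with a hypothetical heater gain, a leak toward a sinusoidal outdoor temperature, and some sensor noise. The thermostat switches its heater on and off at random, then compares the average temperature change with the heater on against the average with it off. Because its own actions are randomized, the external effects roughly cancel in that comparison. All names and constants (`simulate_and_estimate`, `true_gain`, `leak`) are made up for the example, and the second question, finding an optimal operating point, is not addressed here.

```python
import math
import random


def simulate_and_estimate(steps=2000, true_gain=0.5, leak=0.1, seed=0):
    """Estimate the heater's per-step effect from randomized on/off actions.

    The room model and every constant here are assumptions for illustration.
    """
    rng = random.Random(seed)
    temp = 15.0
    sum_on = sum_off = 0.0
    n_on = n_off = 0

    for t in range(steps):
        # External causes: slow outdoor cycle, leakage toward it, and noise.
        outdoor = 10.0 + 5.0 * math.sin(2 * math.pi * t / 200)
        external = leak * (outdoor - temp) + rng.gauss(0.0, 0.05)

        # The thermostat's own action, chosen at random (pure exploration).
        heater_on = rng.random() < 0.5

        delta = (true_gain if heater_on else 0.0) + external
        temp += delta

        # The thermostat only sees its action and the resulting change.
        if heater_on:
            sum_on += delta
            n_on += 1
        else:
            sum_off += delta
            n_off += 1

    # Difference of means: the part of the change attributable to the heater.
    return sum_on / n_on - sum_off / n_off


if __name__ == "__main__":
    print("estimated heater effect per step:", simulate_and_estimate())
```

Even this toy version leans on a strong assumption: the thermostat is free to act at random for a long time. Dropping that assumption is where the task starts to get hard.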