Learning how birds teach themselves to sing Dad's song

Yeah, so this idea is pretty general to machine learning, I think. The jargon differs between fields, but it amounts to different ways of saying the same sort of thing.

No matter what method you are using (gradient descent, genetic algorithms, reinforcement learning, etc.), they all have an error function (in genetic algorithms it is called a ‘fitness function’ or ‘objective function’, for example), which is simply a measure of the distance between the agent’s current output and the desired/expected output. They all also have a way to use noise/stochasticity to search the space of possible outputs. Outputs that are relatively closer to the desired output are selected; they have a ‘smaller error’, so they are ‘good’. In HTM talk, when comparing SDRs, they could be said to have a ‘greater overlap’. Either way, it’s the same thing: compare/measure two things, select the representations with the greatest similarity/overlap, then repeat. This whole process is common to a vast number of machine learning methods.
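A minimal sketch of that generic loop, using SDR-style overlap as the (inverse) error function. Everything here is illustrative: the bit-vector sizes, the target, and the one-bit mutation step are all assumptions, not any particular library's API.

```python
import random

random.seed(0)
N_BITS, N_ACTIVE = 128, 16

def overlap(a, b):
    """Similarity measure: count of shared active bits (the 'overlap' of two SDRs)."""
    return len(a & b)

# Hypothetical desired output, represented as a set of active bit positions.
target = set(random.sample(range(N_BITS), N_ACTIVE))

# Start from a random guess, then repeat: perturb with noise, keep the candidate
# whenever its distance to the target is no worse (i.e. its overlap is no smaller).
current = set(random.sample(range(N_BITS), N_ACTIVE))
for _ in range(20000):
    candidate = set(current)
    candidate.remove(random.choice(sorted(candidate)))  # noise: drop one active bit
    new_bit = random.randrange(N_BITS)                  # noise: add a random new bit
    while new_bit in candidate:
        new_bit = random.randrange(N_BITS)
    candidate.add(new_bit)
    if overlap(candidate, target) >= overlap(current, target):
        current = candidate  # selection: greater (or equal) overlap survives

print(overlap(current, target), "of", N_ACTIVE)
```

Relabel `overlap` as fitness, negative loss, or reward and the same loop reads as a genetic algorithm with population size one, a crude gradient-free optimizer, or a reinforcement signal.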

So far HTM works on feed-forward learning from sensory input to model the world. Once you have a semantic model of the world, you can leverage it to run the process described above. If you have an objective/goal (say, to learn to produce a songbird’s song), that would require generating noisy/stochastic SDRs within the motor region hierarchy. The motor SDRs that produce an output sounding similar to the dad’s song SDR will be remembered. So the agent produces an output from a stochastic motor SDR (‘babbling’); the sound it produces is fed into the auditory region, where it is represented as a sensory input SDR and then compared to the dad’s song SDR. The overlaps are then probably sent to an association area along with the outputs of the motor SDRs that produced that sound. This region is probably where the SDRs are compared and the reinforcing feedback to the motor region is determined. Repeat that enough times and it will build up a representation in the motor region that produces a very similar output to that of the dad’s song. Again, the process is basically stochastic sampling and semantic selection. The motor representation is likely built bottom-up in the motor region: it starts with small features (as they are easier to compare), then combines them into more complex features, until the top level of the hierarchy represents the whole motor representation of the dad’s song.

I don’t know if this is what the brain does, but it shows how this general machine learning method could be implemented in an HTM system to replicate the learning of the songbird.
