Power-of-Two-Based Permutation Logic

Sean_O_Connor · November 22, 2016, 12:58am

I’m just reading this at the moment:
Power-of-Two-Based Permutation Logic

Generally though I am more looking at papers about saddle points etcetera in deep neural nets.

https://papers.nips.cc/paper/5486-identifying-and-attacking-the-saddle-point-problem-in-high-dimensional-non-convex-optimization.pdf

The basic idea is that you never actually end up being trapped in a local minimum while training a large neural net. There is always some vector direction you can move the weights in to get an improvement. Ie. you can’t go wrong using some variation of the random hill climbing algorithm (for descending!!)
I guess it ends up that the structure of a deep neural net is stupid and the algorithm you train it with is stupid. However if you throw enough floating point operations at it with a GPU cluster you can train it to do things in a few weeks that would take an equivalent biological system millions of evolution to be able to do.

Sean_O_Connor · November 24, 2016, 2:02am

When I try random hill climbing with small nets it doesn’t work too well (or at all.) When I try it with large nets it seems to go far more smoothly. The scaling effects look to be counterintuitive. I guess that is the reason people abandoned the idea in the 1980’s when they could only experiment with small nets.

Sean_O_Connor · November 24, 2016, 5:43am

The biological paper then is suggesting (sparse) locality sensitive hashing. Okay, I’ll think about it that way. Maybe I’ll investigate multiplexing and demultiplexing as well.

Sean_O_Connor · November 25, 2016, 12:44am

I suppose repeatedly demultiplexing from 4 to 15 (+ 1 null) states is a very expansive and very sparsity inducing. In some forms of machine learning you create millions of variations of the input data and then choose some of those variations that best explain the target. Of course the chosen items can be demultiplexed into new variations and used to further explain the target.

Anyway you could have made something like that in the early 1970’s:http://www.ti.com/lit/ds/symlink/74ac11138.pdf

Sean_O_Connor · November 27, 2016, 8:29am

So recurrence plus demultiplexing to create a reservoir and then linear readout. It that it?
Well, who can be sure yet.

https://youtu.be/Seme-955jYk

Sean_O_Connor · November 27, 2016, 3:56pm

I’ll save you the trouble of having to look for the paper:
http://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1004967

Sean_O_Connor · November 28, 2016, 2:48am

http://journal.frontiersin.org/article/10.3389/fncom.2010.00024/full

And just as an aside:
https://en.wikipedia.org/wiki/Chaos_computing

Sean_O_Connor · November 28, 2016, 8:38am

It kinda, sorta sounds as if Hawkins/Ahmad synapse building results in micro-correlation learning in the reservoir. Making the reservoir less sparse over time and increasing the number of non-linearities that can be drawn from to make predictions. That is a dual learning system that works together to create great richness in the system yet being very simple to train. And in fact having very obvious and biologically plausible learning mechanisms.
Are kinda and sorta valid scientific terms?

Sean_O_Connor · November 30, 2016, 5:08am

I’m kinda using your forum as a collecting point for related ideas, hope it’s not annoying.
https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2773171/
I guess a simplification would be to use Hawkins/Ahmad type neurons as micro-correlators but not to feed the outputs back into the (random) reservoir. Then learn read-out projections from the whole kit and kaboodle to the wanted result. So while the reservoir might only contain a cue for a bright light and a cue for a loud noise a Hawkins/Ahmad neuron projecting into the reservoir could (unsupervised) learn any correlations in the environment between the two cues (simultaneous or one followed the other.) You are enriching the reservoir with unsupervised correlation learning. Obviously if that neuron were to fire it would be good idea to initiate fleeing behavior.
Such a myriad of possibilities though, AI is all about the structure.

Sean_O_Connor · November 30, 2016, 5:30am

http://journal.frontiersin.org/article/10.3389/fncom.2015.00036/full

Sean_O_Connor · November 30, 2016, 7:18am

https://archive.org/details/Redwood_Center_2010_02_24_Gordon_Pipa
I’ll leave it there for the moment and think about how best to code such ideas.

Topic		Replies	Views
Deep neural network trained only by hill climbing Lounge	2	1064	February 6, 2017
Random walk learning Lounge	1	383	May 31, 2018
Application of HTM in today’s ML frameworks Machine Learning	50	3391	March 28, 2019
Geoff Hinton and the Thousand Brains Theory Tangential Theories research	2	1044	July 31, 2023
Binary Reservoirs Lounge	4	793	February 22, 2017

Power-of-Two-Based Permutation Logic

Related topics