What data structures or types or objects are the best to implement synpases on a dendrite segment on a cell and thus potentially create an entire cortical columnar model? The algorithms can be boiled down without actually creating every biological unit but what if we were to make a programming model that replicates a cortical column entirely? How can we represent each unit while managing memory and performance?
Are there any specific abstractions one should implement over these units to make them more program feasible?
P.S.: New to the theory and trying to implement it ground up.
What is different with the approach I am proposing is that for a given cell body synapses with a strength of zero do not have to be maintained or processed in all operations. A pre-scan operation could extract the active connections. This exploits an essential feature of sparsity - only active connections are considered. The path table holds all the potential connections but only the active connections are maintained with each cell body data structure. This could result in a dramatic speedup in certain parts of processing.
There are also certain desirable biological features that are simplified as described in the original post.
If you choose to keep the strength of every synapse, active or not, you do not have to have to keep the location as this is stored in the related path table resulting in a reduction of the data consumed by the pointer to the synapse.
It would also be possible to make a path table that is a circular distribution around the cell body to facilitate a signal sampling that looks a little more biological. Topology counts.
Thank you @Bitking and @rhyolight
It seems interesting. Will look into SOMs and storage mechanisms. I initially thought of storing all synapses externally and then collecting the data for every operation. Since memory optimisation is a concern, using arrays, like @rhyolight mentioned was given a priority. @nivedita and I are using Julia for the implementation and we think the performance will be considerably good after looking at certain benchmarks and testing some code.
From a computational point of view, it is important to be able to quickly access synapses from both the pre-synaptic and post-synaptic side. This can amount to some extra book keeping but is well worth the programming effort and bytes of RAM.
Two required capabilities of HTM synapses are:
Every active input needs to be able to quickly notify every connected post-synaptic site.
Every learning output site needs to be able to quickly see which inputs are connected to it.
Scanning through all of the synapses looking for things takes a long time compared to already having the connectivity data in two arrays-of-arrays which are indexed by the pre-synapse and post-synapse.