How does ImageSensor work?

I had a general question on how ImageSensor (in nupic.vision) works. I would appreciate a somewhat high-level explanation (going into the algorithm is fine too) of what it does with the images it takes in before feeding them to the SP.


I really don’t know, but I think @scott does. But he’s out of town at Cosyne, so I would give him a few days to respond if you don’t mind.

The ImageSensor has documentation here:

There are two main concepts: explorers and filters. Explorers control how an image, or a section of an image, is selected as input at each step. Filters preprocess the image (e.g., apply a Gabor filter). The actual loading of images is done through commands to the region that specify a directory in which the images are located. See the MNIST example doing that here:

So the ImageSensor doesn’t really enforce any particular algorithm or processing. Instead, it delegates image loading and cropping to “explorers” and preprocessing to “filters.” Let me know if you have any other questions or if the code documentation isn’t clear.
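A minimal sketch of that delegation, with made-up class and method names (this is not the real nupic.vision API, just an illustration of the pattern): filters transform the image array in sequence, and the explorer decides which window(s) of the filtered image get emitted.

```python
import numpy as np

class ThresholdFilter:
    """Toy stand-in for a preprocessing filter (e.g., a Gabor filter)."""
    def process(self, image):
        # Binarize: pixels above the threshold become 1, the rest 0.
        return (image > 128).astype(np.uint8)

class FlashExplorerSketch:
    """Toy stand-in for an explorer: emit the whole image once."""
    def windows(self, image):
        yield image  # a sweep-style explorer would yield several crops instead

def sensor_outputs(image, filters, explorer):
    # Run the filter chain first, then let the explorer pick the windows.
    for f in filters:
        image = f.process(image)
    return list(explorer.windows(image))

image = np.random.randint(0, 256, size=(8, 8))
outputs = sensor_outputs(image, [ThresholdFilter()], FlashExplorerSketch())
```

The point is just the separation of concerns: the sensor itself only orchestrates; swapping the explorer or the filter list changes what the SP sees without touching the sensor loop.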

Thanks for responding.

I guess I understand the exploring part that views the images in sections and applies filters to those (similar to how CNNs work). I’m more curious now what the output of this preprocessing will look like when it gets fed into the SP.
I think I can intuitively guess that, for a given feature (like an edge), there will be a filter to capture it and turn ON a specific bit at index i. But will it turn on multiple bits at different locations if it sees the same feature over the course of exploration? If so, how will the SP ‘know’ these belong to that specific feature unless the number of cells is very large and able to capture all the combinations of feature + position + etc.?

The output of the ImageSensor region will depend on the filters, so it will have more or fewer active bits depending on what combination you use. In the MNIST example I sent, I’m not sure off the top of my head what the output will look like, but you could run it with a breakpoint or print statement to see. I think you essentially get a black/white image where the black pixels are 1s. If you use a Gabor filter then you’d presumably have 1s just at edges.
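Assuming the black-pixels-become-1s guess above is right, a small sketch of what the SP input would look like for a tiny binary digit, just to illustrate the idea (the 1-to-1 pixel-to-bit mapping here is an assumption, not confirmed behavior):

```python
import numpy as np

# A toy 4x4 black/white "digit": 1 = black pixel, 0 = white pixel.
digit = np.array([
    [0, 1, 1, 0],
    [0, 0, 1, 0],
    [0, 0, 1, 0],
    [0, 1, 1, 1],
], dtype=np.uint8)

# The SP consumes a flat binary vector; each black pixel would be one on-bit.
sdr = digit.flatten()
active = np.flatnonzero(sdr)  # indices of the on-bits fed to the SP
```

Under this reading, the same edge appearing at two positions in the image would indeed activate bits at two different indices, which is what the question above is getting at.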

That’s interesting. So for the MNIST example, will it be correct to say it’s almost a direct 1-to-1 translation, as in, each black pixel results in a different active bit at the output? Which code do I have to look at to see this translation take place? So far I looked through some Explorers and Filter codes as well as ImageSensor.

I realize the MNIST example is just using Flash exploration, but for a case more like a sweep, how will each section get fed into the SP? Will the sections be separate iterations that get fed into the SP (so 5 feeds if numIterations = 5), or will it be a single feed that concatenates the five together? Would you mind guiding me to the code that does this as well?

Yes, I believe so but I’d have to run it to validate.

It will be separate iterations, as you describe. There is a holdFor option in the explorer, for instance, that will result in each image being output for multiple iterations.
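A toy sketch of that iteration behavior, with illustrative names (`sweep_iterations` and its parameters are invented for this example, not the real explorer API): each window position is a separate feed to the SP, and a holdFor-style setting repeats each position for several iterations rather than concatenating anything.

```python
def sweep_iterations(image_width, window_width, step, hold_for=1):
    """Yield one window position per SP iteration for a 1-D sweep."""
    positions = range(0, image_width - window_width + 1, step)
    for x in positions:
        for _ in range(hold_for):
            yield x  # each yield is one separate feed to the SP

# Sweeping a width-4 window across a width-10 image in steps of 2:
feeds = list(sweep_iterations(image_width=10, window_width=4, step=2))
# With hold_for=2, every position is presented twice in a row:
held = list(sweep_iterations(image_width=10, window_width=4, step=2, hold_for=2))
```

So with four window positions you get four SP iterations (eight with `hold_for=2`), not one concatenated input.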

The FlashExplorer code is here and you can see the specification for explorers in the base class which has decent documentation. Basically, the ImageSensor loads the images and tells the explorer about the data set and then relies on the explorer to determine the position in the image. You can look at the whole ImageSensor.compute function which is the top level region entry point to see how it uses the explorer.
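As a rough, hypothetical sketch of that control flow (class and method names here are guesses for illustration, not the actual `ImageSensor.compute` implementation): each compute call asks the explorer for the current position, crops the image there, emits that as the region output, and advances the explorer for the next iteration.

```python
class ToyExplorer:
    """Toy explorer holding a fixed list of window positions."""
    def __init__(self, positions):
        self.positions = positions
        self.i = 0

    def current_position(self):
        return self.positions[self.i]

    def advance(self):
        # Move to the next position, wrapping around the data set.
        self.i = (self.i + 1) % len(self.positions)

def compute_step(image_row, window, explorer):
    """One sketched 'compute' call: crop at the explorer's position, then advance."""
    x = explorer.current_position()
    out = image_row[x:x + window]
    explorer.advance()
    return out
```

The real compute function also runs the filter chain and handles commands and multiple images, but the sensor-asks-explorer-for-position loop is the core idea.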

It’s a pretty complex region and I think the top level documentation for the ImageSensor is the best place to start and you can look at the individual explorer and filter classes to see what each of them do.
