Inference speedup with respect to a non-sparse model

I have two models trained on MNIST. The first is the model provided in the nupic.torch library, which uses sparse convolution and sparse linear layers and a KWinners activation function. The second replaces KWinners with ReLU and replaces all the sparse layers with the corresponding dense layers from PyTorch.
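For context, here is a minimal sketch of the kind of comparison I mean, using SparseWeights / SparseWeights2d / KWinners / KWinners2d from nupic.torch.modules. The layer sizes and sparsity values below are placeholders rather than my exact configuration, and the sparsity argument name may differ across nupic.torch versions:

```python
import torch.nn as nn
from nupic.torch.modules import (
    KWinners, KWinners2d, SparseWeights, SparseWeights2d
)

# Sparse variant: sparse weights + k-winners activations
# (sparsity / percent_on values are placeholders).
sparse_model = nn.Sequential(
    SparseWeights2d(nn.Conv2d(1, 32, 5), sparsity=0.5),
    nn.MaxPool2d(2),
    KWinners2d(channels=32, percent_on=0.1),
    nn.Flatten(),
    SparseWeights(nn.Linear(32 * 12 * 12, 700), sparsity=0.5),
    KWinners(n=700, percent_on=0.1),
    nn.Linear(700, 10),
)

# Dense variant: same shapes, plain PyTorch layers and ReLU.
dense_model = nn.Sequential(
    nn.Conv2d(1, 32, 5),
    nn.MaxPool2d(2),
    nn.ReLU(),
    nn.Flatten(),
    nn.Linear(32 * 12 * 12, 700),
    nn.ReLU(),
    nn.Linear(700, 10),
)
```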
I trained both models and benchmarked them on a GPU, expecting the sparse one to be faster; instead, I found that the non-sparse version is almost 2x faster.
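Here is roughly how I measured the forward-pass time; a sketch assuming a synthetic MNIST-shaped batch, with eval mode, torch.no_grad(), a warm-up loop, and torch.cuda.synchronize() around the timed region so the GPU timings are meaningful:

```python
import time
import torch

def benchmark(model, device="cuda", batch_size=64, iters=100):
    """Return the mean forward-pass time in milliseconds."""
    model = model.to(device).eval()
    x = torch.randn(batch_size, 1, 28, 28, device=device)  # MNIST-shaped input
    with torch.no_grad():
        for _ in range(10):           # warm-up so CUDA kernels are cached
            model(x)
        torch.cuda.synchronize()
        start = time.perf_counter()
        for _ in range(iters):
            model(x)
        torch.cuda.synchronize()      # wait for all queued kernels to finish
    return (time.perf_counter() - start) / iters * 1000

print("sparse:", benchmark(sparse_model), "ms")
print("dense: ", benchmark(dense_model), "ms")
```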
My question is whether I am doing something wrong, or whether sparse layers simply do not improve a model's inference time.