OpenAI Paper Review: GPU Kernels for Block-Sparse Weights



Some sort of locality-sensitive hashing to switch groups of parameters in and out, keyed on the input, could be a cheap way of boosting performance; a rough sketch of the idea follows.
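This is my own hypothetical sketch, not anything from the paper: the names (`lsh_bucket`, `forward`) and the bucket count are made up for illustration. It just shows sign-random-projection hashing choosing which weight block participates in a forward pass, assuming NumPy:

```python
import numpy as np

rng = np.random.default_rng(0)

n_in, n_out = 256, 256
n_bits = 3                                             # 2**n_bits = 8 parameter groups
planes = rng.standard_normal((n_bits, n_in))           # fixed LSH hyperplanes
blocks = 0.05 * rng.standard_normal((2**n_bits, n_out, n_in))  # one weight block per bucket

def lsh_bucket(x):
    """Sign-random-projection hash: nearby inputs tend to land in the same bucket."""
    bits = (planes @ x) > 0
    return int(np.dot(bits, 1 << np.arange(n_bits)))

def forward(x):
    """Multiply by only the weight block selected by the hash of the input."""
    W = blocks[lsh_bucket(x)]
    return np.maximum(W @ x, 0.0)  # ReLU

x = rng.standard_normal(n_in)
print(lsh_bucket(x), forward(x).shape)
```

Per step this touches one block of `n_out * n_in` weights out of `2**n_bits` blocks, so the active parameter count stays constant while total capacity scales with the number of buckets.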

There is even a question of whether you need general matrix operations at all, except for the synthetic, structured ones that can be effected by fast transform algorithms such as the FFT or the fast Walsh-Hadamard transform.

It would be amusing and not amusing at the same time if there are currently clusters of hundreds of GPUs burning serious electrical power to run a bubble-sort-level O(n^2) algorithm where an O(n log n) algorithm might do instead. A sketch of that kind of O(n log n) transform is below.
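For concreteness, here is a minimal sketch (again mine, not from the paper) of the sort of fast transform I mean: an iterative fast Walsh-Hadamard transform, which applies a fixed n x n matrix of +-1 entries in O(n log n) additions instead of the O(n^2) multiply-adds of a dense product:

```python
import numpy as np

def fwht(x):
    """Fast Walsh-Hadamard transform; len(x) must be a power of 2. Returns a new array."""
    x = np.asarray(x, dtype=float).copy()
    n, h = len(x), 1
    while h < n:
        for i in range(0, n, 2 * h):
            a = x[i:i + h].copy()
            b = x[i + h:i + 2 * h]
            x[i:i + h] = a + b          # butterfly: sums
            x[i + h:i + 2 * h] = a - b  # butterfly: differences
        h *= 2
    return x

# Check against the explicit dense Hadamard matrix H[i, j] = (-1)**popcount(i & j).
x = np.random.default_rng(1).standard_normal(8)
H = np.array([[(-1) ** bin(i & j).count("1") for j in range(8)] for i in range(8)])
assert np.allclose(fwht(x), H @ x)  # same result as the O(n^2) dense multiply
```

The butterfly loop does n additions per level over log2(n) levels, which is where the O(n log n) count comes from; this is the transform underlying fixed filter bank layers.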

Anyway, the paper is a reminder to try LSH parameter switching with fixed filter bank neural networks and see what happens. It would be unfortunate if it interfered with the evolutionary training algorithm I use; I'm not sure what will happen.