CereSpMM is a unified sparse matrix-matrix multiplication (SpMM) framework designed for the Cerebras CS-3 wafer-scale processor. It introduces a novel Stationary-A Broadcast-B (SA-BB) computation method and three format-specific SpMM ...
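For readers unfamiliar with the operation itself, SpMM multiplies a sparse matrix A by a dense matrix B. The following is a minimal reference sketch in Python, assuming A is stored in the common CSR layout. It is a generic textbook loop, not the SA-BB method or any of the framework's format-specific kernels, and the function name `csr_spmm` and the small example matrices are hypothetical.

```python
import numpy as np

def csr_spmm(indptr, indices, data, B):
    """Reference SpMM: C = A @ B, with A sparse in CSR form and B dense.

    indptr  : row pointers, length n_rows + 1
    indices : column index of each stored nonzero
    data    : value of each stored nonzero
    """
    n_rows = len(indptr) - 1
    C = np.zeros((n_rows, B.shape[1]), dtype=np.result_type(data, B))
    for i in range(n_rows):
        # Each nonzero A[i, j] scales row j of B and accumulates into row i of C.
        for k in range(indptr[i], indptr[i + 1]):
            C[i] += data[k] * B[indices[k]]
    return C

# Hypothetical 3x3 example: A = [[1, 0, 2], [0, 0, 3], [4, 0, 0]]
indptr = np.array([0, 2, 3, 4])
indices = np.array([0, 2, 2, 0])
data = np.array([1.0, 2.0, 3.0, 4.0])
B = np.arange(6, dtype=float).reshape(3, 2)  # [[0, 1], [2, 3], [4, 5]]

C = csr_spmm(indptr, indices, data, B)
print(C)
```

Only the stored nonzeros of A are visited, so the work scales with the number of nonzeros times the width of B rather than with the full dense product.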
An AI-acceleration paper presents a method for optimizing sparse matrix multiplication in machine learning models, with a particular focus on structured sparsity. Structured sparsity involves a ...