
MKL spmm and spgemm integration #147

Open
mmelnich wants to merge 5 commits into main from spmm_investigation

Conversation

Contributor

@mmelnich mmelnich commented Feb 4, 2026

This PR adds Intel MKL sparse BLAS support to RandBLAS, which gives a significant speedup for sparse-dense matrix multiplication when the sparse matrix is stored in the right format (CSR for left_spmm, CSC for right_spmm, because of the internal transpose).
The new pieces are mkl_spmm_impl.hh, with RAII wrappers for MKL handles; dispatch logic in spmm_dispatch.hh that tries MKL first and falls back to the hand-rolled kernels when MKL cannot handle a given case; and a new spgemm function for sparse×sparse=dense multiplication.
There is also a benchmark (spmm_mkl_comparison.cc) that demonstrates the MKL vs. hand-rolled performance gap and explains when the MKL path is taken.
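For concreteness, here is a rough caller-side sketch of the accelerated path (illustrative only: the left_spmm argument order mirrors the dispatch hunk reviewed below, but the exact public signature and the CSRMatrix details should be treated as assumptions rather than quotes from this PR):

#include <RandBLAS.hh>

// Hypothetical helper: C = 1.0*A*B + 0.0*C with sparse A on the left.
// Storing A in CSR is what lets the MKL path engage once RandBLAS_HAS_MKL is
// defined and A's index type matches MKL_INT; otherwise the dispatch layer
// quietly falls back to the hand-rolled kernel with identical results.
template <typename T>
void left_multiply_example(
    int64_t d, int64_t m, int64_t n,
    RandBLAS::sparse_data::CSRMatrix<T> &A,  // d-by-m, CSR for left_spmm
    const T *B, int64_t ldb,                 // m-by-n dense, column-major
    T *C, int64_t ldc                        // d-by-n dense, column-major
) {
    RandBLAS::sparse_data::left_spmm(
        blas::Layout::ColMajor, blas::Op::NoTrans, blas::Op::NoTrans,
        d, n, m, (T) 1.0, A, 0, 0, B, ldb, (T) 0.0, C, ldc
    );
}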

This PR also fixes a pre-existing latent bug in the public RandBLAS::spmm dense×sparse API wrapper: it had an extra argument and would have caused a compile error for anyone who tried to use it. Tests for that API have been added to prevent regressions.

@mmelnich mmelnich changed the title from Spmm investigation to MKL spmm and spgemm integration on Feb 4, 2026
Contributor Author

mmelnich commented Feb 6, 2026

@rileyjmurray lmk your thoughts

@mmelnich mmelnich requested a review from rileyjmurray February 6, 2026 15:52
Contributor

@rileyjmurray rileyjmurray left a comment


Looks pretty good!

See comments for some requested changes.

I'm also going to ask that you update the web documentation to include spgemm on this page and a discussion of MKL on this page. You'll also need to update the Limitations page, here, to mention that spgemm requires a third-party library (currently only MKL). As appropriate, make sure to mention that only single and double precision are supported, in contrast to RandBLAS' own kernels, which work with any scalar type.

^ All the web docs are defined here.

Contributor

sparse-data-matrices is just supposed to hold matrix files. Please leave it in the git-ignore. (I'm not sure why sparse-low-rank-approx subfolders would ever be created. Maybe if you tried to build from that directory?)

Comment on lines +56 to +66
#cmakedefine RandBLAS_HAS_MKL
// ^ CMake determines whether or not to #define RandBLAS_HAS_MKL
//
// This is set when BLAS++ was built with Intel MKL and the MKL sparse
// BLAS header (mkl_spblas.h) is found. When defined, RandBLAS uses
// MKL's Inspector-Executor sparse BLAS for accelerated sparse matrix
// operations (sparse x dense and sparse x sparse multiplication).
//
// If you don't want to use CMake, define this only if you are linking
// to Intel MKL and have mkl_spblas.h available.
//
Contributor

Glad you caught this!
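For readers not using CMake, the quoted guidance boils down to something like the following (a sketch; it assumes you really are compiling and linking against Intel MKL, with mkl_spblas.h on the include path, and that the macro is defined consistently across translation units):

// Either pass -DRandBLAS_HAS_MKL on the compile command line, or define the
// macro before pulling in RandBLAS. Leave it undefined when MKL is not linked,
// so the hand-rolled sparse kernels are used instead.
#define RandBLAS_HAS_MKL
#include <RandBLAS.hh>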

#include <algorithm>
#include <numeric>

// Include internal headers for direct kernel access
Contributor

It seems this file invokes spmm_left and spmm_right, not direct kernel calls.

Contributor

I want to make sure this executes properly even if MKL is not available. Can you add a continuous integration build that runs this example with small dimension sizes? Or maybe just update our CI builds so that this example runs on all platforms.

Contributor

sparse-data-matrices is just supposed to hold matrix files themselves, not any executables. Please create a new folder called something like "simple-kernel-benchmarks". It's okay if that folder only contains this one file.

    }
    MKLSparseHandle& operator=(MKLSparseHandle&& other) noexcept {
        if (this != &other) {
            if (handle) mkl_sparse_destroy(handle);
Contributor

Does mkl_sparse_destroy try to free attached memory?

            colptr, colptr + 1, rowidxs, A.vals
        );
    } else {
        static_assert(sizeof(T) == 0, "MKL sparse BLAS only supports float and double.");
Contributor

Same comment here as with the static_assert in make_mkl_handle_csr.

            (MKL_INT)A.nnz, rows, cols, A.vals
        );
    } else {
        static_assert(sizeof(T) == 0, "MKL sparse BLAS only supports float and double.");
Contributor

Same comment as make_mkl_handle_cs[r/c].

    } else if constexpr (is_coo) {
        return make_mkl_handle_coo(A);
    } else {
        static_assert(sizeof(SpMat) == 0, "Unsupported sparse matrix format for MKL backend.");
Contributor

Same comment as other static_asserts.

Comment on lines +127 to +141
    // Try MKL-accelerated path if available.
#if defined(RandBLAS_HAS_MKL)
    if constexpr (sizeof(typename SpMat::index_t) == sizeof(MKL_INT)) {
        // mkl_left_spmm returns false if it can't handle this case
        // (e.g., COO with submatrix offsets, CSC format).
        // Beta is already applied to C above, so pass beta=1 to MKL
        // so it adds alpha*A*B to the existing (pre-scaled) C.
        bool handled = RandBLAS::sparse_data::mkl::mkl_left_spmm(
            layout, Op::NoTrans, opB, d, n, m, alpha,
            A, ro_a, co_a, B, ldb, (T)1, C, ldc
        );
        if (handled)
            return;
    }
#endif
Contributor

I'm okay with this implementation for now. Things will get messy if we add support for other sparse matrix libraries (e.g., from Arm or AMD). In the PR description please outline a plan for adding that support in a way that doesn't significantly complicate the spmm_dispatch kernels. (Or explain why you think we should accept the possibility of such complication.)
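One possible shape for such a plan, purely as an illustration and not something this PR defines (try_accelerated_left_spmm is a hypothetical name): keep a single vendor-neutral shim in front of the kernels, so spmm_dispatch makes exactly one call regardless of how many backends are compiled in.

// Hypothetical sketch: each backend exposes a *_left_spmm that returns false
// when it cannot handle the request, and spmm_dispatch only ever calls this shim.
template <typename T, typename SpMat>
bool try_accelerated_left_spmm(
    blas::Layout layout, blas::Op opA, blas::Op opB,
    int64_t d, int64_t n, int64_t m, T alpha,
    SpMat &A, int64_t ro_a, int64_t co_a,
    const T *B, int64_t ldb, T beta, T *C, int64_t ldc
) {
#if defined(RandBLAS_HAS_MKL)
    if constexpr (sizeof(typename SpMat::index_t) == sizeof(MKL_INT)) {
        if (RandBLAS::sparse_data::mkl::mkl_left_spmm(
                layout, opA, opB, d, n, m, alpha, A, ro_a, co_a, B, ldb, beta, C, ldc))
            return true;
    }
#endif
    // Additional vendors (Arm, AMD, ...) would add their own guarded blocks here,
    // without touching the calling kernels.
    return false;  // caller falls back to the hand-rolled kernels
}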
