deval deliwala
About
I am a rising master's student at Stanford ICME. I am learning to write elegant but fast numerical code.
Contact
devald [at] stanford [dot] edu
linalg-kernels
replacing BLAS descriptors with safe and contiguous Rust views.
the design principle of the level-1 BLAS kernels.
a fast and generic general matrix-vector multiplication.
designing a triangular matrix-vector multiply microkernel.
GEMM notes
smaller notes documenting progress on GEMM.
simplifying the BLAS GEMM hierarchy for contiguous matrices.
two GEMMs walk into a compiler.
benchmarks for no-transpose GEMM relative to BLIS, OpenBLAS, and faer.
all benchmarks