deval deliwala


About

I am a rising masters student at Stanford ICME. I am learning to write elegant but fast numerical code.

Contact

devald [at] stanford [dot] edu

resume.pdf


linalg-kernels

LAK working note #1

replacing BLAS descriptors with safe and contiguous Rust views.

LAK working note #2

the design principle of the level-1 BLAS kernels.

LAK working note #3

a fast and generic general matrix-vector multiplication.

LAK working note #4

designing a triangular matrix-vector multiply microkernel.

GEMM notes

smaller notes documenting progress of GEMM.

GEMM note #1

simplifying the BLAS GEMM hierarchy for contiguous matrices.

GEMM note #2

two GEMMs walk into a compiler.

GEMM Benchmarks

benchmarks for no-transpose GEMM relative to BLIS, OpenBLAS, and faer.

LAK Benchmarks

all benchmarks