Matmul Kernel Optimization & Tensor Parallel Communication

This project builds directly on the machine-learning systems assignments from CSE 234.