https://arxiv.org/pdf/2406.02528
TLDR the removal of matmul even on large models has a viable pathway according to the findings in this paper.
Last updated: Jan 24 2025 at 00:11 UTC