https://arxiv.org/pdf/2406.02528
TL;DR: according to the findings in this paper, there is a viable path to removing matmul even from large models.
Last updated: Nov 22 2024 at 16:03 UTC