https://arxiv.org/pdf/2406.02528
TLDR the removal of matmul even on large models has a viable pathway according to the findings in this paper.
Last updated: Dec 23 2024 at 12:05 UTC