arxiv: 2605.06057 · 2 revisions
FalconGEMM: Surpassing Hardware Peaks with Lower-Complexity Matrix Multiplication