arxiv: 2601.16622 · v2 · pith:HJHCAEWEnew · submitted 2026-01-23 · 💻 cs.LG · cs.AI

E2Former-V2: On-the-Fly Equivariant Attention with Linear Activation Memory

classification 💻 cs.LG cs.AI
keywords equivariant e2former-v2 attention dense eaas edge introduce

Equivariant Graph Neural Networks (EGNNs) have become a widely used approach for modeling 3D atomistic systems. However, mainstream architectures face critical scalability bottlenecks due to the explicit construction of geometric features or dense tensor products on every edge. To overcome this, we introduce E2Former-V2, a scalable architecture that integrates algebraic sparsity with hardware-aware execution. We first propose Equivariant Axis-Aligned Sparsification (EAAS). EAAS builds on Wigner-$6j$ convolution by exploiting an $\mathrm{SO}(3) \rightarrow \mathrm{SO}(2)$ change of basis to transform computationally expensive dense tensor contractions into efficient, sparse parity re-indexing operations. Building on this representation, we introduce On-the-Fly Equivariant Attention, a fully node-centric mechanism implemented via a custom fused Triton kernel. By eliminating materialized edge tensors and maximizing SRAM utilization, our kernel achieves a 20$\times$ improvement in TFLOPS compared to standard implementations. Extensive experiments on the SPICE and OMol25 datasets demonstrate that E2Former-V2 maintains comparable predictive performance while notably accelerating inference. This work demonstrates that large equivariant transformers can be trained efficiently using widely accessible GPU platforms. The code is available at https://github.com/IQuestLab/UBio-MolFM/tree/e2formerv2.
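
The idea of avoiding materialized edge tensors can be illustrated with a small NumPy sketch. This is not the paper's Triton kernel and it is not equivariant: it uses plain scalar features, and the function names `attention_materialized` and `attention_streaming` are hypothetical. It only shows how a running (online) softmax lets each node accumulate its attention output edge by edge, so that the extra state grows with the number of nodes rather than the number of edges.

```python
import numpy as np

def attention_materialized(q, k, v, dst, src):
    """Baseline: stores one score and one value row per edge, O(E * d) memory."""
    n, d = q.shape
    scores = np.einsum("ed,ed->e", q[dst], k[src])  # (E,) one score per edge
    edge_values = v[src]                            # (E, d) materialized edge tensor
    out = np.zeros((n, d))
    for i in range(n):
        mask = dst == i
        if not mask.any():
            continue  # nodes without incoming edges keep a zero output
        w = np.exp(scores[mask] - scores[mask].max())
        out[i] = (w / w.sum()) @ edge_values[mask]
    return out

def attention_streaming(q, k, v, dst, src):
    """Node-centric variant: one pass over edges, O(N * d) running state only."""
    n, d = q.shape
    running_max = np.full(n, -np.inf)  # largest score seen so far per node
    running_sum = np.zeros(n)          # softmax denominator per node
    acc = np.zeros((n, d))             # unnormalized weighted sum per node
    for i, j in zip(dst, src):
        s = q[i] @ k[j]
        new_max = max(running_max[i], s)
        scale = np.exp(running_max[i] - new_max)  # equals 0 on a node's first edge
        p = np.exp(s - new_max)
        running_sum[i] = running_sum[i] * scale + p
        acc[i] = acc[i] * scale + p * v[j]
        running_max[i] = new_max
    has_edges = running_sum > 0
    acc[has_edges] /= running_sum[has_edges][:, None]
    return acc
```

Both functions compute the same softmax-weighted sum over each node's incoming edges. The second never allocates an array with a leading dimension of size E, which is the property the abstract attributes to the node-centric design. A fused GPU kernel would additionally keep the running state in on-chip memory, which this sketch does not model.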

This paper has not been read by Pith yet.

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. UBio-MolFM: A Universal Molecular Foundation Model for Bio-Systems

    physics.chem-ph 2026-02 unverdicted novelty 6.0

    UBio-MolFM achieves ab initio-level fidelity on large out-of-distribution biomolecular systems using a new multi-fidelity dataset, E2Former-V2 architecture, and three-stage curriculum learning.

  2. Machine Learning Interatomic Potentials: Advancing Open-Source Software for Efficient and Scalable Molecular Simulation

    physics.chem-ph 2026-05 unverdicted novelty 4.0

    mlip v2 is a new software release that integrates API redesign, e3j backend, eSEN model, improved charge modeling, and expanded simulation capabilities to support larger-scale molecular modeling.