Expander SAEs apply left-d-regular expander masks to TopK SAEs, learning only dn decoder parameters instead of mn and tracing a storage-fidelity frontier that reaches 293x compression with 84% retained performance on Qwen2.5-3B.
Canonical reference
Title resolution pending
Canonical reference. 80% of citing Pith papers cite this work as background.
citation-role summary
citation-polarity summary
authors
co-cited works
representative citing papers
The normalized orbit of a bounded normal operator can be a frame, providing a counterexample to Conjecture 3.
Introduces the largest freely available Italian clinical notes corpus with 4M notes and expert-annotated subset for a new CRF-filling benchmark.
Machine learning methods discover a new noncrossing-partition statistic interpreting q,t-Narayana polynomials and yield a combinatorial proof of their symmetry.
SLayerGen generates crystals invariant to any space or layer group via autoregressive lattice and Wyckoff sampling plus equivariant diffusion, achieving gains over bulk models on diperiodic materials after correcting a prior loss inconsistency for hexagonal groups.
Every proper minor-closed graph class admits an optimal (1+o(1)) log n bit adjacency labeling scheme.
A directed weighted two-graph model separates feasibility from movement in solution discovery and yields a detailed complexity classification for path and shortest-path discovery.
The method reformulates ALE mesh motion as independent multi-patch spline parameterizations per time step, using barrier functions, tangential-slip reparameterization, and constant-preserving quasi-interpolation to enable large-rotation FSI simulations.
Superconductivity in high-pressure MnB4 is induced by altermagnetic spin fluctuations, yielding extended-s pairing symmetry.
A new qubit-efficient HUBO encoding for graph partitioning problems like minimum coloring uses logarithmic bits and a lexicographic penalty to cut resources while providing provable optimality conditions.
A survey of 172 open educational datasets from 204 papers across LAK, EDM, and AIED conferences reveals trends, 143 previously uncatalogued datasets, field gaps, and an 8-item PRACTICE checklist for better data publication.
A microlocal lift of Navier-Stokes dynamics on manifolds yields an if-and-only-if geometric criterion for solution blow-up in terms of deformation integrability, directional entropy, and lifted energy.
A 9U CubeSat detector can identify a thermonuclear weapon on a satellite from 4 km away by observing spallation neutrons induced by GeV protons in roughly one week.
O(n log n) algorithm and matching Omega(n log n) lower bound for partitioning a simple polygon's boundary into the minimum number of contiguous visible segments.
Introduces a method to design structure-specific relational inductive biases for a base transformer architecture, enabling end-to-end transcription of documents with intrinsic structures, demonstrated on sheet music, shape drawings, and mechanical engineering drawings.
The paper introduces a probabilistic sign rule for quotients of positive series and integral transforms that reduces monotonicity, log-supermodularity, and log-convexity to kernel sign criteria via moment identities, and applies it to derive new inequalities for hypergeometric, Stieltjes, and Prabha
SPoILeR uses multimodal pre-training to enable accurate novel view synthesis of infrared, polarimetric, and multispectral data from RGB-supervised fine-tuning on new scenes.
NEvo performs evolutionary search guided by a dynamic voxel-level encoding model to synthesize videos that maximize predicted activity in target brain ROIs, recovering known selectivities and revealing temporal dynamics differences.
Every n-vertex H-minor-free graph admits a 3-coloring with monochromatic components of size O_H(n^{4/9}).
The Spin-MInt algorithm is proven symplectic for general K electronic states via explicit verification of the condition MJM^T = J on the coadjoint orbit of the su(K) Lie-Poisson algebra.
The authors synthesize a typology of fourteen OSS sub-genres from a review of 3,925 papers and present a research agenda on cross-sub-genre generalization.
Structural identifiability analysis shows point sources restore identifiability for inferring spatial stochastic dynamics parameters from static snapshots, unlike distributed sources, with limits depending on modeling choices.
Maximal quantum leakage upper-bounds quantum inference accuracy; optimal encodings are pure states, with tight frames and equiangular tight frames optimal when system dimension is small.
In multistage SI(k)R models, the relationship between prevalence peak and weighted stage functional maxima varies with scaling of progression rates, converging under Erlang scaling to a delay model that justifies the factor-two approximation with error bounds and corrections.
citing papers explorer
-
Parameterization-driven arbitrary Lagrangian-Eulerian method for large-deformation isogeometric fluid-structure interaction
The method reformulates ALE mesh motion as independent multi-patch spline parameterizations per time step, using barrier functions, tangential-slip reparameterization, and constant-preserving quasi-interpolation to enable large-rotation FSI simulations.
-
Projection-Based Reconstruction for Achieving High-Order Accuracy from Low-Order DGSEM Simulations
A cP_n P_m scheme for DGSEM-LGL achieves m+1 convergence order via projected high-order components and a compact reconstruction operator that corrects the highest Legendre mode.
-
Entropy correction artificial viscosity for high order DG methods using multiple artificial viscosities
Multiple artificial viscosities with analytical optimal parameters enable more flexible entropy-stable DG simulations than monolithic viscosity for 1D and 2D problems.
-
A spectral-subspace-augmented POD-Galerkin method for parametrized PDEs with limited snapshot data
SS-POD augments standard POD-Galerkin with a spectral-subspace partition and local POD to achieve lower out-of-sample error than either plain POD or pure spectral-Galerkin when only a handful of snapshots are available.
-
Complementary families of approximating polynomials with applications to finite element methods applied to differential equations of arbitrary even spatial order
Complementary polynomial families are introduced to construct C^m finite element bases of order p >= 2m+2, with explicit formulas on canonical intervals, an extended interpolation error formula, and a superconvergence result for linear problems in H^{m+1}.
-
Adjoint-based Perfusion Estimation from Dynamic Contrast-Enhanced Ultrasound: Advection-Diffusion and Two-Compartment Models
Adjoint equations with Tikhonov regularization enable gradient-based reconstruction of perfusion parameters from DCE-US data in advection-diffusion and two-compartment models, validated on synthetic and in vivo data.
-
Kernel-based linear system identification using augmented Krylov subspaces
Augmented Krylov subspaces jointly approximate quadratic forms and log-dets for faster MLE-based hyperparameter tuning in kernel-based linear system identification.
-
Invariant domain preserving limiting of time explicit and time implicit discretizations for systems of conservation laws
A generalized flux-corrected transport limiter for systems of conservation laws enforces invariant domain preservation by expressing the high-order solution as a convex combination of low-order invariant-domain-preserving states, applicable to both explicit and implicit time discretizations.
-
On the Practical Impact of Local Linear Instabilities in Entropy-Stable Schemes
Local linear instabilities in entropy-stable discretizations cause negligible practical errors because their growth is small, oscillatory, boundary-localized, and suppressible, with no direct extension to nonlinear two-point-flux cases.
-
Adaptive anisotropic composite quadratures for residual minimisation in neural PDE approximations
An adaptive anisotropic composite quadrature strategy combined with refresh-based training narrows the gap between training and reference losses in neural residual minimization for PDEs while using quadrature points more efficiently.
-
A Shifted Cohesive-Zone Method for Non-Interface-Fitted Meshes with Applications to Crystal Plasticity
SCZM enables accurate cohesive interface modeling and crystal plasticity on non-interface-fitted meshes by shifting traction-separation laws to a nearby surrogate interface.
-
Neural parametric representations for thin-shell shape optimisation
A neural network with periodic activations parameterizes thin-shell mid-surfaces so that network weights can be optimized to minimize structural compliance subject to a volume limit.
-
A high order stabilization-free virtual element method for general second-order elliptic eigenvalue problem
A novel high-order stabilization-free virtual element method is developed for general second-order elliptic eigenvalue problems, with optimal a priori error estimates for eigenspaces and eigenvalues, validated on various polygonal meshes.
-
Efficient Krylov solvers for inverse source problem in 2D space-time fractional diffusion equation
GLT-based preconditioners are built for multilevel Toeplitz systems from finite-difference discretization of the quasi-boundary regularized inverse source problem, producing eigenvalue clustering around one and faster GMRES convergence.
-
Locally conservative redistribution limiting and applications to the approximation of conservation equations, Part II
Introduces a conservative, discretization- and PDE-agnostic redistribution limiting method for high-order approximations of conservation equations.
-
Multiscale Fourier Neural Operator for Inverse Wave Scattering in Highly Oscillatory Media
MscaleFNO learns mappings from oscillatory media to wavefields for Helmholtz inverse problems and pairs it with diffusion regularization for partial-aperture 2D reconstructions.
-
An indefinite LOBPCG type of algorithm for detecting a definite Hermitian matrix pair
A new subspace algorithm using LOBPCG-style subspaces detects definiteness of large Hermitian matrix pairs (A,B) with indefinite B faster than prior methods on medium, large, and banded cases.
-
Singularity Formation: Synergy in Theoretical, Numerical and Machine Learning Approaches
The work introduces a modulation-based analytical method for singularity proofs in singular PDEs and refines ML techniques like PINNs and KANs to identify blowup solutions, with application to the open 3D Keller-Segel problem.
-
Justification and structure- and asymptotic-preserving discretizations of a hyperbolized Cahn-Hilliard equation
Develops energy-stable asymptotic-preserving discretizations of a hyperbolized Cahn-Hilliard equation via SBP operators and IMEX Runge-Kutta methods guided by relative-energy error estimates.
-
Spectral Bounds for Tensors Derived from Trace Functionals and Wasserstein Distance in Tensor Spaces
Defines trace-based metric and Bures-Wasserstein distance for PSD tensors, derives spectral eigenvalue bounds, and analyzes dependence on PSD condition with examples and complexity.
-
Effect of different clustering approaches on the multilevel fast multipole method for the Helmholtz equation
Clustering approaches significantly affect the stability and efficiency of the multilevel fast multipole method for the Helmholtz equation, with poor choices causing instabilities on non-uniform meshes.
- A-priori error estimation for space-time Galerkin POD for linear evolution problems