Highly accurate protein structure prediction with AlphaFold

Alexander Pritzel, Alex Bridgland, Anna Potapenko, Augustin Žídek, Clemens Meyer, John Jumper + 2 more · 2021 · Nature · DOI 10.1038/s41586-021-03819-2

Mixed citation behavior. Most common role is background (64%).

53 Pith papers citing it

40.9k external citations · Crossref


citation-role summary

background 8 · method 2 · other 1

citation-polarity summary

background 7 · use method 2 · support 1 · unclear 1

authors

Alexander Pritzel, Alex Bridgland, Anna Potapenko, Augustin Žídek, Clemens Meyer, John Jumper, Kathryn Tunyasuvunakool, Michael Figurnov, Olaf Ronneberger, Richard Evans, Russ Bates, Tim Green

representative citing papers

ENSEMBITS: an alphabet of protein conformational ensembles

cs.LG · 2026-05-13 · unverdicted · novelty 8.0 · 2 refs

Ensembits is the first tokenizer of protein conformational ensembles that outperforms static tokenizers on RMSF prediction and matches them on function and mutation tasks while using less pretraining data.

Higher-Order Fourier Neural Operator: Explicit Mode Mixer for Nonlinear PDEs

cs.CE · 2026-06-26 · unverdicted · novelty 7.0

HO-FNO extends standard FNO with n-linear spectral mixing and shows improved accuracy on nonlinear PDE benchmarks, sometimes with a single layer beating deeper FNO models.

Deep Research in Physical Sciences: A Multi-Agent Framework and Comprehensive Benchmark

physics.comp-ph · 2026-06-17 · unverdicted · novelty 7.0

PhySciBench benchmark shows current AI models achieve at most 33.5% accuracy on physical science tasks; DelveAgent framework improves accuracy by up to 7.5 points and cuts costs to one-third.

Agentic Discovery of Non-Canonical Antimicrobial Peptides with AMPGAN v3

q-bio.QM · 2026-06-15 · unverdicted · novelty 7.0

AMPGAN v3 generates non-canonical AMPs with D-amino acids and modifications using two discriminators for stability, validated with two active candidates in vitro, alongside the PepCraft multi-agent discovery framework.

RATrain: A Resource-Aware Training Runtime for Large Language Models on Bandwidth-Constrained Heterogeneous Supercomputing Platforms

cs.DC · 2026-06-09 · unverdicted · novelty 7.0

RATrain introduces a resource-aware scheduler and MT-3000-specific backend for 1F1B LLM training that achieves 1.35x speedup and 97% scaling efficiency while preserving training correctness.

DPA4: Pushing the Accuracy-Cost Frontier of Interatomic Potentials with EMFA SO(2) Convolution

physics.chem-ph · 2026-06-01 · unverdicted · novelty 7.0

DPA4 is a new SE(3)-equivariant interatomic potential with EMFA SO(2) convolution that sets new accuracy-cost records on Matbench Discovery and SPICE benchmarks using fewer parameters than prior models.

Latent Process Generator Matching

cs.LG · 2026-05-19 · unverdicted · novelty 7.0

Presents a general framework for generator matching on projected image spaces from latent Markov processes, generalizing static latent results to dynamic conditional processes.

Entropy Across the Bridge: Conditional-Marginal Discretization for Flow and Schrödinger Samplers

cs.LG · 2026-05-15 · unverdicted · novelty 7.0

Derives a conditional-marginal entropy-rate objective for bridge-aware discretization that yields U-shaped schedules and improves low-NFE sample quality on 2D, CIFAR-10, and protein tasks.

From Mechanistic to Compositional Interpretability

cs.LG · 2026-05-09 · unverdicted · novelty 7.0

The paper introduces compositional interpretability as a category-theoretic framework that casts mechanistic explanations as commuting syntactic-semantic mappings optimized under faithfulness and complexity constraints derived from minimum description length.

ProteinJEPA: Latent prediction complements protein language models

cs.LG · 2026-05-08 · unverdicted · novelty 7.0

Masked-position MLM plus JEPA latent prediction outperforms MLM-only pretraining on 10-11 of 16 downstream tasks for 35M-150M protein models while JEPA alone fails.

TCD-Arena: Assessing Robustness of Time Series Causal Discovery Methods Against Assumption Violations

cs.LG · 2026-05-04 · unverdicted · novelty 7.0

TCD-Arena is a new customizable testing framework that runs millions of experiments to map how 33 different assumption violations affect time series causal discovery methods and shows ensembles can boost overall robustness.

Rates of forgetting for the sequentially Markov coalescent

math.PR · 2026-04-22 · unverdicted · novelty 7.0

SMC forgets its initial condition geometrically in the jump chain and as 1/ℓ in continuous genetic distance, justifying independent-locus approximations.

Dual Triangle Attention: Effective Bidirectional Attention Without Positional Embeddings

q-bio.QM · 2026-04-09 · unverdicted · novelty 7.0

Dual Triangle Attention achieves effective bidirectional attention with built-in positional inductive bias via dual triangular masks, outperforming standard bidirectional attention on position-sensitive tasks and showing strong masked language modeling results with or without positional embeddings.

Stochastic Thermodynamics of Associative Memory

cond-mat.stat-mech · 2026-01-03 · unverdicted · novelty 7.0

DenseAMs show tradeoffs between entropy production, retrieval accuracy, and speed at intermediate loads, with a new failure mode in higher-order networks at finite temperature.

Accelerating Inference for Multilayer Neural Networks with Quantum Computers

quant-ph · 2025-10-08 · unverdicted · novelty 7.0

Quantum circuits for coherent multilayer neural network inference achieve quadratic to polylogarithmic speedups over classical methods depending on quantum data access models for inputs and weights.

AlphaEvolve: A coding agent for scientific and algorithmic discovery

cs.AI · 2025-06-16 · unverdicted · novelty 7.0

AlphaEvolve is an LLM-orchestrated evolutionary coding agent that discovered a 4x4 complex matrix multiplication algorithm using 48 scalar multiplications, the first improvement over Strassen's algorithm in 56 years, plus optimizations for Google data centers and hardware.

Navigating committor landscape of biomolecules with a general pairwise interaction model

physics.comp-ph · 2026-06-30 · unverdicted · novelty 6.0

A novel neural architecture based on Pairformer is introduced for learning committor functions to better capture dynamical features in biomolecular rare events without specialized priors.

Scalable Message-Passing Quantum Graph Neural Networks in the Weisfeiler-Leman Hierarchy

quant-ph · 2026-06-25 · unverdicted · novelty 6.0

The work constructs a permutation-equivariant quantum GNN that implements message passing at selectable Weisfeiler-Leman levels, supports pre-training on small graphs, and demonstrates readout scalability with simulations up to 56 qubits on synthetic, molecular, and TSP datasets.

Identifying structural design principles shaping the computational abilities of recurrent neural networks

q-bio.NC · 2026-06-22 · unverdicted · novelty 6.0

Local 2- and 3-cycles enhance RNN computational capacity for Boolean functions, predicted by structural statistics, while adding interneurons boosts large networks.

JEDEL: Zero-Shot DNA-Encoded Library Design for Early-Stage Drug Discovery

q-bio.BM · 2026-06-21 · unverdicted · novelty 6.0

JEDEL maps pharmacophore patterns to scalable combinatorial synthesis routes for DNA-encoded libraries, producing focused libraries that outperform baselines on 18 targets in zero-shot mode.

The α-Index: A Penalized Authorship-Integrity Framework for Position-Weighted Scientific Contribution

cs.DL · 2026-06-21 · unverdicted · novelty 6.0

The α-index is a conserved position-weighted authorship framework with a senior-author penalty that decreases credit as the number of middle authors increases.

Early-Exit Graph Neural Networks for Link Prediction

cs.LG · 2026-06-20 · unverdicted · novelty 6.0

Early-exit GNNs for link prediction move the speed-quality Pareto frontier on the HeaRT benchmark by allowing implicit early exiting without auxiliary losses.

Flexible Kernels for Protein Property Prediction

cs.LG · 2026-06-09 · unverdicted · novelty 6.0

New class of sequence kernels for Gaussian processes that use substitution matrices and local linearity to enable data-efficient prediction of protein properties, with extensions to structure-aware multi-task learning.

PDE-Agents: An LLM-Orchestrated Multi-Agent Framework for Automated Finite Element Simulations with Knowledge Graph-Augmented Reasoning

physics.comp-ph · 2026-06-05 · unverdicted · novelty 6.0

PDE-Agents shows a LangGraph-orchestrated multi-agent LLM framework with GraphRAG that reaches 100% task success and perfect material fidelity on novel materials in ablation tests, with 97.8% success across 1369 production runs.

citing papers explorer

Showing 50 of 53 citing papers.

ENSEMBITS: an alphabet of protein conformational ensembles cs.LG · 2026-05-13 · unverdicted · none · ref 7 · 2 links
Ensembits is the first tokenizer of protein conformational ensembles that outperforms static tokenizers on RMSF prediction and matches them on function and mutation tasks while using less pretraining data.
Higher-Order Fourier Neural Operator: Explicit Mode Mixer for Nonlinear PDEs cs.CE · 2026-06-26 · unverdicted · none · ref 21
HO-FNO extends standard FNO with n-linear spectral mixing and shows improved accuracy on nonlinear PDE benchmarks, sometimes with a single layer beating deeper FNO models.
Deep Research in Physical Sciences: A Multi-Agent Framework and Comprehensive Benchmark physics.comp-ph · 2026-06-17 · unverdicted · none · ref 1
PhySciBench benchmark shows current AI models achieve at most 33.5% accuracy on physical science tasks; DelveAgent framework improves accuracy by up to 7.5 points and cuts costs to one-third.
Agentic Discovery of Non-Canonical Antimicrobial Peptides with AMPGAN v3 q-bio.QM · 2026-06-15 · unverdicted · none · ref 44
AMPGAN v3 generates non-canonical AMPs with D-amino acids and modifications using two discriminators for stability, validated with two active candidates in vitro, alongside the PepCraft multi-agent discovery framework.
RATrain: A Resource-Aware Training Runtime for Large Language Models on Bandwidth-Constrained Heterogeneous Supercomputing Platforms cs.DC · 2026-06-09 · unverdicted · none · ref 10
RATrain introduces a resource-aware scheduler and MT-3000-specific backend for 1F1B LLM training that achieves 1.35x speedup and 97% scaling efficiency while preserving training correctness.
DPA4: Pushing the Accuracy-Cost Frontier of Interatomic Potentials with EMFA SO(2) Convolution physics.chem-ph · 2026-06-01 · unverdicted · none · ref 94
DPA4 is a new SE(3)-equivariant interatomic potential with EMFA SO(2) convolution that sets new accuracy-cost records on Matbench Discovery and SPICE benchmarks using fewer parameters than prior models.
Latent Process Generator Matching cs.LG · 2026-05-19 · unverdicted · none · ref 18
Presents a general framework for generator matching on projected image spaces from latent Markov processes, generalizing static latent results to dynamic conditional processes.
Entropy Across the Bridge: Conditional-Marginal Discretization for Flow and Schrödinger Samplers cs.LG · 2026-05-15 · unverdicted · none · ref 19
Derives a conditional-marginal entropy-rate objective for bridge-aware discretization that yields U-shaped schedules and improves low-NFE sample quality on 2D, CIFAR-10, and protein tasks.
From Mechanistic to Compositional Interpretability cs.LG · 2026-05-09 · unverdicted · none · ref 84
The paper introduces compositional interpretability as a category-theoretic framework that casts mechanistic explanations as commuting syntactic-semantic mappings optimized under faithfulness and complexity constraints derived from minimum description length.
ProteinJEPA: Latent prediction complements protein language models cs.LG · 2026-05-08 · unverdicted · none · ref 9
Masked-position MLM plus JEPA latent prediction outperforms MLM-only pretraining on 10-11 of 16 downstream tasks for 35M-150M protein models while JEPA alone fails.
TCD-Arena: Assessing Robustness of Time Series Causal Discovery Methods Against Assumption Violations cs.LG · 2026-05-04 · unverdicted · none · ref 150
TCD-Arena is a new customizable testing framework that runs millions of experiments to map how 33 different assumption violations affect time series causal discovery methods and shows ensembles can boost overall robustness.
Rates of forgetting for the sequentially Markov coalescent math.PR · 2026-04-22 · unverdicted · none · ref 84
SMC forgets its initial condition geometrically in the jump chain and as 1/ℓ in continuous genetic distance, justifying independent-locus approximations.
Dual Triangle Attention: Effective Bidirectional Attention Without Positional Embeddings q-bio.QM · 2026-04-09 · unverdicted · none · ref 4
Dual Triangle Attention achieves effective bidirectional attention with built-in positional inductive bias via dual triangular masks, outperforming standard bidirectional attention on position-sensitive tasks and showing strong masked language modeling results with or without positional embeddings.
Stochastic Thermodynamics of Associative Memory cond-mat.stat-mech · 2026-01-03 · unverdicted · none · ref 32
DenseAMs show tradeoffs between entropy production, retrieval accuracy, and speed at intermediate loads, with a new failure mode in higher-order networks at finite temperature.
Accelerating Inference for Multilayer Neural Networks with Quantum Computers quant-ph · 2025-10-08 · unverdicted · none · ref 12
Quantum circuits for coherent multilayer neural network inference achieve quadratic to polylogarithmic speedups over classical methods depending on quantum data access models for inputs and weights.
AlphaEvolve: A coding agent for scientific and algorithmic discovery cs.AI · 2025-06-16 · unverdicted · none · ref 47
AlphaEvolve is an LLM-orchestrated evolutionary coding agent that discovered a 4x4 complex matrix multiplication algorithm using 48 scalar multiplications, the first improvement over Strassen's algorithm in 56 years, plus optimizations for Google data centers and hardware.
Navigating committor landscape of biomolecules with a general pairwise interaction model physics.comp-ph · 2026-06-30 · unverdicted · none · ref 27
A novel neural architecture based on Pairformer is introduced for learning committor functions to better capture dynamical features in biomolecular rare events without specialized priors.
Scalable Message-Passing Quantum Graph Neural Networks in the Weisfeiler-Leman Hierarchy quant-ph · 2026-06-25 · unverdicted · none · ref 2
The work constructs a permutation-equivariant quantum GNN that implements message passing at selectable Weisfeiler-Leman levels, supports pre-training on small graphs, and demonstrates readout scalability with simulations up to 56 qubits on synthetic, molecular, and TSP datasets.
Identifying structural design principles shaping the computational abilities of recurrent neural networks q-bio.NC · 2026-06-22 · unverdicted · none · ref 10
Local 2- and 3-cycles enhance RNN computational capacity for Boolean functions, predicted by structural statistics, while adding interneurons boosts large networks.
JEDEL: Zero-Shot DNA-Encoded Library Design for Early-Stage Drug Discovery q-bio.BM · 2026-06-21 · unverdicted · none · ref 19
JEDEL maps pharmacophore patterns to scalable combinatorial synthesis routes for DNA-encoded libraries, producing focused libraries that outperform baselines on 18 targets in zero-shot mode.
The α-Index: A Penalized Authorship-Integrity Framework for Position-Weighted Scientific Contribution cs.DL · 2026-06-21 · unverdicted · none · ref 21
The α-index is a conserved position-weighted authorship framework with a senior-author penalty that decreases credit as the number of middle authors increases.
Early-Exit Graph Neural Networks for Link Prediction cs.LG · 2026-06-20 · unverdicted · none · ref 2
Early-exit GNNs for link prediction move the speed-quality Pareto frontier on the HeaRT benchmark by allowing implicit early exiting without auxiliary losses.
Flexible Kernels for Protein Property Prediction cs.LG · 2026-06-09 · unverdicted · none · ref 2
New class of sequence kernels for Gaussian processes that use substitution matrices and local linearity to enable data-efficient prediction of protein properties, with extensions to structure-aware multi-task learning.
PDE-Agents: An LLM-Orchestrated Multi-Agent Framework for Automated Finite Element Simulations with Knowledge Graph-Augmented Reasoning physics.comp-ph · 2026-06-05 · unverdicted · none · ref 8
PDE-Agents shows a LangGraph-orchestrated multi-agent LLM framework with GraphRAG that reaches 100% task success and perfect material fidelity on novel materials in ablation tests, with 97.8% success across 1369 production runs.
Methods for Inferring Interaction Potentials from Cross-Linking Mass Spectrometry Data physics.chem-ph · 2026-06-04 · unverdicted · none · ref 23
Develops and tests algorithms adapting inverse Henderson problem solvers to parameterize multi-component interaction potentials from XL-MS data in homogeneous and three-phase systems.
Towards Understanding Self-Pretraining for Sequence Classification cs.LG · 2026-05-20 · unverdicted · none · ref 15
Self-pretraining improves Transformer sequence classification by enabling learning of proximity-biased attention from positional encodings that label supervision alone cannot easily acquire from random starts.
CrystalBoltz: End-to-End Protein Structure Determination via Experiment-Guided Diffusion for X-Ray Crystallography cs.LG · 2026-05-15 · unverdicted · none · ref 15
CrystalBoltz performs experiment-guided posterior sampling with diffusion models on structure-factor amplitudes for protein structure determination, reporting lower RMSD and R-factors than baselines with 33x faster runtime.
NOVA: Fundamental Limits of Knowledge Discovery Through AI cs.AI · 2026-05-12 · unverdicted · none · ref 6 · 2 links
NOVA models the generate-verify-accumulate-retrain loop and proves cumulative discovery cost scales as Theta(c_gen D^alpha) under Zipf tail equivalence with alpha greater than 1.
ShardTensor: Domain Parallelism for Scientific Machine Learning cs.DC · 2026-05-11 · unverdicted · none · ref 15
ShardTensor is a domain-parallelism system for SciML that enables flexible scaling of extreme-resolution spatial datasets by removing the constraint of batch size one per device.
Supercharging Bayesian Inference with Reliable AI-Informed Priors stat.ML · 2026-05-11 · unverdicted · none · ref 10
Rectified AI priors, obtained by correcting AI-induced data laws before embedding them in techniques like Dirichlet process priors, reduce bias, improve credible interval coverage, and boost performance in tasks like skin disease classification.
A physics-informed neural network approach to solve the spatially inhomogeneous electron Boltzmann equation physics.plasm-ph · 2026-05-05 · unverdicted · none · ref 16
A specialized PINN architecture solves the spatially inhomogeneous electron Boltzmann equation with high accuracy across gases and electric field strengths without case-specific tuning.
Flashlight: PyTorch Compiler Extensions to Accelerate Attention Variants cs.LG · 2025-11-03 · unverdicted · none · ref 12
Flashlight is a compiler-native PyTorch framework that generates efficient fused kernels for arbitrary and data-dependent attention variants, supporting more cases than FlexAttention with competitive performance.
Fast and Interpretable Protein Substructure Alignment via Optimal Transport q-bio.QM · 2025-10-12 · unverdicted · none · ref 13
PLASMA applies regularized optimal transport with Sinkhorn iterations to produce fast, interpretable residue-level alignments and similarity scores between protein structures.
Improving Inverse Folding for Peptide Design with Diversity-regularized Direct Preference Optimization cs.LG · 2024-10-25 · unverdicted · none · ref 19
Diversity-regularized DPO fine-tuning of ProteinMPNN improves structural similarity scores by at least 8% over base model and sequence diversity by up to 20% over standard DPO for peptide inverse folding on OpenFold structures.
HSAP: A Hierarchical Sequence-aware Parallelism for Hybrid-Context Generative Models cs.LG · 2026-06-29 · unverdicted · none · ref 32 · 2 links
HSAP introduces a hierarchical framework and sequence-aware algorithm with JIT-optimized NCCL communication to enable correct causal attention computation on hybrid-context packed sequences without limiting parallelism.
The FIL Hypothesis: Inductive Biases Help with Kernel Engineering cs.AI · 2026-06-29 · unverdicted · none · ref 29
The FIL Hypothesis claims that inductive biases outperform purely data-driven methods on GPU programming tasks with non-trivial feedback loops.
Multiscale reconstruction of protein conformations from cryo-EM images eess.IV · 2026-06-16 · unverdicted · none · ref 116
A multiscale optimization method using explicit protein backbone geometry reconstructs atomic models from cryo-EM data, showing improved RMSD and TM scores on three simulated datasets.
Instrumented data for causal scientific machine learning cs.LG · 2026-06-05 · unverdicted · none · ref 5
Instrumented data augments observations with mechanistic models, uncertainty, and counterfactuals to enable causal interventions via Pearl's do-operator in scientific machine learning.
DeltaDiff: Training-Free, Physics-Guided Machine Learning for Predicting Mutant Protein Structures physics.chem-ph · 2026-06-03 · unverdicted · none · ref 18
DeltaDiff is a physics-guided inference method that predicts mutant protein structures from a baseline diffusion model without retraining, tested on three systems with nonlocal changes.
MOSAIC: Efficient Mixture-of-Agent Scheduling via Adaptive Aggregation and Inference Concurrency cs.LG · 2026-06-02 · unverdicted · none · ref 79
MOSAIC uses an Integer Linear Program scheduler for expert placement and prompt assignment plus adaptive aggregation to achieve 1.7-2.3x end-to-end speedup on 4-GPU MoA workloads while keeping accuracy within 0.1pp.
AIBuildAI-2: A Knowledge-Enhanced Agent for Automatically Building AI Models cs.AI · 2026-05-27 · unverdicted · none · ref 6
AIBuildAI-2 introduces a knowledge-enhanced agent with a hierarchical evolving external knowledge base that dynamically loads relevant AI development expertise, achieving first place on MLE-Bench at 70.7% medal rate.
Enabling Structure-Only Initialization and Out-of-Distribution Generalization in GNN-based Molecular Dynamics Simulators physics.chem-ph · 2026-05-10 · unverdicted · none · ref 145
GNN-based MD simulators achieve stable structure-only initialization and reliable OOD generalization through inference-time physics optimization and a GNN barostat on elastic network compression tasks.
Experiment-as-Code Labs: A Declarative Stack for AI-Driven Scientific Discovery eess.SY · 2026-05-06 · unverdicted · none · ref 60 · 2 links
The paper introduces Experiment-as-Code Labs as a declarative stack synthesizing AI agents, systems orchestration, and physical lab control for AI-driven discovery.
Benchmarking open-source tools for in silico antiviral drug discovery q-bio.BM · 2026-05-05 · conditional · none · ref 155
Boltz-2 and fine-tuned DrugFormDTA lead ML-based binding prediction while GNINA leads docking tools on a cleaned antiviral dataset, with performance varying by viral protein.
MIRA: A Score for Conditional Distribution Accuracy and Model Comparison stat.ML · 2026-05-03 · unverdicted · none · ref 141
MIRA is a new analytic score for conditional distribution accuracy derived from equal probability mass assignment, enabling Bayesian model comparison via direct posterior validation.
Sampling Parallelism for Fast and Efficient Bayesian Learning cs.LG · 2026-04-06 · unverdicted · none · ref 20
Sampling parallelism distributes Bayesian sample evaluations across GPUs for near-perfect scaling, lower memory use, and faster convergence via per-GPU data augmentations, outperforming pure data parallelism in diversity.
Galactica: A Large Language Model for Science cs.CL · 2022-11-16 · unverdicted · none · ref 186
Galactica, a science-specialized LLM, reports higher scores than GPT-3, Chinchilla, and PaLM on LaTeX knowledge, mathematical reasoning, and medical QA benchmarks while outperforming general models on BIG-bench.
Building Digital Societies as Ecosystems: How Recognition and Repeat Relationships Sustain Cross-Community Work in Open Source cs.CY · 2026-05-24 · unverdicted · none · ref 17
Cross-boundary collaboration in open source is sustained by a thin carrier layer of contributors and repeat relationships that increase pull request acceptance rates from 42% to 87%.
AIMBio-Mat: An AI-Native FAIR Platform for Closed-Loop Materials Discovery and Biomedical Translation physics.app-ph · 2026-05-20 · unverdicted · none · ref 15
AIMBio-Mat is a conceptual blueprint for an AI-native, FAIR, governance-aware decision layer that formulates biomedical-materials discovery as constrained multi-objective optimization under uncertainty.
The New Associationism: Lessons from Deep Learning cs.AI · 2026-05-19 · unverdicted · none · ref 19
Supervised learning across AI systems vindicates a uniform error-driven associationism for cognition, though operating inside advanced computational structures beyond classical associationist models.
