Generator: a long-context generative genomic foundation model

URL https: //arxiv · 2025 · arXiv 2502.07272

5 Pith papers cite this work. Polarity classification is still indexing.

5 Pith papers citing it

representative citing papers

ViroBench: Benchmarking Nucleotide Foundation Models on Viral Genomics Tasks

cs.LG · 2026-05-25 · unverdicted · novelty 7.0

ViroBench benchmarks 66 nucleotide foundation models on viral tasks, finding weak extrapolation under shifts, decoupling of likelihood from functional validity in generation, and greater value from taxonomic diversity than model scale.

LDARNet: DNA Adaptive Representation Network with Learnable Tokenization for Genomic Modeling

cs.CL · 2026-06-03 · unverdicted · novelty 6.0

LDARNet learns adaptive token boundaries via dynamic chunking in a genomic foundation model and reports gains on histone modification tasks over larger models.

AURORA: Contextual Orthogonalization for Geometric Representation Learning in Healthcare Foundation Models

cs.LG · 2026-05-18 · unverdicted · novelty 6.0

AURORA is a representation learning framework that uses contextual orthogonalization and relational alignment to create disentangled, geometrically interpretable latent spaces in healthcare foundation models.

WISTERIA: Learning Clinical Representations from Noisy Supervision via Multi-View Consistency in Electronic Health Records

cs.LG · 2026-05-10 · unverdicted · novelty 5.0

WISTERIA learns robust clinical representations from noisy EHR labels by enforcing consistency across multiple weak supervision views plus ontology regularization.

In Search of Lost DNA Sequence Pretraining

cs.LG · 2026-04-17 · unverdicted · novelty 5.0

DNA pretraining suffers from inappropriate evaluation datasets, flawed neighbor-masking, and neglected vocabulary design; the authors supply guidelines and a reproducible testbed to fix them.

citing papers explorer

Showing 1 of 1 citing paper after filters.

LDARNet: DNA Adaptive Representation Network with Learnable Tokenization for Genomic Modeling cs.CL · 2026-06-03 · unverdicted · none · ref 12
LDARNet learns adaptive token boundaries via dynamic chunking in a genomic foundation model and reports gains on histone modification tasks over larger models.

Generator: a long-context generative genomic foundation model

fields

years

verdicts

representative citing papers

citing papers explorer