pith. sign in

arxiv: 2605.30963 · v1 · pith:KVRLZ3M6new · submitted 2026-05-29 · 🧬 q-bio.BM · cs.AI

AMix-2: Establishing Protein as a Native Modality in Large Language Models

Pith reviewed 2026-06-28 20:02 UTC · model grok-4.3

classification 🧬 q-bio.BM cs.AI
keywords protein foundation modelprotein-text modelblock-wise diffusionsequence designprotein understandingunified modalityProteinArenadiffusion language modeling
0
0 comments X

The pith

AMix-2 unifies protein understanding and design in one foundation model by sharing token space with text and using block-wise diffusion modeling.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper presents AMix-2 as a way to make proteins a native part of large language models rather than a separate domain. It does this through a single formulation that puts protein sequences and natural language into the same token space so the model can both interpret biological data and generate new sequences based on text instructions. Instead of generating proteins strictly from left to right, the model uses block-wise diffusion that allows bidirectional information within blocks and refinement over iterations. This is evaluated on a new benchmark called ProteinArena that uses time-aware splits to test real generalization, where the model beats general LLMs and matches specialized protein tools. Experiments show the diffusion approach works better than standard autoregressive training for this task.

Core claim

AMix-2 establishes protein as a native modality in large language models by unifying protein understanding and sequence design within a single foundation model via a unified protein-text formulation and a block-wise diffusion language modeling backbone that combines causal generation across blocks with bidirectional context and iterative refinement within blocks.

What carries the argument

Block-wise diffusion language modeling backbone that enables causal generation across blocks while allowing bidirectional context and iterative refinement within each block, paired with a shared token space for protein sequences and text.

If this is right

  • One foundation model can replace multiple task-specific protein models for both understanding and design tasks.
  • Generation order flexibility from diffusion better suits the non-sequential nature of protein folding and function than strict autoregression.
  • Time-aware and homology-aware evaluation protocols reveal whether models truly generalize beyond training data patterns.
  • Controlled comparisons confirm that diffusion-based training outperforms autoregressive training on protein tasks.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • This setup might allow direct text-to-protein generation for applications like custom enzyme design without separate pipelines.
  • The approach could extend to other sequence modalities such as DNA or RNA in the same model.
  • ProteinArena may serve as a standard testbed for future protein foundation models to ensure fair comparisons.
  • Scaling the shared modality to larger models could lead to emergent capabilities in multi-step biological reasoning.

Load-bearing premise

Embedding natural language and protein sequences in a shared token space plus using block-wise diffusion rather than strict autoregressive factorization will produce performance on realistic generalization tasks that cannot be explained by training data or benchmark choices alone.

What would settle it

Training an autoregressive version of the same model on identical data and showing it matches or exceeds AMix-2 performance on ProteinArena under the time-aware and homology-aware splits.

read the original abstract

We present AMix-2, a protein-text foundation model that establishes protein as a native modality in large language models (LLMs), unifying protein understanding and sequence design within a single foundation model. AMix-2 is built upon two key ideas: (1) a unified protein-text formulation that embeds natural language and protein sequence in a shared token space, enabling one model to perform biological reasoning and conditional design instead of separate downstream task-specialized models; and (2) a block-wise diffusion language modeling backbone that combines causal generation across blocks with bidirectional context and iterative refinement within blocks. This scheme better matches the intrinsic nature of proteins than a strict left-to-right factorization. To evaluate protein foundation models under realistic generalization settings, we further introduce ProteinArena, a comprehensive benchmark with time-aware and homology-aware protocols across various understanding and design tasks, and with baselines covering classical bioinformatics tools, protein-specialized models and LLMs. On ProteinArena, AMix-2 outperforms frontier LLMs and demonstrates competitive performance to task-specific protein models. Controlled experiments further show that the diffusion-based paradigm generally surpasses its autoregressive counterpart, highlighting the advantage of flexible generation order for protein sequences. We release both AMix-2 and ProteinArena to facilitate open research in protein foundation models.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 0 minor

Summary. The manuscript presents AMix-2, a protein-text foundation model that establishes protein as a native modality in LLMs by unifying protein understanding and sequence design within a single model. It relies on a unified protein-text formulation embedding natural language and protein sequences in a shared token space, combined with a block-wise diffusion language modeling backbone that enables causal generation across blocks with bidirectional context and iterative refinement within blocks. The work introduces ProteinArena, a benchmark with time-aware and homology-aware protocols across understanding and design tasks, and reports that AMix-2 outperforms frontier LLMs while remaining competitive with task-specific protein models; controlled experiments indicate the diffusion paradigm generally surpasses its autoregressive counterpart.

Significance. If the performance claims hold under the stated evaluation protocols, the work would be significant for integrating protein sequences as a native modality in LLMs, enabling a single model for both biological reasoning and conditional design tasks rather than task-specialized models. The introduction of ProteinArena provides a standardized, realistic generalization benchmark, and the open release of both the model and benchmark facilitates reproducibility and community research in protein foundation models.

major comments (2)
  1. [Abstract] Abstract: the central claim of outperformance on ProteinArena and competitiveness with task-specific models is asserted without any quantitative metrics, ablation tables, error bars, or baseline numbers; this prevents verification of whether the reported advantage survives controls for data splits, homology leakage, or benchmark construction details.
  2. [Abstract] The weakest assumption—that shared token space plus block-wise diffusion yields generalization not explained by training data or benchmark construction—requires explicit evidence in the results; without reported numbers or controls isolating these factors, the diffusion advantage cannot be confirmed as load-bearing rather than data-dependent.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive feedback on the abstract. We agree that the abstract would be strengthened by including key quantitative results and will revise it accordingly in the next version. We address each major comment below.

read point-by-point responses
  1. Referee: [Abstract] Abstract: the central claim of outperformance on ProteinArena and competitiveness with task-specific models is asserted without any quantitative metrics, ablation tables, error bars, or baseline numbers; this prevents verification of whether the reported advantage survives controls for data splits, homology leakage, or benchmark construction details.

    Authors: We agree that the abstract currently lacks specific numbers. The main text (Sections 4 and 5, Tables 1-4, and Figure 3) reports the full quantitative results on ProteinArena, including comparisons to baselines, ablations, and controls for time-aware/homology-aware splits. In revision we will add concise performance deltas (e.g., average improvement over frontier LLMs and competitiveness metrics vs. task-specific models) plus a reference to the benchmark protocols directly into the abstract. revision: yes

  2. Referee: [Abstract] The weakest assumption—that shared token space plus block-wise diffusion yields generalization not explained by training data or benchmark construction—requires explicit evidence in the results; without reported numbers or controls isolating these factors, the diffusion advantage cannot be confirmed as load-bearing rather than data-dependent.

    Authors: The controlled diffusion-vs-autoregressive comparison is presented in Section 4.3 with the same training data and benchmark construction for both paradigms; the results show consistent gains for the block-wise diffusion approach across tasks. We will add a brief clause in the revised abstract that explicitly references these controlled experiments and the ProteinArena protocols (Section 3) to make the supporting evidence visible at the abstract level. revision: yes

Circularity Check

0 steps flagged

No significant circularity in derivation chain

full rationale

The paper presents AMix-2 as an empirical construction: a shared protein-text token space plus block-wise diffusion backbone, trained and evaluated on the newly introduced ProteinArena benchmark with time/homology-aware splits. No equations, fitted parameters, or self-citations are described that would reduce reported performance or the unification claim to a definition or tautology. The central results are controlled comparisons against external baselines (bioinformatics tools, specialized models, frontier LLMs), making the derivation self-contained rather than circular.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Abstract-only review supplies no equations, training details, or modeling assumptions; therefore the ledger cannot enumerate free parameters, axioms, or invented entities.

pith-pipeline@v0.9.1-grok · 5837 in / 1247 out tokens · 22359 ms · 2026-06-28T20:02:57.262861+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

74 extracted references · 32 canonical work pages · 8 internal anchors

  1. [1]

    Highly accurate protein structure prediction with alphafold.nature, 596(7873):583–589, 2021

    John Jumper, Richard Evans, Alexander Pritzel, Tim Green, Michael Figurnov, Olaf Ronneberger, Kathryn Tunyasuvunakool, Russ Bates, Augustin Žídek, Anna Potapenko, et al. Highly accurate protein structure prediction with alphafold.nature, 596(7873):583–589, 2021

  2. [2]

    Kinch, R

    Minkyung Baek, Frank DiMaio, Ivan Anishchenko, Justas Dauparas, Sergey Ovchinnikov, Gyu Rie Lee, Jue Wang, Qian Cong, Lisa N. Kinch, R. Dustin Schaeffer, Claudia Millán, Hahnbeom Park, Carson Adams, Caleb R. Glassman, Andy DeGiovanni, Jose H. Pereira, Andria V. Rodrigues, Alberdina A. van Dijk, Ana C. Ebrecht, Diederik J. Opperman, Theo Sagmeister, Christ...

  3. [3]

    Robust deep learning–based protein sequence design using proteinmpnn.Science, 378(6615):49–56, 2022

    Justas Dauparas, Ivan Anishchenko, Nathaniel Bennett, Hua Bai, Robert J Ragotte, Lukas F Milles, Basile IM Wicky, Alexis Courbet, Rob J de Haas, Neville Bethel, et al. Robust deep learning–based protein sequence design using proteinmpnn.Science, 378(6615):49–56, 2022

  4. [4]

    De novo design of protein structure and function with rfdiffusion.Nature, 620(7976):1089–1100, 2023

    Joseph L Watson, David Juergens, Nathaniel R Bennett, Brian L Trippe, Jason Yim, Helen E Eisenach, Woody Ahern, Andrew J Borst, Robert J Ragotte, Lukas F Milles, et al. De novo design of protein structure and function with rfdiffusion.Nature, 620(7976):1089–1100, 2023

  5. [5]

    Accurate structure prediction of biomolecular interactions with alphafold 3.Nature, pages 1–3, 2024

    Josh Abramson, Jonas Adler, Jack Dunger, Richard Evans, Tim Green, Alexander Pritzel, Olaf Ronneberger, Lindsay Willmore, Andrew J Ballard, Joshua Bambrick, et al. Accurate structure prediction of biomolecular interactions with alphafold 3.Nature, pages 1–3, 2024

  6. [6]

    Evolutionary-scale prediction of atomic-level protein structure with a language model.Science, 379(6637):1123–1130, 2023

    Zeming Lin, Halil Akin, Roshan Rao, Brian Hie, Zhongkai Zhu, Wenting Lu, Nikita Smetanin, Robert Verkuil, Ori Kabeli, Yaniv Shmueli, et al. Evolutionary-scale prediction of atomic-level protein structure with a language model.Science, 379(6637):1123–1130, 2023

  7. [7]

    Large language models generate functional protein sequences across diverse families.Nature Biotechnology, 41(8):1099–1106, 2023

    Ali Madani, Ben Krause, Eric R Greene, Subu Subramanian, Benjamin P Mohr, James M Holton, Jose Luis Olmos, Caiming Xiong, Zachary Z Sun, Richard Socher, et al. Large language models generate functional protein sequences across diverse families.Nature Biotechnology, 41(8):1099–1106, 2023

  8. [8]

    Saprot: Protein language modeling with structure-aware vocabulary.bioRxiv, pages 2023–10, 2023

    Jin Su, Chenchen Han, Yuyang Zhou, Junjie Shan, Xibin Zhou, and Fajie Yuan. Saprot: Protein language modeling with structure-aware vocabulary.bioRxiv, pages 2023–10, 2023

  9. [9]

    Protrek: Navigating the protein universe through tri-modal contrastive learning.bioRxiv, pages 2024–05, 2024

    Jin Su, Xibin Zhou, Xuting Zhang, and Fajie Yuan. Protrek: Navigating the protein universe through tri-modal contrastive learning.bioRxiv, pages 2024–05, 2024

  10. [10]

    Simulating 500 million years of evolution with a language model

    Thomas Hayes, Roshan Rao, Halil Akin, Nicholas J Sofroniew, Deniz Oktay, Zeming Lin, Robert Verkuil, Vincent Q Tran, Jonathan Deaton, Marius Wiggert, et al. Simulating 500 million years of evolution with a language model. Science, 387(6736):850–858, 2025

  11. [11]

    Basic local alignment search tool.Journal of molecular biology, 215(3):403–410, 1990

    Stephen F Altschul, Warren Gish, Webb Miller, Eugene W Myers, and David J Lipman. Basic local alignment search tool.Journal of molecular biology, 215(3):403–410, 1990

  12. [12]

    Mmseqs2 enables sensitive protein sequence searching for the analysis of massive data sets.Nature biotechnology, 35(11):1026–1028, 2017

    Martin Steinegger and Johannes Söding. Mmseqs2 enables sensitive protein sequence searching for the analysis of massive data sets.Nature biotechnology, 35(11):1026–1028, 2017

  13. [13]

    Fast and accurate protein structure search with foldseek.Nature biotechnology, 42(2):243–246, 2024

    Michel Van Kempen, Stephanie S Kim, Charlotte Tumescheit, Milot Mirdita, Jeongjae Lee, Cameron LM Gilchrist, Johannes Söding, and Martin Steinegger. Fast and accurate protein structure search with foldseek.Nature biotechnology, 42(2):243–246, 2024

  14. [14]

    Artificial intelligence methods for protein folding and design.Current Opinion in Structural Biology, 93:103066, 2025

    Zhidian Zhang, Chenxi Ou, Yehlin Cho, Yo Akiyama, and Sergey Ovchinnikov. Artificial intelligence methods for protein folding and design.Current Opinion in Structural Biology, 93:103066, 2025. ISSN 0959-440X. doi: https://doi.org/10.1016/j.sbi.2025.103066. URL https://www.sciencedirect.com/science/article/pii/ S0959440X25000843

  15. [15]

    Introducing claude 4.https://www.anthropic.com/news/claude-4, 2026

    Anthropic. Introducing claude 4.https://www.anthropic.com/news/claude-4, 2026

  16. [16]

    Gemini 3.1 pro.https://deepmind.google/models/gemini/pro, 2026

    Google DeepMind. Gemini 3.1 pro.https://deepmind.google/models/gemini/pro, 2026. 15

  17. [17]

    OpenAI GPT-5 System Card

    Aaditya Singh, Adam Fry, Adam Perelman, Adam Tart, Adi Ganesh, Ahmed El-Kishky, Aidan McLaughlin, Aiden Low, AJ Ostrow, Akhila Ananthram, et al. Openai gpt-5 system card.arXiv preprint arXiv:2601.03267, 2025

  18. [18]

    Deepseek-v4: Towards highly efficient million-token context intelligence.https://huggingface

    DeepSeek AI. Deepseek-v4: Towards highly efficient million-token context intelligence.https://huggingface. co/deepseek-ai/DeepSeek-V4-Pro, 2026

  19. [19]

    GLM-5: from Vibe Coding to Agentic Engineering

    Aohan Zeng, Xin Lv, Zhenyu Hou, Zhengxiao Du, Qinkai Zheng, Bin Chen, Da Yin, Chendi Ge, Chenghua Huang, Chengxing Xie, et al. Glm-5: from vibe coding to agentic engineering.arXiv preprint arXiv:2602.15763, 2026

  20. [20]

    Qwen3 Technical Report

    An Yang, Anfeng Li, Baosong Yang, Beichen Zhang, Binyuan Hui, Bo Zheng, Bowen Yu, Chang Gao, Chengen Huang, Chenxu Lv, Chujie Zheng, Dayiheng Liu, Fan Zhou, Fei Huang, Feng Hu, Hao Ge, Haoran Wei, Huan Lin, Jialong Tang, Jian Yang, Jianhong Tu, Jianwei Zhang, Jianxin Yang, Jiaxi Yang, Jing Zhou, Jingren Zhou, Junyang Lin, Kai Dang, Keqin Bao, Kexin Yang, ...

  21. [21]

    Biomni: A general-purpose biomedical ai agent.bioRxiv, pages 2025–05, 2025

    Kexin Huang, Serena Zhang, Hanchen Wang, Yuanhao Qu, Yingzhou Lu, Yusuf Roohani, Ryan Li, Lin Qiu, Junze Zhang, Yin Di, et al. Biomni: A general-purpose biomedical ai agent.bioRxiv, pages 2025–05, 2025

  22. [22]

    Stella: Towards a biomedical world model with self-evolving multimodal agents.bioRxiv, 2026

    Ruofan Jin, Mingyang Xu, Fei Meng, Guancheng Wan, Qingran Cai, Yize Jiang, Jin Han, Yuanyuan Chen, Wanqing Lu, Mengyang Wang, Zhiqian Lan, Yuxuan Jiang, Junhong Liu, Dongyao Wang, Le Cong, and Zaixi Zhang. Stella: Towards a biomedical world model with self-evolving multimodal agents.bioRxiv, 2025. doi: 10.1101/2025.07.01.662467

  23. [23]

    Maddison, and Bo Wang

    Adibvafa Fallahpour, Andrew Magnuson, Purav Gupta, Shihao Ma, Jack Naimer, Arnav Shah, Haonan Duan, Omar Ibrahim, Hani Goodarzi, Chris J. Maddison, and Bo Wang. Bioreason: Incentivizing multimodal biological reasoning within a dna-llm model, 2025. URLhttps://arxiv.org/abs/2505.23579

  24. [24]

    Alexander Rives, Joshua Meier, Tom Sercu, Siddharth Goyal, Zeming Lin, Jason Liu, Demi Guo, Myle Ott, C Lawrence Zitnick, Jerry Ma, et al. Biological structure and function emerge from scaling unsupervised learning to 250 million protein sequences.Proceedings of the National Academy of Sciences, 118(15):e2016239118, 2021

  25. [25]

    Msa transformer

    Roshan M Rao, Jason Liu, Robert Verkuil, Joshua Meier, John Canny, Pieter Abbeel, Tom Sercu, and Alexander Rives. Msa transformer. InInternational Conference on Machine Learning, pages 8844–8856. PMLR, 2021

  26. [26]

    Transformer protein language models are unsupervised structure learners

    Roshan Rao, Joshua Meier, Tom Sercu, Sergey Ovchinnikov, and Alexander Rives. Transformer protein language models are unsupervised structure learners. InInternational Conference on Learning Representations, 2021. URLhttps://openreview.net/forum?id=fylclEqgvgd

  27. [27]

    Diffusion language models are versatile protein learners.arXiv preprint arXiv:2402.18567, 2024

    Xinyou Wang, Zaixiang Zheng, Fei Ye, Dongyu Xue, Shujian Huang, and Quanquan Gu. Diffusion language models are versatile protein learners.arXiv preprint arXiv:2402.18567, 2024

  28. [28]

    Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models

    Marianne Arriola, Aaron Gokaslan, Justin T Chiu, Zhihan Yang, Zhixuan Qi, Jiaqi Han, Subham Sekhar Sahoo, and Volodymyr Kuleshov. Block diffusion: Interpolating between autoregressive and diffusion language models. InThe Thirteenth International Conference on Learning Representations, 2025. URL https: //arxiv.org/abs/2503.09573

  29. [29]

    Seed Diffusion: A Large-Scale Diffusion Language Model with High-Speed Inference

    Yuxuan Song, Zheng Zhang, Cheng Luo, Pengyang Gao, Fan Xia, Hao Luo, Zheng Li, Yuehang Yang, Hongli Yu, Xingwei Qu, Yuwei Fu, Jing Su, Ge Zhang, Wenhao Huang, Mingxuan Wang, Lin Yan, Xiaoying Jia, Jingjing Liu, Wei-Ying Ma, Ya-Qin Zhang, Yonghui Wu, and Hao Zhou. Seed diffusion: A large-scale diffusion language model with high-speed inference, 2025. URLht...

  30. [30]

    10 Two-Stage Fine-Tuning for Protein Sequence Generation with Targeted Amino-Acid Composition Thumuluri, V ., Almagro Armenteros, J

    The UniProt Consortium. Uniprot: the universal protein knowledgebase in 2023.Nucleic Acids Research, 51 (D1):D523–D531, 01 2023. ISSN 0305-1048. doi: 10.1093/nar/gkac1052. URLhttps://doi.org/10.1093/nar/ gkac1052

  31. [31]

    Care: a benchmark suite for the classification and retrieval of enzymes.Advances in Neural Information Processing Systems, 37:3094–3121, 2024

    Jason Yang, Ariane Mora, Shengchao Liu, Bruce J Wittmann, Anima Anandkumar, Frances H Arnold, and Yisong Yue. Care: a benchmark suite for the classification and retrieval of enzymes.Advances in Neural Information Processing Systems, 37:3094–3121, 2024. 16

  32. [32]

    Vaishali P Waman, Nicola Bordin, Andy Lau, Shaun Kandathil, Jude Wells, David Miller, Sameer Velankar, David T Jones, Ian Sillitoe, and Christine Orengo. Cath v4. 4: major expansion of cath by experimental and predicted structural data.Nucleic Acids Research, 53(D1):D348–D355, 2025

  33. [33]

    Pdfbench: A benchmark for de novo protein design from function.arXiv preprint arXiv:2505.20346, 2025

    Jiahao Kuang, Nuowei Liu, Jie Wang, Changzhi Sun, Tao Ji, and Yuanbin Wu. Pdfbench: A benchmark for de novo protein design from function.arXiv preprint arXiv:2505.20346, 2025

  34. [34]

    Leveraging biomolecule and natural language through multi-modal learning: A survey, 2025

    Qizhi Pei, Zhimeng Zhou, Kaiyuan Gao, Jinhua Zhu, Yue Wang, Zun Wang, Tao Qin, Lijun Wu, and Rui Yan. Leveraging biomolecule and natural language through multi-modal learning: A survey, 2025. URL https://arxiv.org/abs/2403.01528

  35. [35]

    Clustering huge protein sequence sets in linear time

    M Steinegger and J Söding. Clustering huge protein sequence sets in linear time. nat commun 9: 2542, 2018

  36. [36]

    Interpro: the protein sequence classification resource in 2025.Nucleic acids research, 53(D1):D444–D456, 2025

    Matthias Blum, Antonina Andreeva, Laise Cavalcanti Florentino, Sara Rocio Chuguransky, Tiago Grego, Emma Hobbs, Beatriz Lazaro Pinto, Ailsa Orr, Typhaine Paysan-Lafosse, Irina Ponamareva, et al. Interpro: the protein sequence classification resource in 2025.Nucleic acids research, 53(D1):D444–D456, 2025

  37. [37]

    Martin, Karine Michoud, Claire O’Donovan, Isabelle Phan, Sandrine Pilbout, and Michel Schneider

    Brigitte Boeckmann, Amos Bairoch, Rolf Apweiler, Marie-Claude Blatter, Anne Estreicher, Elisabeth Gasteiger, Maria J. Martin, Karine Michoud, Claire O’Donovan, Isabelle Phan, Sandrine Pilbout, and Michel Schneider. The swiss-prot protein knowledgebase and its supplement trembl in 2003.Nucleic Acids Research, 31(1):365–370, 01 2003. ISSN 0305-1048. doi: 10...

  38. [38]

    Johnson, Jonathan Ho, Daniel Tarlow, and Rianne van den Berg

    Jacob Austin, Daniel D. Johnson, Jonathan Ho, Daniel Tarlow, and Rianne van den Berg. Structured denoising diffusion models in discrete state-spaces. InAdvances in Neural Information Processing Systems, 2021

  39. [39]

    Large Language Diffusion Models

    Shen Nie, Fengqi Zhu, Zebin You, Xiaolu Zhang, Jingyang Ou, Jun Hu, Jun Zhou, Yankai Lin, Ji-Rong Wen, and Chongxuan Li. Large language diffusion models.arXiv preprint arXiv:2502.09992, 2025

  40. [40]

    Simple and effective masked diffusion language models

    Subham Sekhar Sahoo, Marianne Arriola, Aaron Gokaslan, Edgar Mariano Marroquin, Alexander M Rush, Yair Schiff, Justin T Chiu, and Volodymyr Kuleshov. Simple and effective masked diffusion language models. InThe Thirty-eighth Annual Conference on Neural Information Processing Systems, 2024. URL https://openreview.net/forum?id=L4uaAR4ArM

  41. [41]

    Beyond fixed: Training-free variable-length denoising for diffusion large language models.arXiv preprint arXiv:2508.00819, 2025

    Jinsong Li, Xiaoyi Dong, Yuhang Zang, Yuhang Cao, Jiaqi Wang, and Dahua Lin. Beyond fixed: Training-free variable-length denoising for diffusion large language models.arXiv preprint arXiv:2508.00819, 2025

  42. [42]

    Moses, Alex X

    Sarah Alamdari, Nitya Thakkar, Rianne van den Berg, Neil Tenenholtz, Robert Strome, Alan M. Moses, Alex X. Lu, Nicolò Fusi, Ava P. Amini, and Kevin K. Yang. Protein generation with evolutionary diffusion: sequence is all you need.bioRxiv, 2024. doi: 10.1101/2023.09.11.556673. URL https://www.biorxiv.org/content/early/ 2024/11/04/2023.09.11.556673

  43. [43]

    Jiaxin Shi, Kehang Han, Zhe Wang, Arnaud Doucet, and Michalis K. Titsias. Simplified and generalized masked diffusion for discrete data. InAdvances in Neural Information Processing Systems, 2024

  44. [44]

    Your absorbing discrete diffusion secretly models the conditional distributions of clean data, 2024

    Jingyang Ou, Shen Nie, Kaiwen Xue, Fengqi Zhu, Jiacheng Sun, Zhenguo Li, and Chongxuan Li. Your absorbing discrete diffusion secretly models the conditional distributions of clean data, 2024

  45. [45]

    Marks, Lucy J

    Debora S. Marks, Lucy J. Colwell, Robert Sheridan, Thomas A. Hopf, Andrea Pagnani, Riccardo Zecchina, and Chris Sander. Protein 3d structure computed from evolutionary sequence variation.PLOS ONE, 6(12):1–20, 12

  46. [46]

    URLhttps://doi.org/10.1371/journal.pone.0028766

    doi: 10.1371/journal.pone.0028766. URLhttps://doi.org/10.1371/journal.pone.0028766

  47. [47]

    Evaluating protein transfer learning with tape

    Roshan Rao, Nicholas Bhattacharya, Neil Thomas, Yan Duan, Peter Chen, John Canny, Pieter Abbeel, and Yun Song. Evaluating protein transfer learning with tape. In H. Wallach, H. Larochelle, A. Beygelzimer, F. d'Alché-Buc, E. Fox, and R. Garnett, editors,Advances in Neural Information Processing Systems, volume 32. Curran Associates, Inc., 2019

  48. [48]

    FLIP: Benchmark tasks in fitness landscape inference for proteins

    Christian Dallago, Jody Mou, Kadina E Johnston, Bruce Wittmann, Nick Bhattacharya, Samuel Goldman, Ali Madani, and Kevin K Yang. FLIP: Benchmark tasks in fitness landscape inference for proteins. InThirty-fifth Conference on Neural Information Processing Systems Datasets and Benchmarks Track (Round 2), 2021. URLhttps://openreview.net/forum?id=p2dMLEwL8tF

  49. [49]

    Peer: A comprehensive and multi-task benchmark for protein sequence understanding

    Minghao Xu, Zuobai Zhang, Jiarui Lu, Zhaocheng Zhu, Yangtian Zhang, Ma Chang, Runcheng Liu, and Jian Tang. Peer: A comprehensive and multi-task benchmark for protein sequence understanding. In S. Koyejo, S. Mohamed, A. Agarwal, D. Belgrave, K. Cho, and A. Oh, editors,Advances in Neural Information Processing Systems, volume 35, pages 35156–35173. Curran A...

  50. [50]

    A text-guided pro- tein design framework.Nature Machine Intelligence, 7(4):580–591, March 2025

    Shengchao Liu, Yanjing Li, Zhuoxinran Li, Anthony Gitter, Yutao Zhu, Jiarui Lu, Zhao Xu, Weili Nie, Arvind Ramanathan, Chaowei Xiao, Jian Tang, Hongyu Guo, and Anima Anandkumar. A text-guided pro- tein design framework.Nature Machine Intelligence, 7(4):580–591, March 2025. ISSN 2522-5839. doi: 10.1038/s42256-025-01011-z. URLhttp://dx.doi.org/10.1038/s4225...

  51. [51]

    Ingraham, Max Baranov, Zak Costello, Karl W

    John B. Ingraham, Max Baranov, Zak Costello, Karl W. Barber, Wujie Wang, Ahmed Ismail, Vincent Frappier, Dana M. Lord, Christopher Ng-Thow-Hing, Erik R. Van Vlack, Shan Tie, Vincent Xue, Sarah C. Cowles, Alan Leung, João V. Rodrigues, Claudio L. Morales-Perez, Alex M. Ayoub, Robin Green, Katherine Puentes, Frank Oplinger, Nishant V. Panwar, Fritz Obermeye...

  52. [52]

    doi: 10.1038/s41586-023-06728-8

  53. [53]

    A Survey on LLM-as-a-Judge

    Jiawei Gu, Xuhui Jiang, Zhichao Shi, Hexiang Tan, Xuehao Zhai, Chengjin Xu, Wei Li, Yinghan Shen, Shengjie Ma, Honghao Liu, Saizhuo Wang, Kun Zhang, Yuanzhuo Wang, Wen Gao, Lionel Ni, and Jian Guo. A survey on llm-as-a-judge, 2025. URLhttps://arxiv.org/abs/2411.15594

  54. [54]

    Language models of protein sequences at the scale of evolution enable accurate structure prediction.bioRxiv, 2022

    Zeming Lin, Halil Akin, Roshan Rao, Brian Hie, Zhongkai Zhu, Wenting Lu, Nikita Smetanin, Allan dos Santos Costa, Maryam Fazel-Zarandi, Tom Sercu, Sal Candido, et al. Language models of protein sequences at the scale of evolution enable accurate structure prediction.bioRxiv, 2022

  55. [55]

    Progen: Language modeling for protein generation.arXiv preprint arXiv:2004.03497, 2020

    Ali Madani, Bryan McCann, Nikhil Naik, Nitish Shirish Keskar, Namrata Anand, Raphael R Eguchi, Po-Ssu Huang, and Richard Socher. Progen: Language modeling for protein generation.arXiv preprint arXiv:2004.03497, 2020

  56. [56]

    Progen2: exploring the boundaries of protein language models.Cell systems, 14(11):968–978, 2023

    Erik Nijkamp, Jeffrey A Ruffolo, Eli N Weinstein, Nikhil Naik, and Ali Madani. Progen2: exploring the boundaries of protein language models.Cell systems, 14(11):968–978, 2023

  57. [57]

    Curran, Alexander M

    Aadyot Bhatnagar, Sarthak Jain, Joel Beazer, Samuel C. Curran, Alexander M. Hoffnagle, Kyle Shan Ching, Michael Martyn, Stephen Nayfach, Jeffrey A. Ruffolo, and Ali Madani. Scaling unlocks broader generation and deeper functional understanding of proteins. InThe Thirty-ninth Annual Conference on Neural Information Processing Systems, 2026. URLhttps://open...

  58. [58]

    Dplm-2: A multimodal diffusion protein language model.arXiv preprint arXiv:2410.13782, 2024

    Xinyou Wang, Zaixiang Zheng, Fei Ye, Dongyu Xue, Shujian Huang, and Quanquan Gu. Dplm-2: A multimodal diffusion protein language model.arXiv preprint arXiv:2410.13782, 2024

  59. [59]

    Elucidating the design space of multimodal protein language models

    Cheng-Yen Hsieh, Xinyou Wang, Daiheng Zhang, Dongyu Xue, Fei Ye, Shujian Huang, Zaixiang Zheng, and Quanquan Gu. Elucidating the design space of multimodal protein language models. InInternational Conference on Machine Learning, 2025

  60. [60]

    Towards A Generative Protein Evolution Machine with DPLM-Evo

    Xinyou Wang, Liang Hong, Jiasheng Ye, Zaixiang Zheng, Yu Li, Shujian Huang, and Quanquan Gu. Towards a generative protein evolution machine with dplm-evo, 2026. URLhttps://arxiv.org/abs/2605.00182

  61. [61]

    Protein design with dynamic protein vocabulary.arXiv preprint arXiv:2505.18966, 2025

    Nuowei Liu, Jiahao Kuang, Yanting Liu, Changzhi Sun, Tao Ji, Yuanbin Wu, and Man Lan. Protein design with dynamic protein vocabulary.arXiv preprint arXiv:2505.18966, 2025

  62. [62]

    Toward de novo protein design from natural language.bioRxiv, 2025

    Fengyuan Dai, Shiyang You, Yudian Zhu, Yuan Gao, Lihao Fu, Xibin Zhou, Jin Su, Chentong Wang, Yuliang Fan, Xiaoxiao Ma, Xianjun Deng, Letong Yu, Hui Qian, Yan He, Yitao Ke, Chenchen Han, Xing Chang, Liangzhen Zheng, Sheng Wang, Yajie Wang, Anping Zeng, Shunzhi Wang, Tong Si, Jianming Liu, Hongyuan Lu, and Fajie Yuan. Toward de novo protein design from nat...

  63. [63]

    Nature language model: Deciphering the language of nature for scientific discovery, 2025

    Yingce Xia, Peiran Jin, Shufang Xie, Liang He, Chuan Cao, Renqian Luo, Guoqing Liu, Yue Wang, Zequn Liu, Yuan-Jyue Chen, Zekun Guo, Yeqi Bai, Pan Deng, Yaosen Min, Ziheng Lu, Hongxia Hao, Han Yang, Jielan Li, Chang Liu, Jia Zhang, Jianwei Zhu, Ran Bi, Kehan Wu, Wei Zhang, Kaiyuan Gao, Qizhi Pei, Qian Wang, Xixian Liu, Yanting Li, Houtian Zhu, Yeqing Lu, M...

  64. [64]

    Claude for life sciences

    Anthropic. Claude for life sciences. https://www.anthropic.com/news/claude-for-life-sciences, October 2025

  65. [65]

    Introducing GPT-Rosalind.https://openai.com/index/introducing-gpt-rosalind/, 2026

    OpenAI. Introducing GPT-Rosalind.https://openai.com/index/introducing-gpt-rosalind/, 2026. 18

  66. [66]

    Alphafold protein structure database in 2024: providing structure coverage for over 214 million protein sequences.Nucleic Acids Research, 52(D1):D368– D375, 01 2024

    Mihaly Varadi, Damian Bertoni, Paulyna Magana, Urmila Paramval, Ivanna Pidruchna, Malarvizhi Radhakrishnan, Maxim Tsenkov, Sreenath Nair, Milot Mirdita, Jingi Yeo, Oleg Kovalevskiy, Kathryn Tunyasuvunakool, Agata Laydon, Augustin Žídek, Hamish Tomlinson, Dhavanthi Hariharan, Josh Abrahamson, Tim Green, John Jumper, Ewan Birney, Martin Steinegger, Demis Ha...

  67. [67]

    CFP-gen: Combinatorial functional protein generation via diffusion language models

    Junbo Yin, Chao Zha, Wenjia He, Chencheng Xu, and Xin Gao. CFP-gen: Combinatorial functional protein generation via diffusion language models. InForty-second International Conference on Machine Learning,

  68. [68]

    URLhttps://openreview.net/forum?id=EiM163eZyg

  69. [69]

    Neural text generation with unlikelihood training, 2019

    Sean Welleck, Ilia Kulikov, Stephen Roller, Emily Dinan, Kyunghyun Cho, and Jason Weston. Neural text generation with unlikelihood training, 2019. URLhttps://arxiv.org/abs/1908.04319

  70. [70]

    Uniprot: the universal protein knowledgebase in 2025.Nucleic Acids Research, 53 (D1):D609–D617, 01 2025

    The UniProt Consortium. Uniprot: the universal protein knowledgebase in 2025.Nucleic Acids Research, 53 (D1):D609–D617, 01 2025. ISSN 1362-4962. doi: 10.1093/nar/gkae1010. URLhttps://doi.org/10.1093/nar/ gkae1010

  71. [71]

    Quinn, Amaia Sangrador-Vegas, Maxim Scheremetjew, Siew-Yit Yong, Rodrigo Lopez, and Sarah Hunter

    Philip Jones, David Binns, Hsin-Yu Chang, Matthew Fraser, Weizhong Li, Craig McAnulla, Hamish McWilliam, John Maslen, Alex Mitchell, Gift Nuka, Sebastien Pesseat, Antony F. Quinn, Amaia Sangrador-Vegas, Maxim Scheremetjew, Siew-Yit Yong, Rodrigo Lopez, and Sarah Hunter. Interproscan 5: genome-scale protein function classification.Bioinformatics, 30(9):123...

  72. [72]

    An N-terminal basic tail rich in (K R) that may enhance RNA association

  73. [73]

    The central KOW-like KNDTVVVLSGDDKGKQGAVLELIPAKKAAIV segment providing the β-barrel structure

  74. [74]

    structure-aware

    A C-terminal helix with conserved (A L)-rich residues to complete the ribosomal protein fold. Connecting these elements yields a plausible full protein sequence, denoted asMGKIRKNDTVVVLSGDDK GKQGAVLELIPAKKAAIVKGVNIKTKHRKPSNKNTSGEIITFEAPILLSKLALVAKKATKDKPAIPTRVGFKIENKKKIRIAKK TGKAI, which meets the length and functional criteria for a KOW-containing riboso...