SILO outperforms five baselines on eight protein fitness landscapes by using trajectory-level imitation on trajectories selected via hierarchical beam search and biological proxy guidance under limited oracle budgets.
Improved off-policy reinforcement learning in biological sequence design.arXiv preprint arXiv:2410.04461, 2024
1 Pith paper cite this work. Polarity classification is still indexing.
1
Pith paper citing it
fields
cs.LG 1years
2026 1verdicts
UNVERDICTED 1representative citing papers
citing papers explorer
-
Self-Improvement Imitation with Biologically Guided Search for Protein Design Under Oracle Budgets
SILO outperforms five baselines on eight protein fitness landscapes by using trajectory-level imitation on trajectories selected via hierarchical beam search and biological proxy guidance under limited oracle budgets.