STOMP extends direct preference optimization to the multi-objective setting via smooth Tchebysheff scalarization and standardization of observed rewards, achieving the highest hypervolume in eight of nine protein engineering evaluations.
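As a rough illustration of the two ingredients named in the summary, the sketch below standardizes a small table of observed rewards and then collapses each reward vector into one score with a smooth Tchebysheff scalarization. This is not taken from the STOMP paper: the z-score standardization, the log-sum-exp form of the scalarization, the smoothing parameter `mu`, the weights, and the choice of ideal point are all assumptions made for the example, and the reward values are invented.

```python
import math
from statistics import mean, pstdev


def standardize(rewards):
    """Z-score each objective (column) over the observed rewards.

    Assumption: plain z-scoring; the paper's standardization may differ.
    """
    columns = list(zip(*rewards))
    mus = [mean(c) for c in columns]
    # Fall back to 1.0 when an objective is constant, to avoid dividing by zero.
    sigmas = [pstdev(c) or 1.0 for c in columns]
    return [[(r - m) / s for r, m, s in zip(row, mus, sigmas)] for row in rewards]


def smooth_tchebysheff(values, weights, ideal, mu=0.1):
    """Smooth upper bound on max_i w_i * (ideal_i - v_i); lower is better.

    Computed as mu * log(sum_i exp(w_i * (ideal_i - v_i) / mu)) with the
    usual max-shift for numerical stability. Rewards are assumed maximized.
    """
    terms = [w * (z - v) / mu for v, w, z in zip(values, weights, ideal)]
    shift = max(terms)
    return mu * (shift + math.log(sum(math.exp(t - shift) for t in terms)))


# Hypothetical observed rewards: three candidates, two objectives on different scales.
rewards = [[1.0, 10.0], [2.0, 30.0], [3.0, 20.0]]
weights = [0.5, 0.5]

z = standardize(rewards)
ideal = [max(col) for col in zip(*z)]  # assumed ideal point: best observed value per objective
scores = [smooth_tchebysheff(row, weights, ideal) for row in z]
best = min(range(len(scores)), key=scores.__getitem__)  # index of the lowest (best) score
```

Standardizing first keeps the objective with the larger raw scale from dominating the scalarized score. As `mu` shrinks, the smooth value approaches the ordinary (non-smooth) Tchebysheff maximum.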
ISBN 9798331314385