A self-evolving framework with proposer-solver-generator roles, Solver Token Entropy, and multi-scale internal evaluation improves unified LMMs on understanding and generation tasks using only self-derived consistency signals.
Multimodal learning with next-token prediction for large multimodal models.Nature, 650(8101):327–333, 2026
2 Pith papers cite this work. Polarity classification is still indexing.
2
Pith papers citing it
years
2026 2verdicts
UNVERDICTED 2representative citing papers
MSI is a multimodal representation learning framework that identifies key microstructural features governing mechanical behavior in structural alloys from spatial observations.
citing papers explorer
-
Interpretable Material Spatial Intelligence for Discovery of Governing Microstructural Features
MSI is a multimodal representation learning framework that identifies key microstructural features governing mechanical behavior in structural alloys from spatial observations.