Every9D-21M supplies 21.8M real-world 9D pose annotations for 700 everyday categories by propagating manual canonical poses through cross-instance alignment in object-centric videos and verifying them multiview.
Orient anything v2: Unifying orientation and rotation understanding.arXiv preprint arXiv:2601.05573, 2026
3 Pith papers cite this work. Polarity classification is still indexing.
3
Pith papers citing it
fields
cs.CV 3years
2026 3representative citing papers
A coarse canonical mesh bottleneck plus multi-view consistency lets a shared object frame emerge from self-supervised training on in-the-wild videos without canonical labels or category conditioning.
OmniView-Space framework with MPSM, tool-guided reasoning, and distillation achieves SOTA on spatial reasoning benchmarks for MLLMs while reducing external geometry dependencies.
citing papers explorer
-
Every9D-21M: Large-Scale Real-World 9D Canonicalization of Everyday Objects
Every9D-21M supplies 21.8M real-world 9D pose annotations for 700 everyday categories by propagating manual canonical poses through cross-instance alignment in object-centric videos and verifying them multiview.