Transactions on Machine Learning Research Journal pp

Oquab, M · 2024

3 Pith papers cite this work.

representative citing papers

ViPS: Video-informed Pose Spaces for Auto-Rigged Meshes

cs.CV · 2026-04-19 · unverdicted · novelty 7.0 · 2 refs

ViPS learns a universal, controllable pose space for auto-rigged meshes by transferring motion priors from video diffusion models, matching SOTA performance on plausibility and diversity while enabling zero-shot generalization.

ROAR-3D: Routing Arbitrary Views for High-Fidelity 3D Generation

cs.CV · 2026-05-20 · unverdicted · novelty 6.0

ROAR-3D adds a token-wise view router and dual-stream attention to pretrained single-view 3D generators so they can use arbitrary unposed images for higher-fidelity output.

AdaVFM: Adaptive Vision Foundation Models for Edge Intelligence via LLM-Guided Execution

cs.CV · 2026-04-17 · unverdicted · novelty 6.0

AdaVFM integrates neural architecture search into vision foundation model backbones and uses a cloud multimodal LLM agent to enable runtime-adaptive lightweight subnet execution, delivering up to 7.9% higher accuracy and 77.9% lower FLOPs than fixed-size baselines on edge devices.

citing papers explorer

ViPS: Video-informed Pose Spaces for Auto-Rigged Meshes cs.CV · 2026-04-19 · unverdicted · none · ref 32 · 2 links
ROAR-3D: Routing Arbitrary Views for High-Fidelity 3D Generation cs.CV · 2026-05-20 · unverdicted · none · ref 39
AdaVFM: Adaptive Vision Foundation Models for Edge Intelligence via LLM-Guided Execution cs.CV · 2026-04-17 · unverdicted · none · ref 50
