Depth Anything V2

2024

7 Pith papers cite this work. Polarity classification is still indexing.

citation-role summary

method 1

citation-polarity summary

use method 1

representative citing papers

CDPR: Cross-modal Diffusion with Polarization for Reliable Monocular Depth Estimation

cs.CV · 2026-04-13 · unverdicted · novelty 7.0

CDPR integrates polarization priors into a diffusion-based monocular depth estimator via shared latent space and adaptive gating, outperforming RGB-only methods in challenging scenes.

UfM*: Uncertainty from Motion* for DNN Depth Estimation Using Gaussians

cs.RO · 2026-05-21 · unverdicted · novelty 6.0

UfM* uses Gaussian mixtures to compute multiview disagreement as an uncertainty measure for depth estimation, requiring only a single inference per image and reducing energy and memory use.

Thermal-Only Crowd Counting with Deployment-Time Privacy Protection

cs.CV · 2026-05-16 · unverdicted · novelty 6.0

A privacy-preserving thermal-only crowd counting framework extracts enhanced features from thermal images via single-step LCM denoising in a depth-to-RGB diffusion model and matches RGB-T fusion performance without RGB input at inference.

H-OmniStereo: Zero-Shot Omnidirectional Stereo Matching with Heading-Aligned Normal Priors

cs.CV · 2026-05-14 · unverdicted · novelty 6.0

H-OmniStereo trains a stereo matcher on 2.8 million synthetic equirectangular pairs and adds a heading-aligned normal prior to improve zero-shot accuracy and generalization on out-of-domain and real omnidirectional data.

Angle-I2P: Angle-Consistent-Aware Hierarchical Attention for Cross-Modality Outlier Rejection

cs.CV · 2026-05-06 · unverdicted · novelty 6.0 · 2 refs

Angle-I2P rejects outliers in cross-modality registration via scale-invariant angular consistency and hierarchical attention, reporting state-of-the-art inlier ratio and registration recall on 7Scenes, RGBD Scenes V2, and a self-collected dataset.

Clutter-Robust Vision-Language-Action Models through Object-Centric and Geometry Grounding

cs.RO · 2025-12-27 · conditional · novelty 6.0

OBEYED-VLA improves VLA robustness in cluttered real-world manipulation by disentangling perception into VLM-based object-centric grounding and geometry-aware stages, then fine-tuning the policy only on single-object demonstrations.

Geometry Reinforced Efficient Attention Tuning Equipped with Normals for Robust Stereo Matching

cs.CV · 2026-04-10 · unverdicted · novelty 5.0

GREATEN fuses surface normals with image features via gated contextual-geometric fusion and efficient sparse attentions to cut stereo matching errors by up to 30% on real datasets when trained solely on synthetic data.

citing papers explorer

Showing 7 of 7 citing papers.

CDPR: Cross-modal Diffusion with Polarization for Reliable Monocular Depth Estimation · cs.CV · 2026-04-13 · unverdicted · none · ref 6
UfM*: Uncertainty from Motion* for DNN Depth Estimation Using Gaussians · cs.RO · 2026-05-21 · unverdicted · none · ref 1
Thermal-Only Crowd Counting with Deployment-Time Privacy Protection · cs.CV · 2026-05-16 · unverdicted · none · ref 52
H-OmniStereo: Zero-Shot Omnidirectional Stereo Matching with Heading-Aligned Normal Priors · cs.CV · 2026-05-14 · unverdicted · none · ref 8
Angle-I2P: Angle-Consistent-Aware Hierarchical Attention for Cross-Modality Outlier Rejection · cs.CV · 2026-05-06 · unverdicted · none · ref 29 · 2 links
Clutter-Robust Vision-Language-Action Models through Object-Centric and Geometry Grounding · cs.RO · 2025-12-27 · conditional · none · ref 40
Geometry Reinforced Efficient Attention Tuning Equipped with Normals for Robust Stereo Matching · cs.CV · 2026-04-10 · unverdicted · none · ref 6
