arXiv preprint arXiv:1907.01341 , year=

Towards robust monocular depth estimation: Mixing datasets for zero-shot cross-dataset transfer , author= · 1907 · arXiv 1907.01341

5 Pith papers cite this work. Polarity classification is still indexing.

5 Pith papers citing it

read on arXiv browse 5 citing papers

citation-role summary

dataset 1

citation-polarity summary

use dataset 1

representative citing papers

WildLIFT: Lifting monocular drone video to 3D for species-agnostic wildlife monitoring

cs.CV · 2026-04-27 · unverdicted · novelty 6.0

WildLIFT lifts monocular drone video to 3D for species-agnostic wildlife detection, tracking, and viewpoint analysis by integrating scene geometry with open-vocabulary segmentation.

Zero-shot World Models Are Developmentally Efficient Learners

cs.AI · 2026-04-11 · unverdicted · novelty 6.0

A zero-shot visual world model trained on one child's experience achieves broad competence on physical understanding benchmarks while matching developmental behavioral patterns.

MetaMorph: Multimodal Understanding and Generation via Instruction Tuning

cs.CV · 2024-12-18 · unverdicted · novelty 6.0

VPiT enables pretrained LLMs to perform both visual understanding and generation by predicting discrete text tokens and continuous visual tokens, with understanding data proving more effective than generation-specific data.

The Role and Relationship of Initialization and Densification in 3D Gaussian Splatting

cs.CV · 2026-03-21 · unverdicted · novelty 5.0

Current densification methods in 3D Gaussian Splatting do not significantly benefit from dense initializations and perform similarly to sparse SfM-based ones.

Depth-Aware Rover: A Study of Edge AI and Monocular Vision for Real-World Implementation

cs.CV · 2026-04-24 · unverdicted · novelty 3.0

Monocular depth estimation with UniDepthV2 on Raspberry Pi enables cost-effective rover navigation, proving more robust than stereo vision in real-world tests at 0.1 FPS depth and 10 FPS detection.

citing papers explorer

Showing 5 of 5 citing papers.

WildLIFT: Lifting monocular drone video to 3D for species-agnostic wildlife monitoring cs.CV · 2026-04-27 · unverdicted · none · ref 3
WildLIFT lifts monocular drone video to 3D for species-agnostic wildlife detection, tracking, and viewpoint analysis by integrating scene geometry with open-vocabulary segmentation.
Zero-shot World Models Are Developmentally Efficient Learners cs.AI · 2026-04-11 · unverdicted · none · ref 60
A zero-shot visual world model trained on one child's experience achieves broad competence on physical understanding benchmarks while matching developmental behavioral patterns.
MetaMorph: Multimodal Understanding and Generation via Instruction Tuning cs.CV · 2024-12-18 · unverdicted · none · ref 200
VPiT enables pretrained LLMs to perform both visual understanding and generation by predicting discrete text tokens and continuous visual tokens, with understanding data proving more effective than generation-specific data.
The Role and Relationship of Initialization and Densification in 3D Gaussian Splatting cs.CV · 2026-03-21 · unverdicted · none · ref 37
Current densification methods in 3D Gaussian Splatting do not significantly benefit from dense initializations and perform similarly to sparse SfM-based ones.
Depth-Aware Rover: A Study of Edge AI and Monocular Vision for Real-World Implementation cs.CV · 2026-04-24 · unverdicted · none · ref 4
Monocular depth estimation with UniDepthV2 on Raspberry Pi enables cost-effective rover navigation, proving more robust than stereo vision in real-world tests at 0.1 FPS depth and 10 FPS detection.

arXiv preprint arXiv:1907.01341 , year=

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer