Title resolution pending

Alexander Kirillov, Eric Mintun, Nikhila Ravi, Hanzi Mao, Chloe Rolland, Laura Gustafson, Tete Xiao, Spencer Whitehead, Alexander C. Berg, Wan-Yen Lo, et al.

12 Pith papers cite this work. Polarity classification is still indexing.


Title metadata for this work has not finished resolving. The hub is built from the citation graph; the title resolver retries DOI and OpenAlex on its next pass.

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

ShadeBench: A Benchmark Dataset for Building Shade Simulation in Sustainable Society

cs.CV · 2026-05-19 · unverdicted · novelty 7.0

ShadeBench is a multimodal benchmark dataset for urban shade understanding that includes temporally varying shade maps, satellite imagery, building representations, and text to support shade generation, segmentation, and 3D reconstruction tasks.

From Diffusion to Rectified Flow: Rethinking Text-Based Segmentation

cs.CV · 2026-05-06 · unverdicted · novelty 7.0

RLFSeg repurposes pretrained generative models via Rectified Flow for direct latent-space image-to-mask mapping in text-based segmentation, outperforming diffusion-based methods especially in zero-shot cases.

FlowAnchor: Stabilizing the Editing Signal for Inversion-Free Video Editing

cs.CV · 2026-04-24 · unverdicted · novelty 7.0

FlowAnchor stabilizes editing signals in flow-based inversion-free video editing via spatial-aware attention refinement and adaptive magnitude modulation for improved faithfulness and temporal coherence.

CFSR: Geometry-Conditioned Shadow Removal via Physical Disentanglement

cs.CV · 2026-04-20 · unverdicted · novelty 7.0

CFSR reframes shadow removal as a physics-constrained process using geometric and semantic priors from depth, DINO, CLIP, and frequency decoupling to achieve claimed state-of-the-art results.

OVS-DINO: Open-Vocabulary Segmentation via Structure-Aligned SAM-DINO with Language Guidance

cs.CV · 2026-04-09 · unverdicted · novelty 7.0

OVS-DINO structurally aligns DINO with SAM to revitalize attenuated boundary features, achieving SOTA gains of 2.1% average and 6.3% on Cityscapes in weakly-supervised open-vocabulary segmentation.

Learning Physics from Pretrained Video Models: A Multimodal Continuous and Sequential World Interaction Models for Robotic Manipulation

cs.RO · 2026-02-18 · unverdicted · novelty 7.0

PhysGen uses video models to learn physics for robots, outperforming baselines by up to 13.8% on Libero and matching specialized models in real-world tasks.

Low-Cost Hard-Label Adversarial Attack with Theoretical Foundations

cs.LG · 2026-01-17 · unverdicted · novelty 7.0

Presents a new theoretically grounded hard-label attack with zero-query initialization and low-complexity optimization that outperforms prior methods across image datasets and models.

MiXR: Harvesting and Recomposing Geometry from Real-World Objects for In-Situ 3D Design

cs.HC · 2026-05-10 · unverdicted · novelty 6.0 · 2 refs

MiXR enables in-situ 3D compositional modeling by harvesting real-world geometry in XR and using generative AI to synthesize coherent models from user-defined assemblies.

SandSim: Curve-Guided Gaussian Splatting for Reconstructing Sand Painting Processes

cs.GR · 2026-04-30 · unverdicted · novelty 6.0

SandSim reconstructs temporally coherent sand painting processes from single images using curve-guided Gaussian splatting, subtractive compositing for accumulation, and semantic-guided stroke planning.

Decoding the Delta: Unifying Remote Sensing Change Detection and Understanding with Multimodal Large Language Models

cs.CV · 2026-04-15 · unverdicted · novelty 6.0

Delta-LLaVA adds Change-Enhanced Attention, Change-SEG with prior embeddings, and Local Causal Attention to MLLMs to overcome temporal blindness, outperforming general models on a new unified benchmark for bi- and tri-temporal remote sensing tasks.

Structure-Semantic Decoupled Modulation of Global Geospatial Embeddings for High-Resolution Remote Sensing Mapping

cs.CV · 2026-04-21 · unverdicted · novelty 5.0

SSDM decouples global geospatial embeddings into structural modulation and semantic injection pathways to improve accuracy and consistency in high-resolution remote sensing land cover mapping.

Interactive Interface For Semantic Segmentation Dataset Synthesis

cs.CV · 2025-06-30 · unverdicted · novelty 3.0

SynthLab provides a modular visual data synthesis platform and an interactive drag-and-drop interface for semantic segmentation datasets; user studies indicate that it is accessible to a diverse range of users.

citing papers explorer

Showing 12 of 12 citing papers.

ShadeBench: A Benchmark Dataset for Building Shade Simulation in Sustainable Society
cs.CV · 2026-05-19 · unverdicted · none · ref 22

From Diffusion to Rectified Flow: Rethinking Text-Based Segmentation
cs.CV · 2026-05-06 · unverdicted · none · ref 20

FlowAnchor: Stabilizing the Editing Signal for Inversion-Free Video Editing
cs.CV · 2026-04-24 · unverdicted · none · ref 12

CFSR: Geometry-Conditioned Shadow Removal via Physical Disentanglement
cs.CV · 2026-04-20 · unverdicted · none · ref 22

OVS-DINO: Open-Vocabulary Segmentation via Structure-Aligned SAM-DINO with Language Guidance
cs.CV · 2026-04-09 · unverdicted · none · ref 22

Learning Physics from Pretrained Video Models: A Multimodal Continuous and Sequential World Interaction Models for Robotic Manipulation
cs.RO · 2026-02-18 · unverdicted · none · ref 27

Low-Cost Hard-Label Adversarial Attack with Theoretical Foundations
cs.LG · 2026-01-17 · unverdicted · none · ref 31

MiXR: Harvesting and Recomposing Geometry from Real-World Objects for In-Situ 3D Design
cs.HC · 2026-05-10 · unverdicted · none · ref 27 · 2 links

SandSim: Curve-Guided Gaussian Splatting for Reconstructing Sand Painting Processes
cs.GR · 2026-04-30 · unverdicted · none · ref 23

Decoding the Delta: Unifying Remote Sensing Change Detection and Understanding with Multimodal Large Language Models
cs.CV · 2026-04-15 · unverdicted · none · ref 22

Structure-Semantic Decoupled Modulation of Global Geospatial Embeddings for High-Resolution Remote Sensing Mapping
cs.CV · 2026-04-21 · unverdicted · none · ref 22

Interactive Interface For Semantic Segmentation Dataset Synthesis
cs.CV · 2025-06-30 · unverdicted · none · ref 7
