Title resolution pending

Association for Computational Linguistics

5 Pith papers cite this work. Polarity classification is still indexing.

5 Pith papers citing it

browse 5 citing papers

Title metadata for this work has not finished resolving. The hub is built from the citation graph; the title resolver retries DOI and OpenAlex on its next pass.

representative citing papers

AnomalyVFM -- Transforming Vision Foundation Models into Zero-Shot Anomaly Detectors

cs.CV · 2026-01-28 · conditional · novelty 7.0

AnomalyVFM converts vision foundation models into zero-shot anomaly detectors via three-stage synthetic dataset generation plus low-rank adapters and weighted pixel loss, reaching 94.1% average image AUROC across nine datasets.

Training Multi-Image Vision Agents via End2End Reinforcement Learning

cs.CV · 2025-12-05 · unverdicted · novelty 7.0

IMAgent trains a multi-image vision agent via pure end-to-end RL with visual reflection tools and a two-layer motion trajectory masking strategy, reaching SOTA on single- and multi-image benchmarks while revealing tool-use effects on attention.

MultiModalPFN: Extending Prior-Data Fitted Networks for Multimodal Tabular Learning

cs.LG · 2026-02-23 · unverdicted · novelty 6.0

MultiModalPFN extends TabPFN with modality projectors, a multi-head gated MLP, and cross-attention pooler to unify tabular and non-tabular inputs, outperforming prior methods on medical and general multimodal datasets.

Masked Language Prompting for Generative Data Augmentation in Few-shot Fashion Style Recognition

cs.CV · 2025-04-28 · unverdicted · novelty 6.0

Masked Language Prompting masks selected words in reference captions and leverages LLMs to produce diverse, semantically coherent completions for style-consistent generative image augmentation without fine-tuning.

KFC-W: Generating 3D-Consistent Videos from Unposed Internet Photos

cs.CV · 2024-11-20 · unverdicted · novelty 5.0

KFC-W is a self-supervised 3D-aware video model trained on videos and multiview internet photos that produces geometrically consistent interpolations between unposed input images without any 3D annotations.

citing papers explorer

Showing 5 of 5 citing papers.

AnomalyVFM -- Transforming Vision Foundation Models into Zero-Shot Anomaly Detectors cs.CV · 2026-01-28 · conditional · none · ref 79
AnomalyVFM converts vision foundation models into zero-shot anomaly detectors via three-stage synthetic dataset generation plus low-rank adapters and weighted pixel loss, reaching 94.1% average image AUROC across nine datasets.
Training Multi-Image Vision Agents via End2End Reinforcement Learning cs.CV · 2025-12-05 · unverdicted · none · ref 57
IMAgent trains a multi-image vision agent via pure end-to-end RL with visual reflection tools and a two-layer motion trajectory masking strategy, reaching SOTA on single- and multi-image benchmarks while revealing tool-use effects on attention.
MultiModalPFN: Extending Prior-Data Fitted Networks for Multimodal Tabular Learning cs.LG · 2026-02-23 · unverdicted · none · ref 61
MultiModalPFN extends TabPFN with modality projectors, a multi-head gated MLP, and cross-attention pooler to unify tabular and non-tabular inputs, outperforming prior methods on medical and general multimodal datasets.
Masked Language Prompting for Generative Data Augmentation in Few-shot Fashion Style Recognition cs.CV · 2025-04-28 · unverdicted · none · ref 7
Masked Language Prompting masks selected words in reference captions and leverages LLMs to produce diverse, semantically coherent completions for style-consistent generative image augmentation without fine-tuning.
KFC-W: Generating 3D-Consistent Videos from Unposed Internet Photos cs.CV · 2024-11-20 · unverdicted · none · ref 85
KFC-W is a self-supervised 3D-aware video model trained on videos and multiview internet photos that produces geometrically consistent interpolations between unposed input images without any 3D annotations.

Title resolution pending

fields

years

verdicts

representative citing papers

citing papers explorer