In: ICLR (2019)

Loshchilov, I · 2019

9 Pith papers cite this work. Polarity classification is still indexing.

9 Pith papers citing it

browse 9 citing papers

citation-role summary

background 1 method 1

citation-polarity summary

background 1 use method 1

representative citing papers

Exploring deep learning for Event-Based Saliency Prediction with a Transformer-based model

cs.CV · 2026-05-22 · unverdicted · novelty 8.0

SEST is the first deep learning model for event-based saliency prediction, using a pretrained Swin Transformer backbone and synthetic benchmarks to outperform prior event methods while transferring to real event streams.

HAC: Parameter-Efficient Hyperbolic Adaptation of CLIP for Zero-Shot VQA

cs.CV · 2026-04-26 · unverdicted · novelty 7.0

HAC provides a parameter-efficient way to move CLIP into hyperbolic geometry, yielding consistent gains on zero-shot VQA benchmarks without any VQA training data overlap.

Metonymy in vision models undermines attention-based interpretability

cs.CV · 2026-05-07 · unverdicted · novelty 6.0

Pretrained vision transformers exhibit strong intra-object leakage where each part representation encodes information from the entire object, undermining the faithfulness of attention-based part-centric interpretability methods.

Refinement via Regeneration: Enlarging Modification Space Boosts Image Refinement in Unified Multimodal Models

cs.CV · 2026-04-28 · unverdicted · novelty 6.0

Refinement via Regeneration (RvR) reformulates image refinement in unified multimodal models as conditional regeneration using prompt and semantic tokens from the initial image, yielding higher alignment scores than editing-based methods.

Free Geometry: Refining 3D Reconstruction from Longer Versions of Itself

cs.CV · 2026-04-15 · unverdicted · novelty 6.0

Free Geometry enables test-time self-improvement of 3D reconstruction models via cross-view consistency between full and masked observations, yielding average gains of 3.73% in pose accuracy and 2.88% in point maps.

DVGT-2: Vision-Geometry-Action Model for Autonomous Driving at Scale

cs.CV · 2026-04-01 · unverdicted · novelty 6.0

DVGT-2 is a streaming vision-geometry-action model that jointly reconstructs dense 3D geometry and plans trajectories online, achieving better reconstruction than prior batch methods while transferring directly to planning benchmarks without fine-tuning.

Spatial-Frequency Gated Swin Transformer for Remote Sensing Single-Image Super-Resolution

cs.CV · 2026-05-10 · unverdicted · novelty 5.0

SFG-SwinSR improves PSNR to 45.19 dB and SSIM to 0.9852 on SpaceNet by adding a depthwise-blur plus gated spatial branch inside each Swin2SR feed-forward network.

StratFormer: Adaptive Opponent Modeling and Exploitation in Imperfect-Information Games

cs.AI · 2026-04-28 · unverdicted · novelty 5.0

StratFormer uses a two-phase curriculum with dual-turn tokens and bucket-rate features to model and exploit opponents in Leduc Hold'em, gaining +0.106 BB/hand on average over GTO while keeping near-equilibrium safety.

Weak-to-Strong Knowledge Distillation Accelerates Visual Learning

cs.CV · 2026-04-16 · unverdicted · novelty 5.0

Weak-to-strong knowledge distillation applied early and then turned off accelerates convergence to target performance in visual learning tasks by factors of 1.7-4.8x.

citing papers explorer

Showing 9 of 9 citing papers.

Exploring deep learning for Event-Based Saliency Prediction with a Transformer-based model cs.CV · 2026-05-22 · unverdicted · none · ref 27
SEST is the first deep learning model for event-based saliency prediction, using a pretrained Swin Transformer backbone and synthetic benchmarks to outperform prior event methods while transferring to real event streams.
HAC: Parameter-Efficient Hyperbolic Adaptation of CLIP for Zero-Shot VQA cs.CV · 2026-04-26 · unverdicted · none · ref 20
HAC provides a parameter-efficient way to move CLIP into hyperbolic geometry, yielding consistent gains on zero-shot VQA benchmarks without any VQA training data overlap.
Metonymy in vision models undermines attention-based interpretability cs.CV · 2026-05-07 · unverdicted · none · ref 33
Pretrained vision transformers exhibit strong intra-object leakage where each part representation encodes information from the entire object, undermining the faithfulness of attention-based part-centric interpretability methods.
Refinement via Regeneration: Enlarging Modification Space Boosts Image Refinement in Unified Multimodal Models cs.CV · 2026-04-28 · unverdicted · none · ref 32
Refinement via Regeneration (RvR) reformulates image refinement in unified multimodal models as conditional regeneration using prompt and semantic tokens from the initial image, yielding higher alignment scores than editing-based methods.
Free Geometry: Refining 3D Reconstruction from Longer Versions of Itself cs.CV · 2026-04-15 · unverdicted · none · ref 8
Free Geometry enables test-time self-improvement of 3D reconstruction models via cross-view consistency between full and masked observations, yielding average gains of 3.73% in pose accuracy and 2.88% in point maps.
DVGT-2: Vision-Geometry-Action Model for Autonomous Driving at Scale cs.CV · 2026-04-01 · unverdicted · none · ref 49
DVGT-2 is a streaming vision-geometry-action model that jointly reconstructs dense 3D geometry and plans trajectories online, achieving better reconstruction than prior batch methods while transferring directly to planning benchmarks without fine-tuning.
Spatial-Frequency Gated Swin Transformer for Remote Sensing Single-Image Super-Resolution cs.CV · 2026-05-10 · unverdicted · none · ref 17
SFG-SwinSR improves PSNR to 45.19 dB and SSIM to 0.9852 on SpaceNet by adding a depthwise-blur plus gated spatial branch inside each Swin2SR feed-forward network.
StratFormer: Adaptive Opponent Modeling and Exploitation in Imperfect-Information Games cs.AI · 2026-04-28 · unverdicted · none · ref 16
StratFormer uses a two-phase curriculum with dual-turn tokens and bucket-rate features to model and exploit opponents in Leduc Hold'em, gaining +0.106 BB/hand on average over GTO while keeping near-equilibrium safety.
Weak-to-Strong Knowledge Distillation Accelerates Visual Learning cs.CV · 2026-04-16 · unverdicted · none · ref 29
Weak-to-strong knowledge distillation applied early and then turned off accelerates convergence to target performance in visual learning tasks by factors of 1.7-4.8x.

In: ICLR (2019)

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer