International Conference on Machine Learning

EfficientNet: Rethinking model scaling for convolutional neural networks · 2019

12 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

Empirical Evidence for Simply Connected Decision Regions in Image Classifiers

cs.CV · 2026-05-07 · unverdicted · novelty 7.0

Empirical tests with quad-mesh filling indicate that decision regions in modern image classifiers are simply connected.

One-Step Diffusion with Inverse Residual Fields for Unsupervised Industrial Anomaly Detection

cs.CV · 2026-04-20 · unverdicted · novelty 7.0

OSD-IRF performs unsupervised industrial anomaly detection with a single diffusion step by evaluating anomalies in inverse residual field space under a Gaussian, delivering SOTA or competitive results with roughly 2x speedup.

The Multi-Block DC Function Class: Theory, Algorithms, and Applications

math.OC · 2026-04-19 · unverdicted · novelty 7.0

The Multi-Block DC class admits polynomial-size DC decompositions for problems that require exponential size under standard DC programming and supplies explicit constructive formulations for deep ReLU networks together with convergent batch and stochastic algorithms.

Low Latency Gaze Tracking via Latent Optical Sensing

cs.CV · 2026-05-18 · unverdicted · novelty 6.0

A hardware prototype performs gaze estimation by optically encoding task-relevant features with a microlens array and mask, captured on a 4x4 phototransistor array and decoded by a small neural network, reaching 3.4 ms latency with competitive accuracy.

Venus-DeFakerOne: Unified Fake Image Detection & Localization

cs.CV · 2026-05-13 · unverdicted · novelty 6.0

DeFakerOne integrates InternVL2 and SAM2 into a single model that achieves state-of-the-art results on 39 detection and 9 localization benchmarks for unified fake image detection and localization.

PragLocker: Protecting Agent Intellectual Property in Untrusted Deployments via Non-Portable Prompts

cs.CR · 2026-05-07 · unverdicted · novelty 6.0 · 2 refs

PragLocker generates function-preserving but non-portable prompts for LLM agents via code-symbol semantic anchoring followed by target-model feedback noise injection.

DyGRO-VLA: Cross-Task Scaling of Vision-Language-Action Models via Dynamic Grouped Residual Optimization

cs.RO · 2026-05-17 · unverdicted · novelty 5.0

DyGRO-VLA is a two-stage optimization framework for cross-task scaling of Vision-Language-Action models via dynamic grouped residual optimization in RL.

Higher Resolution, Better Generalization: Unlocking Visual Scaling in Deep Reinforcement Learning

cs.LG · 2026-05-11 · unverdicted · novelty 5.0

Higher-resolution observations with global-average-pooling encoders improve RL performance and generalization by enabling more localized visual attention, yielding up to 28% gains over standard Impala encoders.

Lightning Unified Video Editing via In-Context Sparse Attention

cs.CV · 2026-05-06 · unverdicted · novelty 5.0

ISA prunes low-saliency context tokens and routes queries by sharpness to either full or 0-th order Taylor sparse attention, enabling LIVEditor to cut attention latency ~60% while beating prior video editing methods on three benchmarks.

Learning Invariant Modality Representation for Robust Multimodal Learning from a Causal Inference Perspective

cs.LG · 2026-04-20 · unverdicted · novelty 5.0

CmIR uses causal inference to separate invariant causal representations from spurious ones in multimodal data, improving generalization under distribution shifts and noise via invariance, mutual information, and reconstruction constraints.

ESsEN: Training Compact Discriminative Vision-Language Transformers in a Low-Resource Setting

cs.CV · 2026-04-20 · unverdicted · novelty 5.0

ESsEN is a parameter-efficient two-tower vision-language transformer that matches larger models on discriminative tasks after training end-to-end with limited data and resources.

AgriKD: Cross-Architecture Knowledge Distillation for Efficient Leaf Disease Classification

cs.CV · 2026-05-02 · unverdicted · novelty 4.0

AgriKD distills multi-level knowledge from Vision Transformers to lightweight CNNs, achieving comparable leaf disease classification accuracy with 172x fewer parameters and 18-22x faster inference.

citing papers explorer

Showing 12 of 12 citing papers.

Empirical Evidence for Simply Connected Decision Regions in Image Classifiers cs.CV · 2026-05-07 · unverdicted · none · ref 12
Empirical tests with quad-mesh filling indicate that decision regions in modern image classifiers are simply connected.
One-Step Diffusion with Inverse Residual Fields for Unsupervised Industrial Anomaly Detection cs.CV · 2026-04-20 · unverdicted · none · ref 24
OSD-IRF performs unsupervised industrial anomaly detection with a single diffusion step by evaluating anomalies in inverse residual field space under a Gaussian, delivering SOTA or competitive results with roughly 2x speedup.
The Multi-Block DC Function Class: Theory, Algorithms, and Applications math.OC · 2026-04-19 · unverdicted · none · ref 104
The Multi-Block DC class admits polynomial-size DC decompositions for problems that require exponential size under standard DC programming and supplies explicit constructive formulations for deep ReLU networks together with convergent batch and stochastic algorithms.
Low Latency Gaze Tracking via Latent Optical Sensing cs.CV · 2026-05-18 · unverdicted · none · ref 72
A hardware prototype performs gaze estimation by optically encoding task-relevant features with a microlens array and mask, captured on a 4x4 phototransistor array and decoded by a small neural network, reaching 3.4 ms latency with competitive accuracy.
Venus-DeFakerOne: Unified Fake Image Detection & Localization cs.CV · 2026-05-13 · unverdicted · none · ref 108
DeFakerOne integrates InternVL2 and SAM2 into a single model that achieves state-of-the-art results on 39 detection and 9 localization benchmarks for unified fake image detection and localization.
PragLocker: Protecting Agent Intellectual Property in Untrusted Deployments via Non-Portable Prompts cs.CR · 2026-05-07 · unverdicted · none · ref 57 · 2 links
PragLocker generates function-preserving but non-portable prompts for LLM agents via code-symbol semantic anchoring followed by target-model feedback noise injection.
DyGRO-VLA: Cross-Task Scaling of Vision-Language-Action Models via Dynamic Grouped Residual Optimization cs.RO · 2026-05-17 · unverdicted · none · ref 71
DyGRO-VLA is a two-stage optimization framework for cross-task scaling of Vision-Language-Action models via dynamic grouped residual optimization in RL.
Higher Resolution, Better Generalization: Unlocking Visual Scaling in Deep Reinforcement Learning cs.LG · 2026-05-11 · unverdicted · none · ref 20
Higher-resolution observations with global-average-pooling encoders improve RL performance and generalization by enabling more localized visual attention, yielding up to 28% gains over standard Impala encoders.
Lightning Unified Video Editing via In-Context Sparse Attention cs.CV · 2026-05-06 · unverdicted · none · ref 167
ISA prunes low-saliency context tokens and routes queries by sharpness to either full or 0-th order Taylor sparse attention, enabling LIVEditor to cut attention latency ~60% while beating prior video editing methods on three benchmarks.
Learning Invariant Modality Representation for Robust Multimodal Learning from a Causal Inference Perspective cs.LG · 2026-04-20 · unverdicted · none · ref 50
CmIR uses causal inference to separate invariant causal representations from spurious ones in multimodal data, improving generalization under distribution shifts and noise via invariance, mutual information, and reconstruction constraints.
ESsEN: Training Compact Discriminative Vision-Language Transformers in a Low-Resource Setting cs.CV · 2026-04-20 · unverdicted · none · ref 44
ESsEN is a parameter-efficient two-tower vision-language transformer that matches larger models on discriminative tasks after training end-to-end with limited data and resources.
AgriKD: Cross-Architecture Knowledge Distillation for Efficient Leaf Disease Classification cs.CV · 2026-05-02 · unverdicted · none · ref 47
AgriKD distills multi-level knowledge from Vision Transformers to lightweight CNNs, achieving comparable leaf disease classification accuracy with 172x fewer parameters and 18-22x faster inference.