A convnet for the 2020s

· 2022

7 Pith papers cite this work. Polarity classification is still indexing.

7 Pith papers citing it

browse 7 citing papers

citation-role summary

baseline 2

citation-polarity summary

baseline 2

representative citing papers

From Scene to Object: Text-Guided Dual-Gaze Prediction

cs.CV · 2026-04-22 · unverdicted · novelty 7.0

DualGaze-VLM uses text guidance and a new object-level dataset G-W3DA to predict driver attention, beating prior models by up to 17.8% in similarity metrics and passing human visual Turing tests at 88%.

Lowering the Barrier to IREX Participation: Open-Source Algorithms, Toolkit, and Benchmarking for Iris Recognition

cs.CV · 2026-05-20 · accept · novelty 6.0

Open-source neural network iris matchers (TripletIris using batch-hard triplet loss and ArcIris using ArcFace loss) plus compliant C++ implementations of HDBIF and CRYPTS are released, evaluated on IREX X and eight academic datasets, and accompanied by segmentation tools to lower entry barriers for

Light-ResKAN: A Parameter-Sharing Lightweight KAN with Gram Polynomials for Efficient SAR Image Recognition

cs.CV · 2026-04-02 · unverdicted · novelty 6.0

Light-ResKAN reaches 99.09% accuracy on MSTAR SAR images with 82.9 times fewer FLOPs and 163.78 times fewer parameters than VGG16 by combining KAN convolutions, Gram polynomials, and channel-wise parameter sharing.

Validating Computational Markers of Depressive Behavior: Cross-Linguistic Speech-Based Depression Detection with Neurophysiological Validation

eess.AS · 2026-04-02 · unverdicted · novelty 6.0

The CDMA speech depression model generalizes across languages, favors emotional speech, and aligns with EEG markers of emotional dysregulation.

Universal Smoothness via Bernstein Polynomials: A Constructive Approximation Approach for Activation Functions

cs.AI · 2026-05-04 · unverdicted · novelty 5.0

BerLU constructs a C1-differentiable activation with Lipschitz constant 1 via Bernstein polynomial approximation, showing better performance and efficiency than baselines on image classification with ViTs and CNNs.

UniPASE: A Generative Model for Universal Speech Enhancement with High Fidelity and Low Hallucinations

eess.AS · 2026-04-16 · unverdicted · novelty 5.0

UniPASE extends the PASE framework with DeWavLM-Omni to convert degraded speech into high-fidelity, low-hallucination audio across sampling rates via phonetic enhancement, acoustic adaptation, and multi-rate vocoding.

AI Powered Image Analysis for Phishing Detection

cs.CV · 2026-04-15 · unverdicted · novelty 3.0

ConvNeXt-Tiny outperforms ViT-Base with higher F1-score and better efficiency for image-based phishing detection from webpage screenshots when decision thresholds are optimized.

citing papers explorer

Showing 7 of 7 citing papers.

From Scene to Object: Text-Guided Dual-Gaze Prediction cs.CV · 2026-04-22 · unverdicted · none · ref 33
DualGaze-VLM uses text guidance and a new object-level dataset G-W3DA to predict driver attention, beating prior models by up to 17.8% in similarity metrics and passing human visual Turing tests at 88%.
Lowering the Barrier to IREX Participation: Open-Source Algorithms, Toolkit, and Benchmarking for Iris Recognition cs.CV · 2026-05-20 · accept · none · ref 47
Open-source neural network iris matchers (TripletIris using batch-hard triplet loss and ArcIris using ArcFace loss) plus compliant C++ implementations of HDBIF and CRYPTS are released, evaluated on IREX X and eight academic datasets, and accompanied by segmentation tools to lower entry barriers for
Light-ResKAN: A Parameter-Sharing Lightweight KAN with Gram Polynomials for Efficient SAR Image Recognition cs.CV · 2026-04-02 · unverdicted · none · ref 63
Light-ResKAN reaches 99.09% accuracy on MSTAR SAR images with 82.9 times fewer FLOPs and 163.78 times fewer parameters than VGG16 by combining KAN convolutions, Gram polynomials, and channel-wise parameter sharing.
Validating Computational Markers of Depressive Behavior: Cross-Linguistic Speech-Based Depression Detection with Neurophysiological Validation eess.AS · 2026-04-02 · unverdicted · none · ref 49
The CDMA speech depression model generalizes across languages, favors emotional speech, and aligns with EEG markers of emotional dysregulation.
Universal Smoothness via Bernstein Polynomials: A Constructive Approximation Approach for Activation Functions cs.AI · 2026-05-04 · unverdicted · none · ref 33
BerLU constructs a C1-differentiable activation with Lipschitz constant 1 via Bernstein polynomial approximation, showing better performance and efficiency than baselines on image classification with ViTs and CNNs.
UniPASE: A Generative Model for Universal Speech Enhancement with High Fidelity and Low Hallucinations eess.AS · 2026-04-16 · unverdicted · none · ref 70
UniPASE extends the PASE framework with DeWavLM-Omni to convert degraded speech into high-fidelity, low-hallucination audio across sampling rates via phonetic enhancement, acoustic adaptation, and multi-rate vocoding.
AI Powered Image Analysis for Phishing Detection cs.CV · 2026-04-15 · unverdicted · none · ref 3
ConvNeXt-Tiny outperforms ViT-Base with higher F1-score and better efficiency for image-based phishing detection from webpage screenshots when decision thresholds are optimized.

A convnet for the 2020s

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer