hub Tool reference

Deep residual learning for image recognition

· 2016

Tool reference. 80% of classified Pith citations use this work as a method, library, or software dependency, not as a substantive claim.

69 Pith papers citing it

Method reference 80% of classified citations

browse 69 citing papers

hub tools

JSON dossier citing papers JSON

citation-role summary

method 8 background 1 baseline 1

citation-polarity summary

use method 8 background 1 baseline 1

representative citing papers

iMiGUE-3K: A Large-Scale Benchmark for Micro-Gesture Analysis with Self-Supervised Learning

cs.CV · 2026-05-16 · unverdicted · novelty 8.0

iMiGUE-3K is the largest in-the-wild micro-gesture video dataset with 3.4K clips and 37M frames from real interviews, supporting self-supervised foundation models and benchmarks that show micro-gestures improve emotion understanding.

MetaEarth-MM: Unified Multimodal Remote Sensing Image Generation with Scene-centered Joint Modeling

cs.CV · 2026-05-19 · conditional · novelty 7.0

MetaEarth-MM unifies multi-modal remote sensing image generation and any-to-any translation across five modalities via scene-centered joint modeling on the new EarthMM dataset.

Interactive State Space Model with Cross-Modal Local Scanning for Depth Super-Resolution

cs.CV · 2026-05-12 · unverdicted · novelty 7.0

A Mamba-based interactive state space model with cross-modal local scanning achieves competitive guided depth super-resolution performance at linear computational cost.

Classification-Head Bias in Class-Level Machine Unlearning: Diagnosis, Mitigation, and Evaluation

cs.LG · 2026-05-09 · conditional · novelty 7.0

Class-level unlearning shortcuts via bias suppression in the classification head; new bias-aware training mechanisms and bias-specific metrics are introduced to diagnose and reduce this dependence.

InterMesh: Explicit Interaction-Aware End-to-End Multi-Person Human Mesh Recovery

cs.CV · 2026-05-06 · conditional · novelty 7.0

InterMesh explicitly incorporates human-object interaction semantics into multi-person mesh recovery via a detector and two lightweight modules, delivering up to 9.9% MPJPE reduction on interaction-heavy datasets.

ShapeGrasp: Simultaneous Visuo-Haptic Shape Completion and Grasping for Improved Robot Manipulation

cs.RO · 2026-05-04 · conditional · novelty 7.0

ShapeGrasp improves grasp success on unknown objects to 84-91% by iteratively updating a 3D shape model with visuo-haptic feedback during real-world grasp attempts.

Learning from Compressed CT: Feature Attention Style Transfer and Structured Factorized Projections for Resource-Efficient Medical Image Analysis

cs.CV · 2026-05-01 · unverdicted · novelty 7.0

CT-Lite combines Feature Attention Style Transfer (FAST) and Structured Factorized Projections (SFP) with contrastive learning to reach AUROC within 5-7% of uncompressed baselines on compressed CT volumes across three datasets while using far fewer parameters.

Hierarchical Spatio-Channel Clustering for Efficient Model Compression in Medical Image Analysis

cs.CV · 2026-04-25 · unverdicted · novelty 7.0

A spatio-channel clustering framework for CNN compression reduces FLOPs by 81% and raises brain tumor MRI classification accuracy from 87.76% to 89.80% compared with global SVD and Tucker baselines.

Latent Space Probing for Adult Content Detection in Video Generative Models

cs.CV · 2026-04-25 · unverdicted · novelty 7.0

Latent space probing on CogVideoX achieves 97.29% F1 for adult content detection on a new 11k-clip dataset with 4-6ms overhead.

Channel-Level Semantic Perturbations: Unlearnable Examples for Diverse Training Paradigms

cs.LG · 2026-04-18 · unverdicted · novelty 7.0

Unlearnable examples fail under pretraining-finetuning due to semantic filtering by frozen layers, but Shallow Semantic Camouflage restores effectiveness by confining perturbations to semantically valid subspaces.

Physically-Induced Atmospheric Adversarial Perturbations: Enhancing Transferability and Robustness in Remote Sensing Image Classification

cs.CV · 2026-04-16 · unverdicted · novelty 7.0

FogFool creates fog-based adversarial perturbations using Perlin noise optimization to achieve high black-box transferability (83.74% TASR) and robustness to defenses in remote sensing classification.

CDPR: Cross-modal Diffusion with Polarization for Reliable Monocular Depth Estimation

cs.CV · 2026-04-13 · unverdicted · novelty 7.0

CDPR integrates polarization priors into a diffusion-based monocular depth estimator via shared latent space and adaptive gating, outperforming RGB-only methods in challenging scenes.

Generalized Small Object Detection:A Point-Prompted Paradigm and Benchmark

cs.CV · 2026-04-03 · unverdicted · novelty 7.0

TinySet-9M dataset and DEAL point-prompted framework deliver 31.4% relative AP75 gain over supervised baselines for small object detection with one click at inference and generalization to unseen categories.

Sparse Bayesian Learning Algorithms Revisited: From Learning Majorizers to Structured Algorithmic Learning using Neural Networks

eess.SP · 2026-04-02 · conditional · novelty 7.0

SBL algorithms are unified under majorization-minimization with new convergence results, and a dimension-invariant neural network learns superior data-driven update rules that generalize across matrices and parameters.

Beyond Corner Patches: Semantics-Aware Backdoor Attack in Federated Learning

cs.CR · 2026-03-31 · unverdicted · novelty 7.0

SABLE shows that semantics-aware natural triggers enable effective backdoor attacks in federated learning against multiple aggregation rules while preserving benign accuracy.

Membership Inference for Contrastive Pre-training Models with Text-only PII Queries

cs.CR · 2026-03-15 · unverdicted · novelty 7.0

UMID infers membership in contrastive pre-training data using only text queries by performing latent inversion and comparing similarity and variability signals to synthetic gibberish references via unsupervised anomaly detection.

CBEN -- A Multimodal Machine Learning Dataset for Cloud Robust Remote Sensing Image Understanding

cs.CV · 2026-02-13 · accept · novelty 7.0

CBEN provides paired optical-radar images with cloud occlusion, revealing 23-33 point AP drops in clear-sky trained models and 17-29 point relative gains when models are trained on cloudy data.

CoLA-Flow Policy: Temporally Coherent Imitation Learning via Continuous Latent Action Flow Matching for Robotic Manipulation

cs.RO · 2026-01-30 · unverdicted · novelty 7.0 · 2 refs

CoLA-Flow Policy encodes action sequences into a continuous latent space and learns an explicit flow there, yielding near-single-step inference with up to 93.7% smoother trajectories and 25-point higher task success than raw-action flow baselines.

Building Deep Graph Predictors with Graph Imitation Learning

cs.CV · 2026-01-21 · unverdicted · novelty 7.0

GRAIL trains graph predictors via imitation learning by modeling generation as sequential decisions on partial graph embeddings, matching or exceeding prior methods on 18 benchmarks.

Re-Key-Free, Risky-Free: Adaptable Model Usage Control

cs.CR · 2025-11-24 · unverdicted · novelty 7.0

AdaLoc keeps a model locked to authorized users by confining all post-deployment updates to a chosen subset of weights, preserving both task performance for authorized use and near-random accuracy for unauthorized use across vision and language models.

TAR: Text Semantic Assisted Cross-modal Image Registration Framework for Optical and SAR Images

cs.CV · 2026-05-12 · unverdicted · novelty 6.0

TAR uses frozen text encoders on remote sensing scene descriptions to boost high-level features for coarse-to-fine optical-SAR image registration under large deformations.

Reward-Guided Semantic Evolution for Test-time Adaptive Object Detection

cs.CV · 2026-05-06 · unverdicted · novelty 6.0

RGSE adapts text embeddings at test time via evolutionary search, using cosine similarity rewards from high-confidence visual proposals to improve open-vocabulary object detection under distribution shifts.

RFPrompt: Prompt-Based Expert Adaptation of the Large Wireless Model for Modulation Classification

cs.LG · 2026-05-05 · unverdicted · novelty 6.0

RFPrompt adapts the Large Wireless Model via deep prompt tokens to improve out-of-distribution robustness in modulation classification while training only a small number of parameters.

MSACT: Multistage Spatial Alignment for Stable Low-Latency Fine Manipulation

cs.RO · 2026-05-01 · unverdicted · novelty 6.0

MSACT improves localization stability and task success rates in limited-data bimanual manipulation by extracting stable 2D attention points and aligning predicted attention sequences across frames without keypoint labels.

citing papers explorer

Showing 4 of 4 citing papers after filters.

GFSR: Geometric Fidelity and Spatial Refinement for Reliable Lane Detection cs.CV · 2026-05-22 · unreviewed · ref 40
Lowering the Barrier to IREX Participation: Open-Source Algorithms, Toolkit, and Benchmarking for Iris Recognition cs.CV · 2026-05-20 · unreviewed · ref 43
Explainable AI in Speaker Recognition -- Making Latent Representations Understandable eess.AS · 2026-04-25 · unreviewed · ref 1
EmbodiTTA: Resource-Efficient Test-Time Adaptation for Embodied Visual Systems cs.LG · 2025-05-02 · unreviewed · ref 28

Deep residual learning for image recognition

hub tools

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer