Deep residual learning for image recognition

Kaiming He, Xiangyu Zhang, Shaoqing Ren, Jian Sun

7 Pith papers cite this work. Polarity classification is still indexing.

7 Pith papers citing it

browse 7 citing papers

citation-role summary

background 3

citation-polarity summary

background 3

representative citing papers

nuScenes: A multimodal dataset for autonomous driving

cs.LG · 2019-03-26 · accept · novelty 8.0

nuScenes provides the first public autonomous-driving dataset that includes synchronized 360-degree data from cameras, radars, and lidar together with 3D bounding-box annotations across 1000 scenes.

Scalable Diffusion Models with Transformers

cs.CV · 2022-12-19 · unverdicted · novelty 7.0

DiTs achieve SOTA FID of 2.27 on ImageNet 256x256 by scaling transformer-based latent diffusion models, with performance improving consistently as Gflops increase.

PRIX: Learning to Plan from Raw Pixels for End-to-End Autonomous Driving

cs.CV · 2025-07-23 · unverdicted · novelty 6.0

PRIX presents an efficient camera-only planner with a novel CaRT module that matches larger multimodal models on NavSim and nuScenes while reducing model size and inference time.

No Forgetting Learning: Buffer-free Continual Learning Classification

cs.LG · 2025-03-06 · unverdicted · novelty 6.0

NFL is a buffer-free continual learning framework that decomposes networks, applies stepwise freezing with knowledge distillation, and adds an auto-encoder in NFL+ to match replay-based performance on image benchmarks while using only 2.53% of the memory.

Learn2Synth: Learning Optimal Data Synthesis Using Hypergradients for Brain Image Segmentation

cs.CV · 2024-11-23 · unverdicted · novelty 6.0

Learn2Synth optimizes data synthesis parameters with hypergradients to train segmentation networks solely on synthetic brain images that generalize to real scans.

MM-REACT: Prompting ChatGPT for Multimodal Reasoning and Action

cs.CV · 2023-03-20 · unverdicted · novelty 6.0

MM-REACT uses textual prompts to let ChatGPT collaborate with external vision experts for zero-shot multimodal reasoning and action on advanced visual tasks.

YOLOX: Exceeding YOLO Series in 2021

cs.CV · 2021-07-18 · accept · novelty 6.0

YOLOX exceeds prior YOLO models by adopting anchor-free detection, decoupled heads, and SimOTA assignment to reach 50.0% AP on COCO for the large variant.

citing papers explorer

Showing 7 of 7 citing papers.

nuScenes: A multimodal dataset for autonomous driving cs.LG · 2019-03-26 · accept · none · ref 39
nuScenes provides the first public autonomous-driving dataset that includes synchronized 360-degree data from cameras, radars, and lidar together with 3D bounding-box annotations across 1000 scenes.
Scalable Diffusion Models with Transformers cs.CV · 2022-12-19 · unverdicted · none · ref 15
DiTs achieve SOTA FID of 2.27 on ImageNet 256x256 by scaling transformer-based latent diffusion models, with performance improving consistently as Gflops increase.
PRIX: Learning to Plan from Raw Pixels for End-to-End Autonomous Driving cs.CV · 2025-07-23 · unverdicted · none · ref 20
PRIX presents an efficient camera-only planner with a novel CaRT module that matches larger multimodal models on NavSim and nuScenes while reducing model size and inference time.
No Forgetting Learning: Buffer-free Continual Learning Classification cs.LG · 2025-03-06 · unverdicted · none · ref 13
NFL is a buffer-free continual learning framework that decomposes networks, applies stepwise freezing with knowledge distillation, and adds an auto-encoder in NFL+ to match replay-based performance on image benchmarks while using only 2.53% of the memory.
Learn2Synth: Learning Optimal Data Synthesis Using Hypergradients for Brain Image Segmentation cs.CV · 2024-11-23 · unverdicted · none · ref 30
Learn2Synth optimizes data synthesis parameters with hypergradients to train segmentation networks solely on synthetic brain images that generalize to real scans.
MM-REACT: Prompting ChatGPT for Multimodal Reasoning and Action cs.CV · 2023-03-20 · unverdicted · none · ref 12
MM-REACT uses textual prompts to let ChatGPT collaborate with external vision experts for zero-shot multimodal reasoning and action on advanced visual tasks.
YOLOX: Exceeding YOLO Series in 2021 cs.CV · 2021-07-18 · accept · none · ref 9
YOLOX exceeds prior YOLO models by adopting anchor-free detection, decoupled heads, and SimOTA assignment to reach 50.0% AP on COCO for the large variant.

Deep residual learning for image recognition

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer