hub

Title resolution pending

Girshick, Ross · 2015 · cs.CV · arXiv 1504.08083

11 Pith papers cite this work. Polarity classification is still indexing.

11 Pith papers citing it

open full Pith review browse 11 citing papers arXiv PDF

Title metadata for this work has not finished resolving. The hub is built from the citation graph; the title resolver retries DOI and OpenAlex on its next pass.

abstract

This paper proposes a Fast Region-based Convolutional Network method (Fast R-CNN) for object detection. Fast R-CNN builds on previous work to efficiently classify object proposals using deep convolutional networks. Compared to previous work, Fast R-CNN employs several innovations to improve training and testing speed while also increasing detection accuracy. Fast R-CNN trains the very deep VGG16 network 9x faster than R-CNN, is 213x faster at test-time, and achieves a higher mAP on PASCAL VOC 2012. Compared to SPPnet, Fast R-CNN trains VGG16 3x faster, tests 10x faster, and is more accurate. Fast R-CNN is implemented in Python and C++ (using Caffe) and is available under the open-source MIT License at https://github.com/rbgirshick/fast-rcnn.

hub tools

JSON dossier citing papers JSON arXiv source

citation-role summary

background 1 method 1

citation-polarity summary

background 1 use method 1

representative citing papers

Pose Estimation for Non-Cooperative Rendezvous Using Neural Networks

cs.CV · 2019-06-24 · unverdicted · novelty 7.0

SPN is a CNN that detects a spacecraft bounding box, classifies then regresses attitude, and optimizes position via Gauss-Newton, achieving degree-level attitude and cm-level position errors on real images after training only on synthetic data.

Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding

cs.CV · 2015-10-01 · conditional · novelty 7.0

A pruning-quantization-Huffman pipeline compresses deep neural networks 35-49x without accuracy loss.

AMAR: Lightweight Attention-Based Multi-User Activity Recognition from Wi-Fi CSI

eess.SP · 2026-05-20 · unverdicted · novelty 6.0

AMAR uses a transformer with learnable query embeddings for set-based prediction of concurrent activities from composite Wi-Fi CSI, combined with edge feature extraction and vector quantization for bandwidth-efficient deployment.

A Multitask Network for Localization and Recognition of Text in Images

cs.CL · 2019-06-21 · unverdicted · novelty 6.0

Presents an end-to-end multitask CNN with FPN, dynamic RoI pooling, and convolutional attention for simultaneous lexicon-free text localization and recognition in complex images.

CalibFree: Self-Supervised View Feature Separation for Calibration-Free Multi-Camera Multi-Object Tracking

cs.CV · 2026-05-10 · unverdicted · novelty 6.0

CalibFree enables calibration-free multi-camera tracking via self-supervised feature separation through single-view distillation and cross-view reconstruction, reporting 3% higher accuracy and 7.5% better F1 on tested datasets.

Efficient Multi-Domain Network Learning by Covariance Normalization

cs.CV · 2019-06-24 · unverdicted · novelty 5.0

CovNorm reduces parameters in domain-adaptive layers via two PCAs and a mini-adaptation layer, enabling efficient multi-domain learning with performance close to full fine-tuning.

GarmNet: Improving Global with Local Perception for Robotic Laundry Folding

cs.RO · 2019-06-30 · unverdicted · novelty 4.0

GarmNet jointly localizes garments and detects grasp landmarks on the CloPeMa dataset, reducing localization error by 24.7% when landmark detection is included.

Label-Efficient School Detection from Aerial Imagery via Weakly Supervised Pretraining and Fine-Tuning

cs.CV · 2026-05-05 · unverdicted · novelty 4.0

A two-stage weakly supervised pipeline pretrains on auto-generated school labels from sparse points and fine-tunes on only 50 manual examples to achieve strong detection performance in aerial imagery.

Learning to count small and clustered objects with application to bacterial colonies

cs.CV · 2026-04-21 · unverdicted · novelty 4.0

ACFamNet Pro reaches 9.64% mean normalized absolute error on bacterial colony images under 5-fold cross-validation, beating FamNet by 12.71%.

RGB-D image-based Object Detection: from Traditional Methods to Deep Learning Techniques

cs.CV · 2019-07-22 · unverdicted · novelty 2.0

A survey of RGB-D object detection from traditional hand-crafted features with machine learning to deep learning techniques.

Understanding Deep Learning Techniques for Image Segmentation

cs.CV · 2019-07-13 · unverdicted · novelty 1.0

A 2019 survey that categorizes and intuitively explains major deep learning techniques for image segmentation, progressing from classical methods to modern neural architectures.

citing papers explorer

Showing 11 of 11 citing papers.

Pose Estimation for Non-Cooperative Rendezvous Using Neural Networks cs.CV · 2019-06-24 · unverdicted · none · ref 31 · internal anchor
SPN is a CNN that detects a spacecraft bounding box, classifies then regresses attitude, and optimizes position via Gauss-Newton, achieving degree-level attitude and cm-level position errors on real images after training only on synthetic data.
Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding cs.CV · 2015-10-01 · conditional · none · ref 5
A pruning-quantization-Huffman pipeline compresses deep neural networks 35-49x without accuracy loss.
AMAR: Lightweight Attention-Based Multi-User Activity Recognition from Wi-Fi CSI eess.SP · 2026-05-20 · unverdicted · none · ref 33 · internal anchor
AMAR uses a transformer with learnable query embeddings for set-based prediction of concurrent activities from composite Wi-Fi CSI, combined with edge feature extraction and vector quantization for bandwidth-efficient deployment.
A Multitask Network for Localization and Recognition of Text in Images cs.CL · 2019-06-21 · unverdicted · none · ref 6 · internal anchor
Presents an end-to-end multitask CNN with FPN, dynamic RoI pooling, and convolutional attention for simultaneous lexicon-free text localization and recognition in complex images.
CalibFree: Self-Supervised View Feature Separation for Calibration-Free Multi-Camera Multi-Object Tracking cs.CV · 2026-05-10 · unverdicted · none · ref 24
CalibFree enables calibration-free multi-camera tracking via self-supervised feature separation through single-view distillation and cross-view reconstruction, reporting 3% higher accuracy and 7.5% better F1 on tested datasets.
Efficient Multi-Domain Network Learning by Covariance Normalization cs.CV · 2019-06-24 · unverdicted · none · ref 9 · internal anchor
CovNorm reduces parameters in domain-adaptive layers via two PCAs and a mini-adaptation layer, enabling efficient multi-domain learning with performance close to full fine-tuning.
GarmNet: Improving Global with Local Perception for Robotic Laundry Folding cs.RO · 2019-06-30 · unverdicted · none · ref 5 · internal anchor
GarmNet jointly localizes garments and detects grasp landmarks on the CloPeMa dataset, reducing localization error by 24.7% when landmark detection is included.
Label-Efficient School Detection from Aerial Imagery via Weakly Supervised Pretraining and Fine-Tuning cs.CV · 2026-05-05 · unverdicted · none · ref 11
A two-stage weakly supervised pipeline pretrains on auto-generated school labels from sparse points and fine-tunes on only 50 manual examples to achieve strong detection performance in aerial imagery.
Learning to count small and clustered objects with application to bacterial colonies cs.CV · 2026-04-21 · unverdicted · none · ref 29
ACFamNet Pro reaches 9.64% mean normalized absolute error on bacterial colony images under 5-fold cross-validation, beating FamNet by 12.71%.
RGB-D image-based Object Detection: from Traditional Methods to Deep Learning Techniques cs.CV · 2019-07-22 · unverdicted · none · ref 27 · internal anchor
A survey of RGB-D object detection from traditional hand-crafted features with machine learning to deep learning techniques.
Understanding Deep Learning Techniques for Image Segmentation cs.CV · 2019-07-13 · unverdicted · none · ref 69 · internal anchor
A 2019 survey that categorizes and intuitively explains major deep learning techniques for image segmentation, progressing from classical methods to modern neural architectures.

Title resolution pending

hub tools

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer