A Targeted Acceleration and Compression Framework for Low bit Neural Networks

Biao Qian; Yang Wang

REVIEW 2 major objections 1 minor 41 references

Reviewed by Pith at T0; open to challenge.

T0 means a machine referee read the full paper against a public rubric. The mark states how deep the mechanical check went, never who wrote it. the ladder, T0–T4 →

Challenge this review Re-run · record.json Download PDF Read on arXiv ↗

T0 review · grok-4.3

The TAC framework improves 1-bit deep neural network accuracy by more than 6 percentage points over prior methods by handling convolutional and fully connected layers separately.

2026-05-25 00:52 UTC pith:UDAL3FJU

load-bearing objection The paper's main idea is to binarize only conv layers while pruning and low-bit quantizing FC layers instead, claiming >6pp accuracy gains over prior 1-bit work, but the abstract gives no data to back the separation as the cause. the 2 major comments →

arxiv 1907.05271 v1 pith:UDAL3FJU submitted 2019-07-09 cs.CV

A Targeted Acceleration and Compression Framework for Low bit Neural Networks

Biao Qian , Yang Wang This is my paper

classification cs.CV

keywords 1-bit neural networksbinarizationnetwork pruninglow-bit quantizationconvolutional layersfully connected layersmodel compressionimage classification

verification ladder T0 review T1 audit T2 compute T3 formal T4 reserved

The pith

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper proposes a Targeted Acceleration and Compression framework for 1-bit DNNs that separates convolutional layers from fully connected layers and optimizes each type on its own. Convolutional layers receive full binarization of both weights and activations while fully connected layers instead receive pruning plus low-bit quantization. This separation is meant to avoid the accuracy penalty that comes from binarizing fully connected layers, whose efficiency gains do not offset the performance drop. Results on CIFAR-10, CIFAR-100, and ImageNet show the approach raises accuracy substantially compared with earlier uniform binarization techniques.

Core claim

By separating the convolutional and fully connected layers and optimizing them individually, with both activations and weights binarized only in the convolutional layers while the binarization operation is replaced by network pruning and low-bit quantization in the fully connected layers, the accuracy of 1-bit deep neural networks can be significantly improved.

What carries the argument

The Targeted Acceleration and Compression (TAC) framework, which separates convolutional layers for binarization from fully connected layers for pruning and low-bit quantization and optimizes the two types individually.

Load-bearing premise

The premise that binarizing fully connected layers produces accuracy losses that their acceleration and compression effects cannot offset, and that separating and optimizing the two layer types individually will recover those losses.

What would settle it

A controlled experiment on ImageNet that applies the same network architecture but forces binarization on the fully connected layers as well and measures whether accuracy drops below the TAC version by a comparable margin.

Watch this falsifier — get emailed when new claim-graph text bears on it.

If this is right

1-bit networks reach higher top-1 and top-5 accuracy on CIFAR-10, CIFAR-100, and ImageNet while retaining their computational efficiency.
Uniform binarization across every layer type is shown to be suboptimal once layer-specific treatment is allowed.
The framework demonstrates that pruning combined with low-bit quantization can serve as a drop-in replacement for binarization in fully connected layers.
Accuracy gains appear consistently across small-scale and large-scale image classification benchmarks.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same separation principle could be tested on other low-bit quantization schemes beyond strict 1-bit networks.
Hardware designs might allocate different compute paths to the two layer types rather than treating the entire model uniformly.
The approach could be combined with existing training tricks such as knowledge distillation to push accuracy further.
Edge-device deployments that value both speed and accuracy might adopt this selective treatment as a default pattern.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit.

Desk Editor's Note

The paper's main idea is to binarize only conv layers while pruning and low-bit quantizing FC layers instead, claiming >6pp accuracy gains over prior 1-bit work, but the abstract gives no data to back the separation as the cause.

read the letter

The paper's core proposal is a Targeted Acceleration and Compression framework that splits convolutional and fully connected layers. Conv layers get full binarization on weights and activations. FC layers skip binarization entirely and instead use pruning plus low-bit quantization. The authors argue that binarizing FC layers costs too much accuracy relative to the compression and speed gains, so they optimize the two layer types separately. They test on CIFAR-10, CIFAR-100, and ImageNet and state that this yields more than 6 percentage points better accuracy than existing 1-bit methods.

Referee Report

2 major / 1 minor

Summary. The paper proposes a Targeted Acceleration and Compression (TAC) framework for 1-bit DNNs. Convolutional layers are binarized for both weights and activations, while fully connected layers use network pruning and low-bit quantization instead of binarization, on the grounds that binarizing FC layers produces accuracy loss not offset by efficiency gains. The layers are separated and optimized individually. Experiments on CIFAR-10, CIFAR-100, and ImageNet report that TAC significantly improves 1-bit network accuracy and outperforms prior state-of-the-art methods by more than 6 percentage points.

Significance. If the reported gains hold and are shown to stem from the targeted separation rather than simply from leaving FC layers at higher precision, the framework could supply a pragmatic hybrid recipe for trading minimal efficiency for substantially better accuracy in binarized networks. The work would then usefully illustrate that uniform binarization across layer types is often suboptimal.

major comments (2)

[Abstract] Abstract and experimental results: the headline claim that TAC outperforms the state of the art by more than 6 percentage points is presented without tabulated baselines, absolute accuracies, standard deviations, or statistical tests, preventing verification of the central performance assertion.
[Experiments] The manuscript provides no ablation that compares the full TAC pipeline (explicit conv/FC separation plus per-type optimization) against a control that applies identical pruning + low-bit quantization to FC layers but without the separation step. Because the accuracy improvement is attributed specifically to the TAC construction, this missing control is load-bearing for the causal claim.

minor comments (1)

[Abstract] Abstract contains typographical inconsistencies (e.g., 'restrict s', stray capitalization of 'We consider').

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive comments. We address each point below and will revise the manuscript to improve clarity and strengthen the claims where possible.

read point-by-point responses

Referee: [Abstract] Abstract and experimental results: the headline claim that TAC outperforms the state of the art by more than 6 percentage points is presented without tabulated baselines, absolute accuracies, standard deviations, or statistical tests, preventing verification of the central performance assertion.

Authors: We agree the abstract would benefit from more concrete numbers to support the headline claim. In revision we will update the abstract to report the absolute top-1 accuracies achieved by TAC on CIFAR-10, CIFAR-100 and ImageNet together with the exact margins over the cited baselines. The experimental section already contains the full comparison tables; we will add a cross-reference in the abstract. Standard deviations and statistical tests were not computed in the original single-run experiments; we will explicitly note this limitation rather than retroactively add them. revision: partial
Referee: [Experiments] The manuscript provides no ablation that compares the full TAC pipeline (explicit conv/FC separation plus per-type optimization) against a control that applies identical pruning + low-bit quantization to FC layers but without the separation step. Because the accuracy improvement is attributed specifically to the TAC construction, this missing control is load-bearing for the causal claim.

Authors: We acknowledge the value of an explicit ablation isolating the separation step. In the revised manuscript we will add a controlled experiment that applies the identical pruning-plus-low-bit-quantization recipe to the FC layers while keeping the convolutional layers binarized, but without the explicit layer-type separation and per-type optimization schedule used in TAC. Results of this control will be reported alongside the full TAC results to clarify the contribution of the separation itself. revision: yes

Circularity Check

0 steps flagged

No circularity: empirical framework proposal with no derivations or self-referential reductions.

full rationale

The paper presents a targeted framework for 1-bit networks based on an explicit assumption about FC-layer binarization costs, followed by empirical evaluation on standard datasets. No equations, derivations, fitted parameters renamed as predictions, or load-bearing self-citations appear in the provided text. The central claim rests on experimental outperformance rather than any mathematical chain that reduces to its own inputs by construction. This is a standard empirical methods paper whose validity is testable against external benchmarks, so the derivation chain is self-contained with no circular steps.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Abstract-only review provides no information on free parameters, axioms, or invented entities.

pith-pipeline@v0.9.0 · 5727 in / 855 out tokens · 18382 ms · 2026-05-25T00:52:12.859747+00:00 · methodology

0 comments

read the original abstract

1 bit deep neural networks (DNNs), of which both the activations and weights are binarized , are attracting more and more attention due to their high computational efficiency and low memory requirement . However, the drawback of large accuracy dropping also restrict s its application. In this paper, we propose a novel Targeted Acceleration and Compression (TAC) framework to improve the performance of 1 bit deep neural networks W e consider that the acceleration and compression effects of binarizing fully connected layer s are not sufficient to compensate for the accuracy loss caused by it In the proposed framework, t he convolutional and fully connected layer are separated and optimized i ndividually . F or the convolutional layer s , both the activations and weights are binarized. For the fully connected layer s, the binarization operation is re placed by network pruning and low bit quantization. The proposed framework is implemented on the CIFAR 10, CIFAR 100 and ImageNet ( ILSVRC 12 ) datasets , and experimental results show that the proposed TAC can significantly improve the accuracy of 1 bit deep neural networks and outperforms the state of the art by more than 6 percentage points .

discussion (0)

Reference graph

Works this paper leans on

41 extracted references · 41 canonical work pages · 8 internal anchors

[1]

However, the drawback of large accuracy dropping also restrict s its application

Abstract 1-bit deep neural networks (DNNs), of which both the activations and weights are binarized, are attracting more and more attention due to their high computational efficiency and low memory requirement. However, the drawback of large accuracy dropping also restrict s its application. In this paper, we propose a novel Targeted Acceleration and Comp...

work page
[2]

datasets, and experimental results show that the proposed TAC can significantly improve the accuracy of 1 -bit deep neural networks and outperforms the state-of-the-art by more than 6 percentage points

work page
[3]

Under the circumstances, the application of deep neural networks on mobile devices is gaining more and more attention

Introduction Recently, deep neural networks (DNNs) have widely applied in computer vision tasks, such as image classification [17] [18], object detection [24] and visual recognition [25][26][27][28][29][30][31][32][33] so on. Under the circumstances, the application of deep neural networks on mobile devices is gaining more and more attention. However, gre...

work page
[4]

Network pruning: Network pruning measures the redundancy on network structures like connections, neurons and filters with different rules, and removes unimportant parts

Related Work In this section, we mainly review the methods related to pruning and quantization. Network pruning: Network pruning measures the redundancy on network structures like connections, neurons and filters with different rules, and removes unimportant parts. In [1], all connections with weights below a certain threshold are removed from the pre -tr...

work page
[5]

Method In the section, we introduce our Targeted Acceleration and Compression (TAC) framework in detail. Considering the binarization of fully connected layers is not very necessary and can cause the accuracy loss , our TAC optimizes convolutional lay ers and fully connected layers individually and regards the process as two steps: accelerating convolutio...

work page
[6]

5.1 Datasets and Implement Details In this section, we briefly introduce the datasets, network structures and experiment settings in our experiments

Experiments We performed extensive experiments on CIFAR-10, CIFAR-100 and ImageNet datasets with VGG-9 and AlexNet architectures. 5.1 Datasets and Implement Details In this section, we briefly introduce the datasets, network structures and experiment settings in our experiments. CIFAR-10/100 [22]: This dataset consists of a training set of 50,000 and a te...

work page
[7]

For the hyper-parameters, we set the pruning rate at iterative steps as {0.2, 0

to binarize both the activations and weights of convolutional layers, and network pruning and quantization in [1] [4] to compress fully connected layers. For the hyper-parameters, we set the pruning rate at iterative steps as {0.2, 0. 4, 0.6, 0.7 , 0.75 } and the bit width of quantization as 4. For simplicity, we denote the networks trained with the TAC f...

work page
[8]

For the convolutional layers, both the activations and weights are binarized to speed up the computations

Conclusion In this paper, we have proposed a novel framework named Targeted Acceleration and Compression (TAC), where the convolutional and fully connected layer are separated and learned individually. For the convolutional layers, both the activations and weights are binarized to speed up the computations. For the fully connected layers, network pruning ...

work page
[9]

Learning both weights and connections for efficie nt neural network[C]//Advances in neural information processing systems

Han S, Pool J, Tran J, et al. Learning both weights and connections for efficie nt neural network[C]//Advances in neural information processing systems. 2015: 1135-1143

work page 2015
[10]

Learning efficient convolutional networks through network slimming[C]//Computer Vision (ICCV), 2017 IEEE International Conference on

Liu Z, Li J, Shen Z, et al. Learning efficient convolutional networks through network slimming[C]//Computer Vision (ICCV), 2017 IEEE International Conference on. IEEE, 2017: 2755-2763

work page 2017
[11]

Pruning Filters for Efficient ConvNets

Li H, Kadav A, Durdanovic I, et al. Pruning filters for efficient convnets[J]. arXiv preprint arXiv:1608.08710, 2016

work page internal anchor Pith review Pith/arXiv arXiv 2016
[12]

Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding

Han S, Mao H, Dally W J. Deep compression: Compressing deep neural networks with pruning, trained quantization and huffman coding[J]. arXiv preprint arXiv:1510.00149, 2015

work page internal anchor Pith review Pith/arXiv arXiv 2015
[13]

Accelerating Convolutional Networks via Global & Dynamic Filter Pruning[C]//IJCAI

Lin S, Ji R, Li Y , et al. Accelerating Convolutional Networks via Global & Dynamic Filter Pruning[C]//IJCAI. 2018: 2425-2432

work page 2018
[14]

Towards the Limit of Network Quantization

Choi Y , El-Khamy M, Lee J. Towards the limit of networ k quantization[J]. arXiv preprint arXiv:1612.01543, 2016

work page internal anchor Pith review Pith/arXiv arXiv 2016
[15]

Incremental Network Quantization: Towards Lossless CNNs with Low-Precision Weights

Zhou A, Yao A, Guo Y , et al. Incremental network quantization: Towards lossless cnns with low- precision weights[J]. arXiv preprint arXiv:1702.03044, 2017

work page internal anchor Pith review Pith/arXiv arXiv 2017
[16]

Ternary Weight Networks[J]

Li F, Zhang B, Liu B. Ternary Weight Networks[J]. 2016

work page 2016
[17]

Trained Ternary Quantization[J]

Zhu C, Han S, Mao H, et al. Trained Ternary Quantization[J]. 2016

work page 2016
[18]

Binaryconnect: Training deep neural networks with binary weights during propagations[C]//Advances in neural information processing systems

Courbariaux M, Bengio Y , David J P. Binaryconnect: Training deep neural networks with binary weights during propagations[C]//Advances in neural information processing systems. 2015: 3123- 3131

work page 2015
[19]

BinaryNet: Training Deep Neural Networks with Weights and Activations Constrained to +1 or -1[J]

Courbariaux M, Bengio Y . BinaryNet: Training Deep Neural Networks with Weights and Activations Constrained to +1 or -1[J]. 2016

work page 2016
[20]

XNOR-Net: ImageNet Classification Using Binary Convolutional Neural Networks[J]

Rastegari M, Ordonez V , Redmon J, et al. XNOR-Net: ImageNet Classification Using Binary Convolutional Neural Networks[J]. 2016:525-542

work page 2016
[21]

Deep Learning with Low Precision by Half-wave Gaussian Quantization

Cai Z, He X, Sun J, et al. Deep learning with low precision by half -wave gaussian quantization[J]. arXiv preprint arXiv:1702.00953, 2017

work page internal anchor Pith review Pith/arXiv arXiv 2017
[22]

From Hashing to CNNs: Training BinaryWeight Networks via Hashing[J]

Hu Q, Wang P, Cheng J. From Hashing to CNNs: Training BinaryWeight Networks via Hashing[J]. 2018

work page 2018
[23]

Towards Accurate Binary Convolutional Neural Network[J]

Lin X, Zhao C, Pan W. Towards Accurate Binary Convolutional Neural Network[J]. 2017

work page 2017
[24]

Two -Step Quantization for Low -bit Neural Networks[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition

Wang P, Hu Q, Zhang Y , et al. Two -Step Quantization for Low -bit Neural Networks[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2018: 4376-4384

work page 2018
[25]

Imagenet classification with deep convolutional neural networks[C]//Advances in neural information processing systems

Krizhevsky A, Sutskever I, Hinton G E. Imagenet classification with deep convolutional neural networks[C]//Advances in neural information processing systems. 2012: 1097-1105

work page 2012
[26]

Very Deep Convolutional Networks for Large-Scale Image Recognition

Simonyan K, Zisserman A. Very deep convolutional networks for large -scale image recognition[J]. arXiv preprint arXiv:1409.1556, 2014

work page internal anchor Pith review Pith/arXiv arXiv 2014
[27]

Imagenet: A large -scale hierarchical image database[C]//Computer Vision and Pattern Recognition, 2009

Deng J, Dong W, Socher R, et al. Imagenet: A large -scale hierarchical image database[C]//Computer Vision and Pattern Recognition, 2009. CVPR 2009. IEEE Conference on. Ieee, 2009: 248-255

work page 2009
[28]

Bi-Real Net: Enhancing the Performance of 1-bit CNNs With Improved Representational Capability and Advanced Training Algorithm

Liu Z, Wu B, Luo W, et al. Bi -Real Net: Enhancing the Performance of 1 -bit CNNs With Improved Representational Capability and Advanced Training Algorithm[J]. arXiv preprint arXiv:1808.00278, 2018

work page internal anchor Pith review Pith/arXiv arXiv 2018
[29]

Adam: A Method for Stochastic Optimization

Kingma D P, Ba J. Adam: A method for stochastic optimization[J]. arXiv preprint arXiv:1412.6980, 2014

work page internal anchor Pith review Pith/arXiv arXiv 2014
[30]

Learning multiple layers of features from tiny images[R]

Krizhevsky A, Hinton G. Learning multiple layers of features from tiny images[R]. Technical report, University of Toronto, 2009

work page 2009
[31]

Imagenet large scale visual recognition challenge[J]

Russakovsky O, Deng J, Su H, et al. Imagenet large scale visual recognition challenge[J]. International Journal of Computer Vision, 2015, 115(3): 211-252

work page 2015
[32]

Faster r -cnn: Towards real-time object detection with region proposal networks[C]//Advances in neural information processing systems

Ren S, He K, Girshick R, et al. Faster r -cnn: Towards real-time object detection with region proposal networks[C]//Advances in neural information processing systems. 2015: 91-99

work page 2015
[33]

Wang et al., Iterative Views Agreement: An Iterative Low -Rank based Structured Optimization Method to Multi-View Spectral Clustering

Y . Wang et al., Iterative Views Agreement: An Iterative Low -Rank based Structured Optimization Method to Multi-View Spectral Clustering. IJCAI 2016:2153-2159

work page 2016
[34]

Wang et al., Robust Subspace Clustering for Multi -view Data by Exploiting Correlation Consensus

Y . Wang et al., Robust Subspace Clustering for Multi -view Data by Exploiting Correlation Consensus. IEEE Transactions on Image Processing, 24(11):3939-3949, 2015

work page 2015
[35]

Wang et al., Multiview Spectral Clustering via Structured Low-Rank Matrix Factorization

Y . Wang et al., Multiview Spectral Clustering via Structured Low-Rank Matrix Factorization. IEEE Transactions on Neural Networks and Learning Systems, 29(10):4833-4843, 2018

work page 2018
[36]

L. Wu, Y . Wang and L. Shao. Cycle -Consistent Deep Generative Hashing for Cross -Modal Retrieval. IEEE Transactions on Image Processing,28(4):1602-1612, 2019

work page 2019
[37]

Wang et al., Effective Multi -Query Expansions: Collaborative Deep Networks for Robust Landmark Retrieval, IEEE Transactions on Image Processing, 26 (3), 1393-1404, 2017

Y . Wang et al., Effective Multi -Query Expansions: Collaborative Deep Networks for Robust Landmark Retrieval, IEEE Transactions on Image Processing, 26 (3), 1393-1404, 2017

work page 2017
[38]

Wu et al., Deep Adaptive Feature Embedding with Loc al Sample Distributions for Person Re - identification

L. Wu et al., Deep Adaptive Feature Embedding with Loc al Sample Distributions for Person Re - identification. Pattern Recognition, 73:275-288, 2018

work page 2018
[39]

Wu et al., What -and-Where to Match: Deep Spatially Multiplicative Integration Networks for Person Re-identification

L. Wu et al., What -and-Where to Match: Deep Spatially Multiplicative Integration Networks for Person Re-identification. Pattern Recognition, 76:727-738, 2018

work page 2018
[40]

Wu, Y Wang, L Shao, M Wang，3-D PersonVLAD: Learning Deep Global Representations for Video-Based Person Reidentification

L. Wu, Y Wang, L Shao, M Wang，3-D PersonVLAD: Learning Deep Global Representations for Video-Based Person Reidentification. IEEE Transactions on Neural Networks and Learning Systems, 2019

work page 2019
[41]

L Wu, Y Wang, X Li, J Gao, Deep Attention-based Spatially Recursive Networks for Fine -Grained Visual Recognition, IEEE Transactions on Cybernetics 49 (5), 1791-1802, 2019

work page 2019

[1] [1]

However, the drawback of large accuracy dropping also restrict s its application

Abstract 1-bit deep neural networks (DNNs), of which both the activations and weights are binarized, are attracting more and more attention due to their high computational efficiency and low memory requirement. However, the drawback of large accuracy dropping also restrict s its application. In this paper, we propose a novel Targeted Acceleration and Comp...

work page

[2] [2]

datasets, and experimental results show that the proposed TAC can significantly improve the accuracy of 1 -bit deep neural networks and outperforms the state-of-the-art by more than 6 percentage points

work page

[3] [3]

Under the circumstances, the application of deep neural networks on mobile devices is gaining more and more attention

Introduction Recently, deep neural networks (DNNs) have widely applied in computer vision tasks, such as image classification [17] [18], object detection [24] and visual recognition [25][26][27][28][29][30][31][32][33] so on. Under the circumstances, the application of deep neural networks on mobile devices is gaining more and more attention. However, gre...

work page

[4] [4]

Network pruning: Network pruning measures the redundancy on network structures like connections, neurons and filters with different rules, and removes unimportant parts

Related Work In this section, we mainly review the methods related to pruning and quantization. Network pruning: Network pruning measures the redundancy on network structures like connections, neurons and filters with different rules, and removes unimportant parts. In [1], all connections with weights below a certain threshold are removed from the pre -tr...

work page

[5] [5]

Method In the section, we introduce our Targeted Acceleration and Compression (TAC) framework in detail. Considering the binarization of fully connected layers is not very necessary and can cause the accuracy loss , our TAC optimizes convolutional lay ers and fully connected layers individually and regards the process as two steps: accelerating convolutio...

work page

[6] [6]

5.1 Datasets and Implement Details In this section, we briefly introduce the datasets, network structures and experiment settings in our experiments

Experiments We performed extensive experiments on CIFAR-10, CIFAR-100 and ImageNet datasets with VGG-9 and AlexNet architectures. 5.1 Datasets and Implement Details In this section, we briefly introduce the datasets, network structures and experiment settings in our experiments. CIFAR-10/100 [22]: This dataset consists of a training set of 50,000 and a te...

work page

[7] [7]

For the hyper-parameters, we set the pruning rate at iterative steps as {0.2, 0

to binarize both the activations and weights of convolutional layers, and network pruning and quantization in [1] [4] to compress fully connected layers. For the hyper-parameters, we set the pruning rate at iterative steps as {0.2, 0. 4, 0.6, 0.7 , 0.75 } and the bit width of quantization as 4. For simplicity, we denote the networks trained with the TAC f...

work page

[8] [8]

For the convolutional layers, both the activations and weights are binarized to speed up the computations

Conclusion In this paper, we have proposed a novel framework named Targeted Acceleration and Compression (TAC), where the convolutional and fully connected layer are separated and learned individually. For the convolutional layers, both the activations and weights are binarized to speed up the computations. For the fully connected layers, network pruning ...

work page

[9] [9]

Learning both weights and connections for efficie nt neural network[C]//Advances in neural information processing systems

Han S, Pool J, Tran J, et al. Learning both weights and connections for efficie nt neural network[C]//Advances in neural information processing systems. 2015: 1135-1143

work page 2015

[10] [10]

Learning efficient convolutional networks through network slimming[C]//Computer Vision (ICCV), 2017 IEEE International Conference on

Liu Z, Li J, Shen Z, et al. Learning efficient convolutional networks through network slimming[C]//Computer Vision (ICCV), 2017 IEEE International Conference on. IEEE, 2017: 2755-2763

work page 2017

[11] [11]

Pruning Filters for Efficient ConvNets

Li H, Kadav A, Durdanovic I, et al. Pruning filters for efficient convnets[J]. arXiv preprint arXiv:1608.08710, 2016

work page internal anchor Pith review Pith/arXiv arXiv 2016

[12] [12]

Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding

Han S, Mao H, Dally W J. Deep compression: Compressing deep neural networks with pruning, trained quantization and huffman coding[J]. arXiv preprint arXiv:1510.00149, 2015

work page internal anchor Pith review Pith/arXiv arXiv 2015

[13] [13]

Accelerating Convolutional Networks via Global & Dynamic Filter Pruning[C]//IJCAI

Lin S, Ji R, Li Y , et al. Accelerating Convolutional Networks via Global & Dynamic Filter Pruning[C]//IJCAI. 2018: 2425-2432

work page 2018

[14] [14]

Towards the Limit of Network Quantization

Choi Y , El-Khamy M, Lee J. Towards the limit of networ k quantization[J]. arXiv preprint arXiv:1612.01543, 2016

work page internal anchor Pith review Pith/arXiv arXiv 2016

[15] [15]

Incremental Network Quantization: Towards Lossless CNNs with Low-Precision Weights

Zhou A, Yao A, Guo Y , et al. Incremental network quantization: Towards lossless cnns with low- precision weights[J]. arXiv preprint arXiv:1702.03044, 2017

work page internal anchor Pith review Pith/arXiv arXiv 2017

[16] [16]

Ternary Weight Networks[J]

Li F, Zhang B, Liu B. Ternary Weight Networks[J]. 2016

work page 2016

[17] [17]

Trained Ternary Quantization[J]

Zhu C, Han S, Mao H, et al. Trained Ternary Quantization[J]. 2016

work page 2016

[18] [18]

Binaryconnect: Training deep neural networks with binary weights during propagations[C]//Advances in neural information processing systems

Courbariaux M, Bengio Y , David J P. Binaryconnect: Training deep neural networks with binary weights during propagations[C]//Advances in neural information processing systems. 2015: 3123- 3131

work page 2015

[19] [19]

BinaryNet: Training Deep Neural Networks with Weights and Activations Constrained to +1 or -1[J]

Courbariaux M, Bengio Y . BinaryNet: Training Deep Neural Networks with Weights and Activations Constrained to +1 or -1[J]. 2016

work page 2016

[20] [20]

XNOR-Net: ImageNet Classification Using Binary Convolutional Neural Networks[J]

Rastegari M, Ordonez V , Redmon J, et al. XNOR-Net: ImageNet Classification Using Binary Convolutional Neural Networks[J]. 2016:525-542

work page 2016

[21] [21]

Deep Learning with Low Precision by Half-wave Gaussian Quantization

Cai Z, He X, Sun J, et al. Deep learning with low precision by half -wave gaussian quantization[J]. arXiv preprint arXiv:1702.00953, 2017

work page internal anchor Pith review Pith/arXiv arXiv 2017

[22] [22]

From Hashing to CNNs: Training BinaryWeight Networks via Hashing[J]

Hu Q, Wang P, Cheng J. From Hashing to CNNs: Training BinaryWeight Networks via Hashing[J]. 2018

work page 2018

[23] [23]

Towards Accurate Binary Convolutional Neural Network[J]

Lin X, Zhao C, Pan W. Towards Accurate Binary Convolutional Neural Network[J]. 2017

work page 2017

[24] [24]

Two -Step Quantization for Low -bit Neural Networks[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition

Wang P, Hu Q, Zhang Y , et al. Two -Step Quantization for Low -bit Neural Networks[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2018: 4376-4384

work page 2018

[25] [25]

Imagenet classification with deep convolutional neural networks[C]//Advances in neural information processing systems

Krizhevsky A, Sutskever I, Hinton G E. Imagenet classification with deep convolutional neural networks[C]//Advances in neural information processing systems. 2012: 1097-1105

work page 2012

[26] [26]

Very Deep Convolutional Networks for Large-Scale Image Recognition

Simonyan K, Zisserman A. Very deep convolutional networks for large -scale image recognition[J]. arXiv preprint arXiv:1409.1556, 2014

work page internal anchor Pith review Pith/arXiv arXiv 2014

[27] [27]

Imagenet: A large -scale hierarchical image database[C]//Computer Vision and Pattern Recognition, 2009

Deng J, Dong W, Socher R, et al. Imagenet: A large -scale hierarchical image database[C]//Computer Vision and Pattern Recognition, 2009. CVPR 2009. IEEE Conference on. Ieee, 2009: 248-255

work page 2009

[28] [28]

Bi-Real Net: Enhancing the Performance of 1-bit CNNs With Improved Representational Capability and Advanced Training Algorithm

Liu Z, Wu B, Luo W, et al. Bi -Real Net: Enhancing the Performance of 1 -bit CNNs With Improved Representational Capability and Advanced Training Algorithm[J]. arXiv preprint arXiv:1808.00278, 2018

work page internal anchor Pith review Pith/arXiv arXiv 2018

[29] [29]

Adam: A Method for Stochastic Optimization

Kingma D P, Ba J. Adam: A method for stochastic optimization[J]. arXiv preprint arXiv:1412.6980, 2014

work page internal anchor Pith review Pith/arXiv arXiv 2014

[30] [30]

Learning multiple layers of features from tiny images[R]

Krizhevsky A, Hinton G. Learning multiple layers of features from tiny images[R]. Technical report, University of Toronto, 2009

work page 2009

[31] [31]

Imagenet large scale visual recognition challenge[J]

Russakovsky O, Deng J, Su H, et al. Imagenet large scale visual recognition challenge[J]. International Journal of Computer Vision, 2015, 115(3): 211-252

work page 2015

[32] [32]

Faster r -cnn: Towards real-time object detection with region proposal networks[C]//Advances in neural information processing systems

Ren S, He K, Girshick R, et al. Faster r -cnn: Towards real-time object detection with region proposal networks[C]//Advances in neural information processing systems. 2015: 91-99

work page 2015

[33] [33]

Wang et al., Iterative Views Agreement: An Iterative Low -Rank based Structured Optimization Method to Multi-View Spectral Clustering

Y . Wang et al., Iterative Views Agreement: An Iterative Low -Rank based Structured Optimization Method to Multi-View Spectral Clustering. IJCAI 2016:2153-2159

work page 2016

[34] [34]

Wang et al., Robust Subspace Clustering for Multi -view Data by Exploiting Correlation Consensus

Y . Wang et al., Robust Subspace Clustering for Multi -view Data by Exploiting Correlation Consensus. IEEE Transactions on Image Processing, 24(11):3939-3949, 2015

work page 2015

[35] [35]

Wang et al., Multiview Spectral Clustering via Structured Low-Rank Matrix Factorization

Y . Wang et al., Multiview Spectral Clustering via Structured Low-Rank Matrix Factorization. IEEE Transactions on Neural Networks and Learning Systems, 29(10):4833-4843, 2018

work page 2018

[36] [36]

L. Wu, Y . Wang and L. Shao. Cycle -Consistent Deep Generative Hashing for Cross -Modal Retrieval. IEEE Transactions on Image Processing,28(4):1602-1612, 2019

work page 2019

[37] [37]

Wang et al., Effective Multi -Query Expansions: Collaborative Deep Networks for Robust Landmark Retrieval, IEEE Transactions on Image Processing, 26 (3), 1393-1404, 2017

Y . Wang et al., Effective Multi -Query Expansions: Collaborative Deep Networks for Robust Landmark Retrieval, IEEE Transactions on Image Processing, 26 (3), 1393-1404, 2017

work page 2017

[38] [38]

Wu et al., Deep Adaptive Feature Embedding with Loc al Sample Distributions for Person Re - identification

L. Wu et al., Deep Adaptive Feature Embedding with Loc al Sample Distributions for Person Re - identification. Pattern Recognition, 73:275-288, 2018

work page 2018

[39] [39]

Wu et al., What -and-Where to Match: Deep Spatially Multiplicative Integration Networks for Person Re-identification

L. Wu et al., What -and-Where to Match: Deep Spatially Multiplicative Integration Networks for Person Re-identification. Pattern Recognition, 76:727-738, 2018

work page 2018

[40] [40]

Wu, Y Wang, L Shao, M Wang，3-D PersonVLAD: Learning Deep Global Representations for Video-Based Person Reidentification

L. Wu, Y Wang, L Shao, M Wang，3-D PersonVLAD: Learning Deep Global Representations for Video-Based Person Reidentification. IEEE Transactions on Neural Networks and Learning Systems, 2019

work page 2019

[41] [41]

L Wu, Y Wang, X Li, J Gao, Deep Attention-based Spatially Recursive Networks for Fine -Grained Visual Recognition, IEEE Transactions on Cybernetics 49 (5), 1791-1802, 2019

work page 2019