arxiv: 1801.01687 · v1 · pith:CFEXDO7Inew · submitted 2018-01-05 · 💻 cs.CV

Accelerated Training for Massive Classification via Dynamic Class Selection

Xingcheng Zhang , Lei Yang , Junjie Yan , Dahua Lin This is my paper

classification 💻 cs.CV

keywords classesclassificationcostmassivemethodnumberclassdemand

0 comments

read the original abstract

Massive classification, a classification task defined over a vast number of classes (hundreds of thousands or even millions), has become an essential part of many real-world systems, such as face recognition. Existing methods, including the deep networks that achieved remarkable success in recent years, were mostly devised for problems with a moderate number of classes. They would meet with substantial difficulties, e.g. excessive memory demand and computational cost, when applied to massive problems. We present a new method to tackle this problem. This method can efficiently and accurately identify a small number of "active classes" for each mini-batch, based on a set of dynamic class hierarchies constructed on the fly. We also develop an adaptive allocation scheme thereon, which leads to a better tradeoff between performance and cost. On several large-scale benchmarks, our method significantly reduces the training cost and memory demand, while maintaining competitive performance.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

EarthSight: A Distributed Framework for Low-Latency Satellite Intelligence
cs.LG 2025-11 unverdicted novelty 6.0

EarthSight reduces average compute time per image by 1.9x and 90th-percentile end-to-end latency from 51 to 21 minutes by distributing inference decisions between orbit and ground with shared backbones and early rejec...