Gradual Tuning: a better way of Fine Tuning the parameters of a Deep Neural Network

Alexander V. Terekhov; Guglielmo Montone; J. Kevin O'Regan

arxiv: 1711.10177 · v1 · pith:I6Y5FPHCnew · submitted 2017-11-28 · 💻 cs.AI · cs.NE

Gradual Tuning: a better way of Fine Tuning the parameters of a Deep Neural Network

Guglielmo Montone , J. Kevin O'Regan , Alexander V. Terekhov This is my paper

classification 💻 cs.AI cs.NE

keywords tuningnetworktaskdifferentgradualparametersbetterfine

0 comments

read the original abstract

In this paper we present an alternative strategy for fine-tuning the parameters of a network. We named the technique Gradual Tuning. Once trained on a first task, the network is fine-tuned on a second task by modifying a progressively larger set of the network's parameters. We test Gradual Tuning on different transfer learning tasks, using networks of different sizes trained with different regularization techniques. The result shows that compared to the usual fine tuning, our approach significantly reduces catastrophic forgetting of the initial task, while still retaining comparable if not better performance on the new task.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Layout-Aware Representation Learning for Open-Set ID Fraud Discovery
cs.CV 2026-04 unverdicted novelty 5.0

Adapting DINOv3 via SimMIM and composite metric learning on U.S. IDs yields 99.83% Canadian layout accuracy and surfaces 276 fraud cases (222 missed by prior detectors) in 20k Canadian IDs via embedding analysis.