On-chip learning in a conventional silicon MOSFET based Analog Hardware Neural Network
Pith reviewed 2026-05-25 11:49 UTC · model grok-4.3
The pith
Conventional MOSFET transistors can act as synapses for on-chip learning in an analog hardware neural network.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
An analog hardware neural network using conventional MOSFETs as synapses, together with analog peripheral circuits for neuron activation and synaptic weight updates, supports on-chip learning; SPICE simulation of the full system produces high training and test accuracy on Fisher's Iris dataset and energy-speed performance comparable to prior NVM-device implementations.
What carries the argument
Single-transistor MOSFET synapse whose current-voltage characteristic is modeled in SPICE and embedded in a crossbar array with analog circuits that compute weight updates during on-chip learning.
If this is right
- On-chip learning becomes possible using only standard CMOS transistors that can be made in merchant foundries.
- The system avoids the large voltage pulses required by floating-gate transistors, improving energy efficiency.
- Training and test accuracy on the Iris dataset reaches levels previously shown with NVM-based analog networks.
- Speed and energy consumption during on-chip learning remain comparable to earlier RRAM, PCM, and spintronic implementations.
Where Pith is reading between the lines
- If the MOSFET synapse model holds in silicon, the same crossbar architecture could be scaled to larger networks without requiring new materials or processes.
- Real-device mismatch or temperature drift could be tested by adding Monte-Carlo variation to the SPICE netlist and checking whether learning still converges.
- The approach opens a route to integrate analog on-chip learning directly with conventional digital CMOS logic on the same chip.
Load-bearing premise
SPICE models of the MOSFET synapses and analog weight-update circuits accurately represent real-device behavior without being disrupted by noise, mismatch, or fabrication variations.
What would settle it
Fabricate the MOSFET crossbar and peripheral circuits in silicon and measure whether on-chip training on the Iris dataset still reaches the reported training and test accuracies.
Figures
read the original abstract
On-chip learning in a crossbar array based analog hardware Neural Network (NN) has been shown to have major advantages in terms of speed and energy compared to training NN on a traditional computer. However analog hardware NN proposals and implementations thus far have mostly involved Non Volatile Memory (NVM) devices like Resistive Random Access Memory (RRAM), Phase Change Memory (PCM), spintronic devices or floating gate transistors as synapses. Fabricating systems based on RRAM, PCM or spintronic devices need in-house laboratory facilities and cannot be done through merchant foundries, unlike conventional silicon based CMOS chips. Floating gate transistors need large voltage pulses for weight update, making on-chip learning in such systems energy inefficient. This paper proposes and implements through SPICE simulations on-chip learning in analog hardware NN using only conventional silicon based MOSFETs (without any floating gate) as synapses since they are easy to fabricate. We first model the synaptic characteristic of our single transistor synapse using SPICE circuit simulator and benchmark it against experimentally obtained current-voltage characteristics of a transistor. Next we design a Fully Connected Neural Network (FCNN) crossbar array using such transistor synapses. We also design analog peripheral circuits for neuron and synaptic weight update calculation, needed for on-chip learning, again using conventional transistors. Simulating the entire system on SPICE simulator, we obtain high training and test accuracy on the standard Fisher's Iris dataset, widely used in machine learning. We also compare the speed and energy performance of our transistor based implementation of analog hardware NN with some previous implementations of NN with NVM devices and show comparable performance with respect to on-chip learning.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper proposes using conventional silicon MOSFETs (without floating gates) as synapses in a crossbar-based analog hardware neural network to enable on-chip learning. A single-transistor synapse is modeled in SPICE and benchmarked against experimental I-V curves; a fully connected NN crossbar and analog peripheral circuits for neurons and weight updates (gradient calculation and pulsing) are then designed using standard transistors. Full-system SPICE simulations are reported to achieve high training and test accuracy on Fisher's Iris dataset, with speed and energy metrics claimed comparable to prior NVM-based analog NN implementations.
Significance. If the idealized SPICE results prove representative of fabricated silicon, the approach would enable on-chip learning in analog hardware using merchant-foundry CMOS processes, sidestepping specialized NVM fabrication and high-voltage requirements of floating-gate devices while retaining potential speed and energy advantages over digital training.
major comments (2)
- [full-system SPICE simulation and results] The full-system SPICE simulation results (described after the peripheral-circuit design) rest on ideal device models for the weight-update path without Monte-Carlo mismatch, 1/f noise, or process-variation analysis; only the isolated single-transistor synapse I-V curve is experimentally benchmarked. This directly undermines the central claim that the analog on-chip learning loop converges reliably, as any systematic offset in gradient computation or pulse generation would break the reported Iris accuracy.
- [performance comparison] The energy and speed comparison to NVM implementations (final section) is performed under the same idealized SPICE assumptions used for the learning loop; no sensitivity analysis quantifies how analog non-idealities would degrade the claimed accuracy or efficiency, rendering the comparability claim load-bearing yet untested.
minor comments (2)
- [simulation results] State the precise network topology (number of layers, hidden neurons, output classes) and the exact training/test split and accuracy numbers for the Iris experiments rather than the qualitative phrase 'high training and test accuracy'.
- [analog peripheral circuits] Clarify whether the peripheral circuits for weight update are fully analog or contain any digital control logic, and provide schematic or block-diagram references for the gradient-calculation block.
Simulated Author's Rebuttal
We thank the referee for the constructive comments on our simulation-based study. We address each major comment below.
read point-by-point responses
-
Referee: [full-system SPICE simulation and results] The full-system SPICE simulation results (described after the peripheral-circuit design) rest on ideal device models for the weight-update path without Monte-Carlo mismatch, 1/f noise, or process-variation analysis; only the isolated single-transistor synapse I-V curve is experimentally benchmarked. This directly undermines the central claim that the analog on-chip learning loop converges reliably, as any systematic offset in gradient computation or pulse generation would break the reported Iris accuracy.
Authors: Our manuscript is a SPICE simulation study demonstrating on-chip learning feasibility with conventional MOSFET synapses. The single-transistor synapse is benchmarked against experimental I-V data, while the full crossbar and peripheral circuits use standard transistor models. The reported Iris accuracy shows that the learning loop converges under these modeled conditions. We do not claim experimental validation of the complete system or robustness to mismatch/noise; such analysis would require fabricated silicon and is outside the paper's scope. We can add a brief discussion of these limitations. revision: partial
-
Referee: [performance comparison] The energy and speed comparison to NVM implementations (final section) is performed under the same idealized SPICE assumptions used for the learning loop; no sensitivity analysis quantifies how analog non-idealities would degrade the claimed accuracy or efficiency, rendering the comparability claim load-bearing yet untested.
Authors: The comparisons use performance figures extracted directly from our SPICE simulations and the numbers reported in the cited NVM papers. This is consistent with the idealized framework of the study. We agree the claim would benefit from explicit qualification and will revise the final section to state that the metrics are simulation-derived under ideal conditions. revision: yes
Circularity Check
No significant circularity; results are direct simulation outputs.
full rationale
The paper derives its reported training/test accuracies and energy/speed metrics solely from end-to-end SPICE simulations of a MOSFET crossbar plus peripheral circuits on the Iris dataset. No equations, fitted parameters, or self-citations are shown that reduce these outputs to inputs by construction (e.g., no synaptic I-V model is defined from the target accuracy, and no uniqueness theorem or ansatz is imported from prior author work). The derivation chain is therefore self-contained against external benchmarks.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption SPICE models of conventional MOSFETs accurately represent synaptic conductance modulation for on-chip learning
Reference graph
Works this paper leans on
- [1]
- [2]
-
[3]
C. D. Schuman et al. , arxiv. 1705.06963v1
work page internal anchor Pith review Pith/arXiv arXiv
-
[4]
Boahen, Computing in Science & Engi- neering, vol
K. Boahen, Computing in Science & Engi- neering, vol. 19, no. 2, 2017
work page 2017
- [5]
- [6]
-
[7]
A. Sengupta et al. , IEEE Transactions on Circuits and Systems- I, vol. 63, no. 12, 2016
work page 2016
-
[8]
P. A. Merolla et al. , Science vol. 345, no. 6197, pp. 668-697, 2014
work page 2014
-
[9]
S.B. Furber et al. , Proceedings of the IEEE, vol. 102, no. 5, pp. 652-665, 2014
work page 2014
- [10]
- [11]
-
[12]
G.W.Burr et al. in Proc. IEEE Interna- tional Electron Devices Meeting (IEDM), pp. 29.5.129.5.4, 2014
work page 2014
-
[13]
D. Ielmini and H. S. Philip Wong, Nature Electronics, vol. 1, no. 111101, pp. 333-343, 2018
work page 2018
-
[14]
Suri ed., Cognitive Systems Mono- graphs, Springer, 2017
M. Suri ed., Cognitive Systems Mono- graphs, Springer, 2017
work page 2017
- [15]
- [16]
- [17]
-
[18]
M. Suri et al. , Proc. IEEE International Electron Devices Meeting (IEDM), pp. 4.4.1- 4.4.4, 2011
work page 2011
- [19]
-
[20]
Diorio et al., IEEE Transactions on Elec- tron Devices, vol
C. Diorio et al., IEEE Transactions on Elec- tron Devices, vol. 43, no. 11, 1996
work page 1996
-
[21]
B. W. Lee et al., IEEE Transactions on Cir- cuits and Systems, vol. 38, no. 6, 1991
work page 1991
-
[22]
Y. Horio et al. IEEE International Sympo- sium on Circuits and Systems, 1990 (DOI: 10.1109/ISCAS.1990.112638)
- [23]
-
[24]
H. Choi et al. , IEEE Transactions on Elec- tron Devices, vol. 65, no. 1, 2018
work page 2018
-
[25]
G. Burr et al. , IEEE Transactions on Elec- tron Devices, vol. 62, no. 11, 2015
work page 2015
-
[26]
P. Chen et al. IEEE International Electron Device Meeting, 2017 (DOI: 10.1109/IEDM.2017.8268337)
-
[27]
T. Gokmen and Y. Vlasov, Frontiers in Neu- roscience vol. 10, no. 333, 2016
work page 2016
-
[28]
A. Hirohata et al. , IEEE Transactions on Magnetics, vol. 51, no. 10, 2015
work page 2015
-
[29]
UCI repository of ma- chine learning databases
D. J. Newman et al., “UCI repository of ma- chine learning databases”, Department of In- formation and Computer Science, University of California Irvine, CA, 1998
work page 1998
- [30]
-
[31]
A. Sengupta et al. , Proc. IEEE Biomedical Circuits and Systems Conference (BioCAS), no. 7833852, 2016
work page 2016
-
[32]
C. S. Thakur et al., IEEE Trans. on Circuits and Systems- I, vol. 63, no. 2, 2016
work page 2016
-
[33]
A. Tripathi et al. , Proc.in IEEE Interna- tional Symposium on Circuits and Systems (ISCAS), 2019
work page 2019
-
[34]
P. U. Diehl and M. Cook, Front. Comput. Neurosci. vol. 9, no. 99, 2015
work page 2015
-
[35]
G. Q. Bi and M. M. Poo, J. Neurosci. 18, pp. 1046410472, 1998
work page 1998
- [36]
-
[37]
P. Narayanan et al. IBM Journal of Re- search and Development, vol. 61, no. 4/5, 2017. 17
work page 2017
-
[38]
A. Sengupta, A. Banerjee, and K. Roy, Phys. Rev. Appl. vol. 6, no. 064003, 2016
work page 2016
-
[39]
T. Bhattacharya et al. Proc. in IEEE In- ternational Joint Conference on Neural Net- works (IJCNN), no. 18181462 (2018)
work page 2018
-
[40]
MOS Transistor in Modern Semi- conductor Devices for Integrated Circuits
C. Hu, “MOS Transistor in Modern Semi- conductor Devices for Integrated Circuits”, Pearson, 2010
work page 2010
- [41]
-
[42]
B. D. Yammenavar et al. , International Journal of Artificial Intelligence and Appli- cations (IJAIA), vol. 2, no. 3, 2011
work page 2011
-
[43]
G. Khodabandehloo et al. ,IEEE Trans. on VLSI Systems, vol. 20, no. 4, 2012
work page 2012
-
[44]
Neamen et al., Electronic Circuits- Anal- ysis and Design, McGraw Hill, 2006
D. Neamen et al., Electronic Circuits- Anal- ysis and Design, McGraw Hill, 2006
work page 2006
-
[45]
A Comprehensive Study of the Howland Current Pump
AN-1515 “A Comprehensive Study of the Howland Current Pump” (PDF) (PDF). Texas Instruments, Inc. 2013
work page 2013
- [46]
-
[47]
Analog CMOS-based Resistive Processing Unit for Deep Neural Network Training
S. Kim et al. ,arXiv:1706.06620, 2017
work page internal anchor Pith review Pith/arXiv arXiv 2017
-
[48]
A. Biswas et al. , arXiv:1612.02233, 2016
work page internal anchor Pith review Pith/arXiv arXiv 2016
- [49]
- [50]
- [51]
- [52]
-
[53]
S. Emori et al., Spin Hall torque magnetom- etry of Dzyaloshinskii do- main walls, Physi- cal Review B, vol. 90, no. 184427, 2014
work page 2014
- [54]
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.