Hardware-Software Co-Design of Scalable, Energy-Efficient Analog Recurrent Computations

Arthur Fyon; Damien Ernst; Guillaume Drion; Jean-Michel Redout\'e; Julien Brandoit; Loris Mendolia

REVIEW 2 major objections 2 minor 2 cited by

Reviewed by Pith at T0; open to challenge.

T0 means a machine referee read the full paper against a public rubric. The mark states how deep the mechanical check went, never who wrote it. the ladder, T0–T4 →

Challenge this review Re-run · record.json Download PDF Read on arXiv ↗

T0 review · grok-4.3

Bistable Memory Recurrent Units enable ultra-low power analog recurrent neural networks via direct parameter-to-circuit mapping and noise suppression at cell boundaries

2026-06-30 22:18 UTC pith:N76ISPAO

load-bearing objection The paper maps BMRUs to current-mode analog circuits with direct parameter correspondence and 20x boundary noise suppression in 180 nm sims, but the first-quadrant fixed-threshold reformulation's effect on expressivity is asserted without shown verification. the 2 major comments →

arxiv 2605.15216 v3 pith:N76ISPAO submitted 2026-05-12 cs.AR cs.LG

Hardware-Software Co-Design of Scalable, Energy-Efficient Analog Recurrent Computations

Arthur Fyon , Julien Brandoit , Loris Mendolia , Damien Ernst , Jean-Michel Redout\'e , Guillaume Drion This is my paper

classification cs.AR cs.LG

keywords analog circuitsrecurrent neural networkslow-power hardwareBistable Memory Recurrent Unitshardware-software co-designnoise suppressionCMOS implementationkeyword spotting

verification ladder T0 review T1 audit T2 compute T3 formal T4 reserved

The pith

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper establishes that Bistable Memory Recurrent Units, RNNs with discrete outputs and hysteretic dynamics, can be mapped to current-mode analog circuits with one-to-one correspondence between learned parameters and circuit elements. This mapping is enabled by reformulating the units for first-quadrant operation with fixed thresholds. Discrete outputs suppress analog noise by at least twentyfold at each boundary, eliminating the accumulation that has blocked recurrent analog designs. Transistor simulations in 180 nm CMOS confirm the software model predicts circuit behavior accurately enough to serve as a simulator. Power analyses show recurrence adds only linear cost while feedforward layers dominate quadratically, supporting sub-microwatt keyword spotting at the RNN core.

Core claim

Bistable Memory Recurrent Units admit an ultra-low power current-mode analog implementation which establishes a one-to-one correspondence between each learned parameter and a circuit element; the discrete outputs suppress analog noise by at least 20-fold at each cell boundary, breaking the noise accumulation that prevents analog recurrence. The reformulation for first-quadrant operation with fixed thresholds preserves expressivity and trainability while enabling the direct correspondence.

What carries the argument

Bistable Memory Recurrent Units reformulated for first-quadrant operation with fixed thresholds, which carry the one-to-one parameter-to-circuit-element mapping and enable noise suppression via discrete outputs

Load-bearing premise

Reformulating BMRUs for first-quadrant operation with fixed thresholds preserves expressivity and trainability while enabling direct hardware mapping

What would settle it

A fabricated 180 nm CMOS circuit implementing recurrent BMRUs that shows noise suppression below 20-fold at cell boundaries or deviates from software predictions in recurrent mode would falsify the claim

Watch this falsifier — get emailed when new claim-graph text bears on it.

If this is right

Power cost of adding recurrence scales linearly with state dimension
Feedforward layers dominate total power and scale quadratically
Recurrence is added at linear marginal cost relative to the feedforward backbone
End-to-end keyword spotting achieves sub-microwatt inference at the RNN core
Software model serves as high-fidelity low-cost simulator of physical hardware

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The linear marginal cost of recurrence could allow larger state dimensions in always-on sensors without proportional power growth
The high-fidelity software simulator could speed iteration on other analog recurrent designs without repeated full transistor simulations
Sub-microwatt recurrent cores may extend to continuous biomedical monitoring tasks where temporal dynamics matter
The approach of discrete outputs to break noise accumulation might generalize to other hysteretic or threshold-based analog neural units

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit.

Desk Editor's Note

The paper maps BMRUs to current-mode analog circuits with direct parameter correspondence and 20x boundary noise suppression in 180 nm sims, but the first-quadrant fixed-threshold reformulation's effect on expressivity is asserted without shown verification.

read the letter

The core contribution is a hardware-software co-design that turns BMRUs into analog current-mode cells. Each learned parameter maps one-to-one to a circuit element, and the discrete outputs cut noise accumulation at cell boundaries. Transistor-level simulations match the software model closely enough that the model can stand in for hardware behavior at low cost. They also show recurrence adds power linearly while the feedforward backbone scales quadratically, so the marginal cost of adding temporal processing stays modest.

That scaling result and the simulation fidelity are the parts that hold up cleanly from the abstract. The linear-vs-quadratic distinction is a practical point for always-on sensor work, and the noise-suppression claim follows directly from the discrete output choice.

The soft spot is the reformulation step. BMRUs are adjusted to first-quadrant currents and fixed thresholds to make the hardware mapping possible. The abstract states this keeps expressivity and trainability intact, but supplies no derivation or quantitative check on reachable state transitions or learnable temporal patterns. If the change narrows the model class, the claimed advantage over earlier analog recurrent attempts shrinks. The stress-test note on this point matches what the abstract shows.

This is for groups already working on sub-microwatt edge devices that need temporal processing, not general RNN theory. Readers who build analog accelerators or co-design flows will get concrete circuit details and scaling numbers to examine. It deserves a serious referee because the simulation evidence is presented as independent and the power claim is falsifiable in silicon, even though the expressivity preservation needs tighter support.

Referee Report

2 major / 2 minor

Summary. The paper claims that Bistable Memory Recurrent Units (BMRUs) can be reformulated for first-quadrant current-mode operation with fixed thresholds to enable a direct one-to-one mapping from learned parameters to analog circuit elements in 180 nm CMOS. This mapping, combined with discrete hysteretic outputs, is asserted to suppress analog noise by at least 20-fold at cell boundaries, overcoming noise accumulation in recurrent analog computation. Transistor-level simulations demonstrate near-perfect agreement with the software model, which is then used to show linear marginal power cost for adding recurrence (versus quadratic for feedforward layers) and sub-microwatt end-to-end keyword spotting.

Significance. If the reformulation preserves expressivity and trainability without loss, the result would be significant for always-on edge AI: it supplies a concrete hardware-software co-design path to analog recurrence at ultra-low power, with explicit parameter-to-device correspondence and quantitative noise/power scaling data. The near-perfect simulation fidelity and the reproducible large-scale noise/power analyses are explicit strengths that would allow the software model to serve as a low-cost proxy for hardware exploration.

major comments (2)

[Abstract and BMRU reformulation section] Abstract and the BMRU reformulation section: the claim that 'reformulation ... with fixed thresholds [preserves] expressivity and trainability' is stated without quantitative support. No comparison is provided of reachable state-transition functions, training convergence, or benchmark accuracy between the original variable-threshold BMRU and the fixed-threshold first-quadrant version. Because the one-to-one parameter-circuit mapping, the 20-fold noise suppression argument, and the linear power-scaling claim all rest on this equivalence, the absence of such verification is load-bearing.
[Simulation results section] Noise-suppression analysis (simulation results section): the reported 20-fold reduction is obtained from post-reformulation simulations; the text does not derive or bound the suppression factor from the fixed-threshold hysteretic dynamics alone, leaving open whether the factor is an intrinsic property of the reformulated BMRU or an artifact of the particular circuit sizing and input statistics used.

minor comments (2)

[Circuit design section] Notation for currents and thresholds should be unified between the software model equations and the circuit schematic; inconsistent symbols make the claimed one-to-one correspondence harder to verify by inspection.
[Application results] The keyword-spotting benchmark description omits the exact RNN topology (number of BMRU layers, hidden dimension) used for the sub-microwatt claim; adding this would allow direct reproduction of the power numbers.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive feedback and for recognizing the potential significance of the BMRU hardware-software co-design. We respond to each major comment below, agreeing that additional quantitative support will strengthen the manuscript.

read point-by-point responses

Referee: [Abstract and BMRU reformulation section] Abstract and the BMRU reformulation section: the claim that 'reformulation ... with fixed thresholds [preserves] expressivity and trainability' is stated without quantitative support. No comparison is provided of reachable state-transition functions, training convergence, or benchmark accuracy between the original variable-threshold BMRU and the fixed-threshold first-quadrant version. Because the one-to-one parameter-circuit mapping, the 20-fold noise suppression argument, and the linear power-scaling claim all rest on this equivalence, the absence of such verification is load-bearing.

Authors: We agree that explicit quantitative comparisons are needed to substantiate the preservation of expressivity and trainability under the fixed-threshold reformulation. The reformulation rescales weights and biases to achieve equivalent first-quadrant dynamics while fixing thresholds for circuit mapping. In the revised version we will add a new subsection with side-by-side training convergence curves on standard RNN benchmarks, final test accuracies (including keyword spotting), and a characterization of reachable state-transition functions showing that the fixed-threshold model spans an equivalent functional class via parameter adjustment. revision: yes
Referee: [Simulation results section] Noise-suppression analysis (simulation results section): the reported 20-fold reduction is obtained from post-reformulation simulations; the text does not derive or bound the suppression factor from the fixed-threshold hysteretic dynamics alone, leaving open whether the factor is an intrinsic property of the reformulated BMRU or an artifact of the particular circuit sizing and input statistics used.

Authors: The 20-fold suppression is measured in transistor-level simulations of the reformulated circuit. The underlying mechanism is the hysteretic snap to discrete stable points, which quantizes noise at each recurrence step. We acknowledge the manuscript lacks an explicit analytical bound independent of sizing. In revision we will derive a lower bound on the suppression factor from the fixed-threshold dynamics alone: for additive Gaussian noise of std. dev. σ and hysteresis separation Δ, the effective noise variance after the threshold is reduced by a factor of at least Δ/(2σ) with high probability, showing the effect is intrinsic to the bistable dynamics (with the concrete 20× value realized by our chosen Δ/σ ratio). revision: yes

Circularity Check

0 steps flagged

No significant circularity; derivation relies on independent simulation validation

full rationale

The paper states a reformulation of BMRUs for first-quadrant fixed-threshold operation and asserts that this preserves expressivity and trainability, but provides no equations reducing any claimed performance metric (noise suppression, power scaling, or one-to-one mapping fidelity) directly to fitted parameters or prior results by construction. Transistor-level simulations in 180 nm CMOS are presented as an external check showing agreement with the software model, and power analyses are derived from those simulations rather than tautologically from the reformulation inputs. No load-bearing self-citations, uniqueness theorems, or ansatzes are exhibited in the text that collapse the central claims to their own definitions.

Axiom & Free-Parameter Ledger

0 free parameters · 2 axioms · 0 invented entities

The central claim rests on the domain assumption that BMRUs possess discrete outputs and hysteretic dynamics that can be directly realized in current-mode analog circuits, plus the modeling choice of fixed thresholds after reformulation; no free parameters or invented entities are explicitly introduced in the abstract.

axioms (2)

domain assumption BMRUs constitute a class of RNNs with discrete-valued outputs and hysteretic dynamics that admit direct analog mapping
Invoked as the foundation for the circuit design and noise-suppression argument.
ad hoc to paper Reformulation to first-quadrant operation with fixed thresholds preserves expressivity and trainability
Required to establish the one-to-one parameter-circuit correspondence.

pith-pipeline@v0.9.1-grok · 5842 in / 1447 out tokens · 33228 ms · 2026-06-30T22:18:58.237621+00:00 · methodology

0 comments

read the original abstract

Always-on AI applications, from environmental sensors to biomedical implants, require ultra-low power consumption. Analog circuits offer a path to sub-microwatt inference, yet existing analog implementations are limited to feedforward architectures: extending them to recurrent dynamics has been considered impractical due to noise accumulation through temporal feedback. We demonstrate that this barrier can be overcome through hardware-software co-design. Specifically, we identify that Bistable Memory Recurrent Units (BMRUs), a class of Recurrent Neural Networks (RNNs) with discrete-valued outputs and hysteretic dynamics, admit an ultra-low power current-mode analog implementation which we design from first principles. The resulting circuit establishes a one-to-one correspondence between each learned parameter and a circuit element. The discrete outputs suppress analog noise by at least 20-fold at each cell boundary, breaking the noise accumulation that prevents analog recurrence. We reformulate BMRUs for first-quadrant operation with fixed thresholds, enabling the direct correspondence while preserving expressivity and trainability. Transistor-level simulations in 180 nm Complementary Metal-Oxide-Semiconductor (CMOS) show near-perfect agreement between software predictions and circuit-level behavior, with the software model thereby serving as a high-fidelity simulator of the physical hardware at low computational cost. We leverage this fidelity to conduct large-scale noise immunity and power scaling analyses: the power cost of adding recurrence scales linearly with state dimension, while the feedforward layers dominating total power scale quadratically, meaning recurrence is added at linear marginal cost relative to the feedforward backbone. End-to-end keyword spotting achieves sub-microwatt inference at the RNN core.

Figures

Figures reproduced from arXiv: 2605.15216 by Arthur Fyon, Damien Ernst, Guillaume Drion, Jean-Michel Redout\'e, Julien Brandoit, Loris Mendolia.

**Figure 1.** Figure 1: Current-mode analog implementation and FQ BMRU formulation. A. Schematic of the ultra-low power current-mode bistable cell (top) and associated input-output current relationship (bottom). All thresholds and output gain are independently tunable via bias currents. B. FQ BMRU equations (top) and input candidate versus state relationship (bottom), with α, βlo and βhi as learnable parameters. The correspondenc… view at source ↗

**Figure 2.** Figure 2: Analog CMOS implementation of a complete BMRU-based RNN for “yes” KWS. A. Complete network architecture where all operations are computed using analog primitives whose behavior emerges from the physical properties of subthreshold transistors (top). For this proof of concept, a minimal configuration with N = 2 layers and state dimension d = 4 is implemented. KWS task for “yes” recognition. MFCC extraction o… view at source ↗

**Figure 3.** Figure 3: Large-scale noise robustness analysis across three benchmarks (sMNIST, pMNIST and dKWS). Accuracy as a function of injected noise level (relative to measured analog noise from transistor-level simulations) for FQ BMRU, LRU, and minGRU. At analog noise level, FQ BMRU and minGRU maintain full accuracy, while LRU fails catastrophically. FQ BMRU exhibits robust performance up to approximately 2× the analog noi… view at source ↗

**Figure 4.** Figure 4: illustrates the fundamental current mirror topology used to implement weighted connections. In subthreshold operation, a diode-connected input transistor converts an input current Ix into a gate voltage Vx = Vy, which is shared with the output transistor. Since both transistors operate at identical gate-source voltages, their drain currents are primarily determined by their width ratio: Iy ≈ Wout Win Ix. (… view at source ↗

**Figure 5.** Figure 5: Binary-weighted PMOS current mirror for programmable weight implementation. The effective output current is set by enabling combinations of binary-scaled mirror branches, allowing discrete (quantized) weight tuning via a shift register. P − 1 V1 · · · P − d Vd P − b V − b,j N + 1 V1 · · · N + d Vd N + b V + b,j PReLU w − 1 I1 w − d Id I − b,j w + 1 I1 w + d Id I + b,j ReLU Pd i=1 w + i Ii − w − i Ii + I… view at source ↗

**Figure 6.** Figure 6: FC layer with ReLU activation. PMOS mirrors (top) implement negative weights; NMOS mirrors (bottom) implement positive weights. The diode-connected PMOS harvests net positive current. current flows from the supply, implementing ReLU activation and providing output voltage Vout,j for subsequent stages. For layers requiring anti-ReLU activation, the output transistor is replaced with a diode-connected NMOS t… view at source ↗

**Figure 7.** Figure 7: FC layer with anti-ReLU activation. Same structure as [PITH_FULL_IMAGE:figures/full_fig_p029_7.png] view at source ↗

**Figure 8.** Figure 8: CMOS implementation of the FQ BMRU cell. Top left: conceptual dual-Heaviside feedback architecture. Top right (blue): single Heaviside element H1 using 5 transistors. Bottom (red): complete Schmitt trigger with feedback, using 9 transistors total. The top right panel (blue) of [PITH_FULL_IMAGE:figures/full_fig_p030_8.png] view at source ↗

**Figure 9.** Figure 9: Tunability of the CMOS implementation of the FQ BMRU cell. Input-output current relationship of the CMOS implementation of the FQ BMRU cell. All thresholds and output gain are independently tunable via bias currents. an NMOS transistor (M8). This feedback current is mirrored through a PMOS current mirror (M7 and M9), and injected back into the comparator branch of H1 (M1, M2, and M9), thereby increasing th… view at source ↗

**Figure 10.** Figure 10: CMOS FQ BMRU cell simulation results. Transient simulation under triangular input current for different operating temperatures (left). DC sweep demonstrating hysteretic behavior for different operating temperatures (middle). Monte Carlo analysis with 3σ process variation at room temperature (right). Baseline parameters: Igain = 486 pA, Ithresh = 368 pA, Iwidth = 216 pA. Transient response and DC character… view at source ↗

**Figure 11.** Figure 11: Tunability of CMOS FQ BMRU cell parameters. Igain sweep from 0 to 500 pA for different operating temperatures (left). Ithresh sweep from 100 pA to 400 pA with Iwidth = 50 pA (middle). Iwidth sweep from 10 pA to 300 pA (right). Baseline parameters as in [PITH_FULL_IMAGE:figures/full_fig_p032_11.png] view at source ↗

**Figure 12.** Figure 12: Component-level power breakdown across 50 inferences. Power consumption of FQ BMRU cells versus FC layers for 50 inference samples. The approximately even split at d = 4 indicates that both components contribute comparably to efficiency. FQ BMRU cells exhibit substantially lower power variance, consistent with stable discrete-output dynamics [PITH_FULL_IMAGE:figures/full_fig_p033_12.png] view at source ↗

**Figure 13.** Figure 13: Illustration of 20× error suppression at BMRU cell boundaries. During inference for the sample in [PITH_FULL_IMAGE:figures/full_fig_p034_13.png] view at source ↗

**Figure 14.** Figure 14: Hardware inference traces from Cadence Spectre simulation (seed 45). Each panel shows output logit currents (top: Iyes in green, Ino in red) and power consumption (bottom) over the 101-frame input sequence. Spoken word: “down”. Hardware prediction via majority vote: “background”. Software prediction: “background” [PITH_FULL_IMAGE:figures/full_fig_p036_14.png] view at source ↗

**Figure 15.** Figure 15: Hardware inference traces from Cadence Spectre simulation (seed 47). Each panel shows output logit currents (top: Iyes in green, Ino in red) and power consumption (bottom) over the 101-frame input sequence. Spoken word: “yes”. Hardware prediction via majority vote: “yes”. Software prediction: “yes”. 36 [PITH_FULL_IMAGE:figures/full_fig_p036_15.png] view at source ↗

**Figure 16.** Figure 16: Hardware inference traces from Cadence Spectre simulation (seed 48). Each panel shows output logit currents (top: Iyes in green, Ino in red) and power consumption (bottom) over the 101-frame input sequence. Spoken word: “yes”. Hardware prediction via majority vote: “yes”. Software prediction: “yes”. 37 [PITH_FULL_IMAGE:figures/full_fig_p037_16.png] view at source ↗

**Figure 17.** Figure 17: Hardware inference traces from Cadence Spectre simulation (seed 49). Each panel shows output logit currents (top: Iyes in green, Ino in red) and power consumption (bottom) over the 101-frame input sequence. Spoken word: “yes”. Hardware prediction via majority vote: “background”. Software prediction: “background”. In this case, both implementations misclassify the sample. Note that the spoken “yes” has an… view at source ↗

**Figure 18.** Figure 18: Hardware inference traces from Cadence Spectre simulation (seed 50). Each panel shows output logit currents (top: Iyes in green, Ino in red) and power consumption (bottom) over the 101-frame input sequence. Spoken word: “yes”. Hardware prediction via majority vote: “yes”. Software prediction: “yes”. 38 [PITH_FULL_IMAGE:figures/full_fig_p038_18.png] view at source ↗

**Figure 19.** Figure 19: Hardware inference traces from Cadence Spectre simulation (seed 52). Each panel shows output logit currents (top: Iyes in green, Ino in red) and power consumption (bottom) over the 101-frame input sequence. Spoken word: background noise (no speech). Hardware prediction via majority vote: “background”. Software prediction: “background” [PITH_FULL_IMAGE:figures/full_fig_p039_19.png] view at source ↗

**Figure 20.** Figure 20: Hardware inference traces from Cadence Spectre simulation (seed 66). Each panel shows output logit currents (top: Iyes in green, Ino in red) and power consumption (bottom) over the 101-frame input sequence. Spoken word: “up”. Hardware prediction via majority vote: “background”. Software prediction: “background”. 39 [PITH_FULL_IMAGE:figures/full_fig_p039_20.png] view at source ↗

**Figure 21.** Figure 21: Hardware inference traces from Cadence Spectre simulation (seed 67). Each panel shows output logit currents (top: Iyes in green, Ino in red) and power consumption (bottom) over the 101-frame input sequence. Spoken word: “yes”. Hardware prediction via majority vote: “background”. Software prediction: “yes”. This is the only case across 50 test samples where hardware and software predictions differ [PITH_… view at source ↗

**Figure 22.** Figure 22: Hardware inference traces from Cadence Spectre simulation (seed 68). Each panel shows output logit currents (top: Iyes in green, Ino in red) and power consumption (bottom) over the 101-frame input sequence. Spoken word: “yes”. Hardware prediction via majority vote: “yes”. Software prediction: “yes”. 40 [PITH_FULL_IMAGE:figures/full_fig_p040_22.png] view at source ↗

**Figure 23.** Figure 23: Hardware inference traces from Cadence Spectre simulation (seed 61). Each panel shows output logit currents (top: Iyes in green, Ino in red) and power consumption (bottom) over the 101-frame input sequence. Spoken word: “right”. Hardware prediction via majority vote: “background”. Software prediction: “background” [PITH_FULL_IMAGE:figures/full_fig_p041_23.png] view at source ↗

**Figure 24.** Figure 24: PVT corner validation from Cadence Spectre simulation (seed 51). Each panel shows output logit currents (top: Iyes in green, Ino in red) over the 101-frame input sequence and the corresponding prediction for each PVT condition (bottom). All five process corners (TT, FF, SS, FS, SF), three temperatures (−27◦C, 27◦C, 81◦C), and ±10% supply voltage variation are evaluated. Spoken word: “yes”. Correct classif… view at source ↗

**Figure 25.** Figure 25: PVT corner validation from Cadence Spectre simulation (seed 66). Each panel shows output logit currents (top: Iyes in green, Ino in red) over the 101-frame input sequence and the corresponding prediction for each PVT condition (bottom). All five process corners (TT, FF, SS, FS, SF), three temperatures (−27◦C, 27◦C, 81◦C), and ±10% supply voltage variation are evaluated. Input: background noise. Correct cl… view at source ↗

**Figure 26.** Figure 26: Monte Carlo mismatch analysis from Cadence Spectre simulation (seed 51). Each panel shows output logit currents (top: Iyes in green, Ino in red) over the 101-frame input sequence and the corresponding prediction for each Monte Carlo sample (bottom). Analysis performed with 3σ mismatch variation on all transistors (200 samples). Spoken word: “yes”. Nominal prediction: “yes”. Impaired sample rate: 11.5%. 42… view at source ↗

**Figure 27.** Figure 27: Monte Carlo mismatch analysis from Cadence Spectre simulation (seed 45). Each panel shows output logit currents (top: Iyes in green, Ino in red) over the 101-frame input sequence and the corresponding prediction for each Monte Carlo sample (bottom). Analysis performed with 3σ mismatch variation on all transistors (200 samples). Spoken word: “down”. Nominal prediction: “background”. Impaired sample rate: 0… view at source ↗

**Figure 28.** Figure 28: Monte Carlo mismatch analysis from Cadence Spectre simulation (seed 47). Each panel shows output logit currents (top: Iyes in green, Ino in red) over the 101-frame input sequence and the corresponding prediction for each Monte Carlo sample (bottom). Analysis performed with 3σ mismatch variation on all transistors (200 samples). Spoken word: “yes”. Nominal prediction: “yes”. Impaired sample rate: 0%. 43 [… view at source ↗

**Figure 29.** Figure 29: Monte Carlo mismatch analysis from Cadence Spectre simulation (seed 48). Each panel shows output logit currents (top: Iyes in green, Ino in red) over the 101-frame input sequence and the corresponding prediction for each Monte Carlo sample (bottom). Analysis performed with 3σ mismatch variation on all transistors (200 samples). Spoken word: “yes”. Nominal prediction: “yes”. Impaired sample rate: 0.5% [PI… view at source ↗

**Figure 30.** Figure 30: Monte Carlo mismatch analysis from Cadence Spectre simulation (seed 49). Each panel shows output logit currents (top: Iyes in green, Ino in red) over the 101-frame input sequence and the corresponding prediction for each Monte Carlo sample (bottom). Analysis performed with 3σ mismatch variation on all transistors (200 samples). Spoken word: “yes”. Nominal prediction: “background” (misclassified under nomi… view at source ↗

**Figure 31.** Figure 31: Monte Carlo mismatch analysis from Cadence Spectre simulation (seed 50). Each panel shows output logit currents (top: Iyes in green, Ino in red) over the 101-frame input sequence and the corresponding prediction for each Monte Carlo sample (bottom). Analysis performed with 3σ mismatch variation on all transistors (200 samples). Spoken word: “yes”. Nominal prediction: “yes”. Impaired sample rate: 0% [PITH… view at source ↗

**Figure 32.** Figure 32: Monte Carlo mismatch analysis from Cadence Spectre simulation (seed 52). Each panel shows output logit currents (top: Iyes in green, Ino in red) over the 101-frame input sequence and the corresponding prediction for each Monte Carlo sample (bottom). Analysis performed with 3σ mismatch variation on all transistors (200 samples). Input: background noise (no speech). Nominal prediction: “background”. Impaire… view at source ↗

**Figure 33.** Figure 33: Monte Carlo mismatch analysis from Cadence Spectre simulation (seed 66). Each panel shows output logit currents (top: Iyes in green, Ino in red) over the 101-frame input sequence and the corresponding prediction for each Monte Carlo sample (bottom). Analysis performed with 3σ mismatch variation on all transistors (200 samples). Spoken word: “up”. Nominal prediction: “background”. Impaired sample rate: 0% … view at source ↗

**Figure 34.** Figure 34: Monte Carlo mismatch analysis from Cadence Spectre simulation (seed 67). Each panel shows output logit currents (top: Iyes in green, Ino in red) over the 101-frame input sequence and the corresponding prediction for each Monte Carlo sample (bottom). Analysis performed with 3σ mismatch variation on all transistors (200 samples). Spoken word: “yes”. Nominal hardware prediction: “background” (already in dis… view at source ↗

**Figure 35.** Figure 35: Monte Carlo mismatch analysis from Cadence Spectre simulation (seed 68). Each panel shows output logit currents (top: Iyes in green, Ino in red) over the 101-frame input sequence and the corresponding prediction for each Monte Carlo sample (bottom). Analysis performed with 3σ mismatch variation on all transistors (200 samples). Spoken word: “yes”. Nominal prediction: “yes”. Impaired sample rate: 0% [PITH… view at source ↗

**Figure 36.** Figure 36: Monte Carlo mismatch analysis from Cadence Spectre simulation (seed 61). Each panel shows output logit currents (top: Iyes in green, Ino in red) over the 101-frame input sequence and the corresponding prediction for each Monte Carlo sample (bottom). Analysis performed with 3σ mismatch variation on all transistors (200 samples). Spoken word: “right”. Nominal prediction: “background”. Impaired sample rate: … view at source ↗

**Figure 37.** Figure 37: Multi-class KWS evaluation (11 classes), "three" spoken. A. 2 × 4 network. Logit time evolution (left) and integrated logits used for the final classification decision (right). The classification is correct, but the narrow decision margins leave the prediction vulnerable to mismatch. B. 2 × 16 network. Logit time evolution (left) and integrated logits (right). The classification is correct and the decisio… view at source ↗

**Figure 38.** Figure 38: Intermediate signal comparison between software and hardware (layer-1 candidates, seed 51). Overlay of the 4 software-predicted and Cadence-simulated candidate currents of the first recurrent layer for a representative “yes” inference sample. 49 [PITH_FULL_IMAGE:figures/full_fig_p049_38.png] view at source ↗

**Figure 39.** Figure 39: Intermediate signal comparison between software and hardware (layer-1 states, seed 51). Overlay of the 4 software-predicted and Cadence-simulated FQ BMRU cell outputs of the first recurrent layer for a representative “yes” inference sample. 50 [PITH_FULL_IMAGE:figures/full_fig_p050_39.png] view at source ↗

**Figure 40.** Figure 40: Intermediate signal comparison between software and hardware (layer-2 candidates, seed 51). Overlay of the 4 software-predicted and Cadence-simulated candidate currents of the second recurrent layer for a representative “yes” inference sample. 51 [PITH_FULL_IMAGE:figures/full_fig_p051_40.png] view at source ↗

**Figure 41.** Figure 41: Intermediate signal comparison between software and hardware (layer-2 states, seed 51). Overlay of the 4 software-predicted and Cadence-simulated FQ BMRU cell outputs of the second recurrent layer for a representative “yes” inference sample. 52 [PITH_FULL_IMAGE:figures/full_fig_p052_41.png] view at source ↗

**Figure 42.** Figure 42: Intermediate signal comparison between software and hardware (layer-2 output after skip connection, seed 51). Overlay of the 4 software-predicted and Cadence-simulated output signals of the second recurrent layer, after the skip connection, for a representative “yes” inference sample. 53 [PITH_FULL_IMAGE:figures/full_fig_p053_42.png] view at source ↗

**Figure 43.** Figure 43: Intermediate signal comparison between software and hardware (output logits, seed 51). Overlay of the software-predicted and Cadence-simulated output logit currents for a representative “yes” inference sample. 54 [PITH_FULL_IMAGE:figures/full_fig_p054_43.png] view at source ↗

**Figure 44.** Figure 44: Intermediate signal comparison between software and hardware (layer-1 candidates, seed 66). Overlay of the 4 software-predicted and Cadence-simulated candidate currents of the first recurrent layer for a representative “background” inference sample. 55 [PITH_FULL_IMAGE:figures/full_fig_p055_44.png] view at source ↗

**Figure 45.** Figure 45: Intermediate signal comparison between software and hardware (layer-1 states, seed 66). Overlay of the 4 software-predicted and Cadence-simulated FQ BMRU cell outputs of the first recurrent layer for a representative “background” inference sample. 56 [PITH_FULL_IMAGE:figures/full_fig_p056_45.png] view at source ↗

**Figure 46.** Figure 46: Intermediate signal comparison between software and hardware (layer-2 candidates, seed 66). Overlay of the 4 software-predicted and Cadence-simulated candidate currents of the second recurrent layer for a representative “background” inference sample. 57 [PITH_FULL_IMAGE:figures/full_fig_p057_46.png] view at source ↗

**Figure 47.** Figure 47: Intermediate signal comparison between software and hardware (layer-2 states, seed 66). Overlay of the 4 software-predicted and Cadence-simulated FQ BMRU cell outputs of the second recurrent layer for a representative “background” inference sample. 58 [PITH_FULL_IMAGE:figures/full_fig_p058_47.png] view at source ↗

**Figure 48.** Figure 48: Intermediate signal comparison between software and hardware (layer-2 output after skip connection, seed 66). Overlay of the 4 software-predicted and Cadence-simulated output signals of the second recurrent layer, after the skip connection, for a representative “background” inference sample. 59 [PITH_FULL_IMAGE:figures/full_fig_p059_48.png] view at source ↗

**Figure 49.** Figure 49: Intermediate signal comparison between software and hardware (output logits, seed 66). Overlay of the software-predicted and Cadence-simulated output logit currents for a representative “background” inference sample. 60 [PITH_FULL_IMAGE:figures/full_fig_p060_49.png] view at source ↗

Review history (2 revisions) →

discussion (0)

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

A Fully Tunable Ultra-Low Power Current-Mode Memory Cell in Standard CMOS Technology
eess.SP 2026-05 unverdicted novelty 7.0

A fully tunable ultra-low-power current-mode bistable memory cell using nine standard CMOS transistors enables spike-based logic gates and noise-immune recurrent neural units.
A Fully Tunable Ultra-Low Power Current-Mode Memory Cell in Standard CMOS Technology
eess.SP 2026-05 unverdicted novelty 6.0

A nine-transistor current-mode bistable memory cell in 180 nm CMOS is presented with independent tuning of threshold, hysteresis, and gain, shown via schematic simulations for spike-based logic gates and recurrent neu...

Reference graph

Works this paper leans on

117 extracted references · 117 canonical work pages · cited by 1 Pith paper · 17 internal anchors

[1]

LLMCarbon: Modeling the end-to-end carbon footprint of large language models, 2024

Ahmad Faiz, Sotaro Kaneda, Ruhan Wang, Rita Osi, Prateek Sharma, Fan Chen, and Lei Jiang. LLMCarbon: Modeling the end-to-end carbon footprint of large language models, 2024. URL https://arxiv.org/abs/2309.14393

work page arXiv 2024
[2]

Toward green AI: A methodological survey of the scientific literature.IEEE Access, 12:23989–24013, 2024

Enrico Barbierato and Alice Gatti. Toward green AI: A methodological survey of the scientific literature.IEEE Access, 12:23989–24013, 2024. doi: 10.1109/ACCESS.2024.3360705

work page doi:10.1109/access.2024.3360705 2024
[3]

James Kirkpatrick, Razvan Pascanu, Neil Rabinowitz, Joel Veness, Guillaume Desjardins, Andrei A

Mark Horowitz. Computing’s energy problem (and what we can do about it). InIEEE International Solid-State Circuits Conference Digest of Technical Papers, pages 10–14, San Francisco, CA, USA, 2014. IEEE. doi: 10.1109/ISSCC.2014.6757323

work page doi:10.1109/isscc.2014.6757323 2014
[4]

Efficient processing of deep neural networks: A tutorial and survey.Proceedings of the IEEE, 105(12):2295–2329, 2017

Vivienne Sze, Yu-Hsin Chen, Tien-Ju Yang, and Joel S Emer. Efficient processing of deep neural networks: A tutorial and survey.Proceedings of the IEEE, 105(12):2295–2329, 2017. doi: 10.1109/JPROC.2017.2761740

work page doi:10.1109/jproc.2017.2761740 2017
[5]

Neural record- ing and stimulation using wireless networks of microimplants.Nature Electronics, 4(8): 604–614, 2021

Jihun Lee, Vincent Leung, Ah-Hyoung Lee, Jiannan Huang, Peter Asbeck, Patrick P Mercier, Stephen Shellhammer, Lawrence Larson, Farah Laiwalla, and Arto Nurmikko. Neural record- ing and stimulation using wireless networks of microimplants.Nature Electronics, 4(8): 604–614, 2021. doi: 10.1038/s41928-021-00631-8

work page doi:10.1038/s41928-021-00631-8 2021
[6]

Neural Dust: An Ultrasonic, Low Power Solution for Chronic Brain-Machine Interfaces

Dongjin Seo, Jose M Carmena, Jan M Rabaey, Elad Alon, and Michel M Maharbiz. Neural dust: An ultrasonic, low power solution for chronic brain-machine interfaces, 2013. URL https://arxiv.org/abs/1307.2196. 10

work page internal anchor Pith review Pith/arXiv arXiv 2013
[7]

An electronic neuromorphic system for real-time detection of high frequency oscillations (HFO) in intracranial EEG.Nature Communications, 12(1):3095, 2021

Mohammadali Sharifshazileh, Karla Burelo, Johannes Sarnthein, and Giacomo Indiveri. An electronic neuromorphic system for real-time detection of high frequency oscillations (HFO) in intracranial EEG.Nature Communications, 12(1):3095, 2021. doi: 10.1038/ s41467-021-23342-2

work page 2021
[8]

Analog versus digital: Extrapolating from electronics to neurobiology

Rahul Sarpeshkar. Analog versus digital: Extrapolating from electronics to neurobiology. Neural Computation, 10(7):1601–1638, 1998. doi: 10.1162/089976698300017052

work page doi:10.1162/089976698300017052 1998
[9]

Memory devices and applications for in-memory computing.Nature Nanotechnology, 15(7): 529–544, 2020

Abu Sebastian, Manuel Le Gallo, Riduan Khaddam-Aljameh, and Evangelos Eleftheriou. Memory devices and applications for in-memory computing.Nature Nanotechnology, 15(7): 529–544, 2020. doi: 10.1038/s41565-020-0655-z

work page doi:10.1038/s41565-020-0655-z 2020
[10]

In-memory computing with resistive switching devices

Daniele Ielmini and H-S Philip Wong. In-memory computing with resistive switching devices. Nature Electronics, 1(6):333–343, 2018. doi: 10.1038/s41928-018-0092-2

work page doi:10.1038/s41928-018-0092-2 2018
[11]

Attention is all you need

Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N Gomez, Łukasz Kaiser, and Illia Polosukhin. Attention is all you need. InAdvances in Neural Information Processing Systems, volume 30, Long Beach, CA, USA, 2017

work page 2017
[12]

The end of transformers? On challenging attention and the rise of sub-quadratic architectures, 2025

Alexander M Fichtl, Jeremias Bohn, Josefin Kelber, Edoardo Mosca, and Georg Groh. The end of transformers? On challenging attention and the rise of sub-quadratic architectures, 2025. URLhttps://arxiv.org/abs/2510.05364

work page arXiv 2025
[13]

Efficient transformers: A survey

Yi Tay, Mostafa Dehghani, Dara Bahri, and Donald Metzler. Efficient transformers: A survey. ACM Computing Surveys, 55(6):1–28, 2022. doi: 10.1145/3530811

work page doi:10.1145/3530811 2022
[14]

Efficiently Modeling Long Sequences with Structured State Spaces

Albert Gu, Karan Goel, and Christopher Ré. Efficiently modeling long sequences with structured state spaces, 2022. URLhttps://arxiv.org/abs/2111.00396

work page internal anchor Pith review Pith/arXiv arXiv 2022
[15]

Mamba: Linear-time sequence modeling with selective state spaces,

Albert Gu and Tri Dao. Mamba: Linear-time sequence modeling with selective state spaces,

work page
[16]

URLhttps://arxiv.org/abs/2312.00752

work page internal anchor Pith review Pith/arXiv arXiv
[17]

Transformers are SSMs: Generalized Models and Efficient Algorithms Through Structured State Space Duality

Tri Dao and Albert Gu. Transformers are SSMs: Generalized models and efficient algorithms through structured state space duality, 2024. URL https://arxiv.org/abs/2405.21060

work page internal anchor Pith review Pith/arXiv arXiv 2024
[18]

Linear recurrent units for sequential recommendation

Zhenrui Yue, Yueqi Wang, Zhankui He, Huimin Zeng, Julian McAuley, and Dong Wang. Linear recurrent units for sequential recommendation. InProceedings of the 17th ACM International Conference on Web Search and Data Mining, pages 930–938, Merida, Mexico,

work page
[19]

doi: 10.1145/3616855.3635760

work page doi:10.1145/3616855.3635760
[20]

The Mamba in the Llama: Distilling and accelerating hybrid models

Junxiong Wang, Daniele Paliotta, Avner May, Alexander Rush, and Tri Dao. The Mamba in the Llama: Distilling and accelerating hybrid models. InAdvances in Neural Information Processing Systems, volume 37, pages 62432–62457, Vancouver, Canada, 2024

work page 2024
[21]

On the parameterization and initialization of diagonal state space models

Albert Gu, Ankit Gupta, Karan Goel, and Christopher Ré. On the parameterization and initialization of diagonal state space models. InAdvances in Neural Information Processing Systems, volume 35, pages 35971–35983, New Orleans, LA, USA, 2022

work page 2022
[22]

Fading memory and the problem of approximating nonlinear operators with Volterra series.IEEE Transactions on Circuits and Systems, 32(11):1150–1161,

Stephen Boyd and Leon Chua. Fading memory and the problem of approximating nonlinear operators with Volterra series.IEEE Transactions on Circuits and Systems, 32(11):1150–1161,

work page
[23]

doi: 10.1109/TCS.1985.1085649

work page doi:10.1109/tcs.1985.1085649 1985
[24]

A bio-inspired bistable recurrent cell allows for long-lasting memory.PLOS ONE, 16(6):e0252676, 2021

Nicolas Vecoven, Damien Ernst, and Guillaume Drion. A bio-inspired bistable recurrent cell allows for long-lasting memory.PLOS ONE, 16(6):e0252676, 2021. doi: 10.1371/journal. pone.0252676

work page doi:10.1371/journal 2021
[25]

Warming up recurrent neural networks to maximise reachable multistability greatly improves learning.Neural Networks, 166:645–669, 2023

Gaspard Lambrechts, Florent De Geeter, Nicolas Vecoven, Damien Ernst, and Guillaume Drion. Warming up recurrent neural networks to maximise reachable multistability greatly improves learning.Neural Networks, 166:645–669, 2023. doi: 10.1016/j.neunet.2023.07.023

work page doi:10.1016/j.neunet.2023.07.023 2023
[26]

Memory from the dynamics of intrinsic membrane currents.Proceedings of the National Academy of Sciences, 93(24):13481–13486, 1996

Eve Marder, L F Abbott, Gina G Turrigiano, Zhengyu Liu, and Jorge Golowasch. Memory from the dynamics of intrinsic membrane currents.Proceedings of the National Academy of Sciences, 93(24):13481–13486, 1996. doi: 10.1073/pnas.93.24.13481. 11

work page doi:10.1073/pnas.93.24.13481 1996
[27]

Neural Computation 9, 1735–1780

Sepp Hochreiter and Jürgen Schmidhuber. Long short-term memory.Neural Computation, 9 (8):1735–1780, 1997. doi: 10.1162/neco.1997.9.8.1735

work page doi:10.1162/neco.1997.9.8.1735 1997
[28]

Learning phrase representations using RNN encoder- decoder for statistical machine translation, 2014

Kyunghyun Cho, Bart Van Merriënboer, Caglar Gulcehre, Dzmitry Bahdanau, Fethi Bougares, Holger Schwenk, and Yoshua Bengio. Learning phrase representations using RNN encoder- decoder for statistical machine translation, 2014. URL https://arxiv.org/abs/1406. 1078

work page 2014
[29]

Bengio, P

Yoshua Bengio, Patrice Simard, and Paolo Frasconi. Learning long-term dependencies with gradient descent is difficult.IEEE Transactions on Neural Networks, 5(2):157–166, 1994. doi: 10.1109/72.279181

work page doi:10.1109/72.279181 1994
[30]

On the difficulty of training recurrent neural networks

Razvan Pascanu, Tomas Mikolov, and Yoshua Bengio. On the difficulty of training recurrent neural networks. InInternational Conference on Machine Learning, pages 1310–1318, Atlanta, GA, USA, 2013. PMLR

work page 2013
[31]

Review of ASIC accelerators for deep neural network.Microprocessors and Microsystems, 89:104441, 2022

Raju Machupalli, Masum Hossain, and Mrinal Mandal. Review of ASIC accelerators for deep neural network.Microprocessors and Microsystems, 89:104441, 2022. doi: 10.1016/j.micpro. 2022.104441

work page doi:10.1016/j.micpro 2022
[32]

Eyeriss v2: A flexible accelerator for emerging deep neural networks on mobile devices.IEEE Journal on Emerging and Selected Topics in Circuits and Systems, 9(2):292–308, 2019

Yu-Hsin Chen, Tien-Ju Yang, Joel Emer, and Vivienne Sze. Eyeriss v2: A flexible accelerator for emerging deep neural networks on mobile devices.IEEE Journal on Emerging and Selected Topics in Circuits and Systems, 9(2):292–308, 2019. doi: 10.1109/JETCAS.2019.2910232

work page doi:10.1109/jetcas.2019.2910232 2019
[33]

Tiny machine learning and on-device inference: A survey of applications, challenges, and future directions.Sensors, 25(10):3191, 2025

Soroush Heydari and Qusay H Mahmoud. Tiny machine learning and on-device inference: A survey of applications, challenges, and future directions.Sensors, 25(10):3191, 2025. doi: 10.3390/s25103191

work page doi:10.3390/s25103191 2025
[34]

MLPerf Tiny Benchmark

Colby Banbury, Vijay Janapa Reddi, Peter Torelli, Jeremy Holleman, Nat Jeffries, Csaba Kiraly, Pietro Montino, David Kanter, Sebastian Ahmed, Danilo Pau, et al. MLPerf tiny benchmark, 2021. URLhttps://arxiv.org/abs/2106.07597

work page Pith review arXiv 2021
[35]

O’Reilly Media, 2019

Pete Warden and Daniel Situnayake.TinyML: Machine Learning with TensorFlow Lite on Arduino and Ultra-Low-Power Microcontrollers. O’Reilly Media, 2019

work page 2019
[36]

MCUNet: Tiny deep learning on IoT devices

Ji Lin, Wei-Ming Chen, Yujun Lin, John Cohn, Chuang Gan, and Song Han. MCUNet: Tiny deep learning on IoT devices. InAdvances in Neural Information Processing Systems, volume 33, pages 11711–11722, Virtual, 2020

work page 2020
[37]

SpArSe: Sparse architecture search for CNNs on resource-constrained microcontrollers

Igor Fedorov, Ryan P Adams, Matthew Mattina, and Paul N Whatmough. SpArSe: Sparse architecture search for CNNs on resource-constrained microcontrollers. InAdvances in Neural Information Processing Systems, volume 32, Vancouver, Canada, 2019

work page 2019
[38]

Real-Time Com- puting Without Stable States: A New Framework for Neural Compu- tation Based on Perturbations

Wolfgang Maass, Thomas Natschläger, and Henry Markram. Real-time computing without stable states: A new framework for neural computation based on perturbations.Neural Computation, 14(11):2531–2560, 2002. doi: 10.1162/089976602760407955

work page doi:10.1162/089976602760407955 2002
[39]

Loihi: A neuromorphic manycore processor with on-chip learning.IEEE Micro, 38(1):82–99, 2018

Mike Davies, Narayan Srinivasa, Tsung-Han Lin, Gautham Chinya, Yongqiang Cao, Sri Harsha Choday, Georgios Dimou, Prasad Joshi, Nabil Imam, Shweta Jain, et al. Loihi: A neuromorphic manycore processor with on-chip learning.IEEE Micro, 38(1):82–99, 2018. doi: 10.1109/ MM.2018.112130359

work page arXiv 2018
[40]

Efficient neuromorphic signal processing with Loihi 2

Garrick Orchard, E Paxon Frady, Daniel Ben Dayan Rubin, Sophia Sanborn, Sumit Bam Shrestha, Friedrich T Sommer, and Mike Davies. Efficient neuromorphic signal processing with Loihi 2. InIEEE Workshop on Signal Processing Systems (SiPS), pages 254–259, Coimbra, Portugal, 2021. IEEE. doi: 10.1109/SiPS52927.2021.00053

work page doi:10.1109/sips52927.2021.00053 2021
[41]

Neuromorphic electronic systems.Proceedings of the IEEE, 78(10):1629–1636,

Carver Mead. Neuromorphic electronic systems.Proceedings of the IEEE, 78(10):1629–1636,

work page
[42]

doi: 10.1109/5.58356

work page doi:10.1109/5.58356
[43]

Memristive crossbar arrays for brain-inspired computing

Qiangfei Xia and J Joshua Yang. Memristive crossbar arrays for brain-inspired computing. Nature Materials, 18(4):309–323, 2019. doi: 10.1038/s41563-019-0291-x. 12

work page doi:10.1038/s41563-019-0291-x 2019
[44]

Training and operation of an integrated neuromorphic network based on metal-oxide memristors.Nature, 521(7550):61–64, 2015

Mirko Prezioso, Farnood Merrikh-Bayat, Brian D Hoskins, Gina C Adam, Konstantin K Likharev, and Dmitri B Strukov. Training and operation of an integrated neuromorphic network based on metal-oxide memristors.Nature, 521(7550):61–64, 2015. doi: 10.1038/nature14441

work page doi:10.1038/nature14441 2015
[45]

Hardware implementation of deep network accelerators towards healthcare and biomedical applications.IEEE Transactions on Biomedical Circuits and Systems, 14(6):1138–1159, 2020

Mostafa Rahimi Azghadi, Corey Lammie, Jason K Eshraghian, Melika Payvand, Elisa Donati, Bernabe Linares-Barranco, and Giacomo Indiveri. Hardware implementation of deep network accelerators towards healthcare and biomedical applications.IEEE Transactions on Biomedical Circuits and Systems, 14(6):1138–1159, 2020. doi: 10.1109/TBCAS.2020.3036081

work page doi:10.1109/tbcas.2020.3036081 2020
[46]

Long short-term memory networks in memristor crossbar arrays.Nature Machine Intelligence, 1(1):49–57, 2019

Can Li, Miao Hu, Yunning Li, Hao Jiang, Ning Ge, Eric Montgomery, Jiaming Zhang, Wenhao Song, Noraica Dávila, Catherine E Graves, et al. Long short-term memory networks in memristor crossbar arrays.Nature Machine Intelligence, 1(1):49–57, 2019. doi: 10.1038/ s42256-018-0001-4

work page 2019
[47]

A review of computing with spiking neural networks.Computers, Materials & Continua, 78(3):2909, 2024

Jiadong Wu, Yinan Wang, Zhiwei Li, Lun Lu, and Qingjiang Li. A review of computing with spiking neural networks.Computers, Materials & Continua, 78(3):2909, 2024. doi: 10.32604/cmc.2024.047240

work page doi:10.32604/cmc.2024.047240 2024
[48]

Towards spike-based machine intelligence with neuromorphic computing.Nature, 575(7784):607–617, 2019

Kaushik Roy, Akhilesh Jaiswal, and Priyadarshini Panda. Towards spike-based machine intelligence with neuromorphic computing.Nature, 575(7784):607–617, 2019. doi: 10.1038/ s41586-019-1677-2

work page 2019
[49]

Real-time edge neuromorphic tasting from chemical microsensor arrays.Frontiers in Neuroscience, 15:771480, 2021

Nicholas LeBow, Bodo Rueckauer, Pengfei Sun, Meritxell Rovira, Cecilia Jiménez-Jorquera, Shih-Chii Liu, and Josep Maria Margarit-Taulé. Real-time edge neuromorphic tasting from chemical microsensor arrays.Frontiers in Neuroscience, 15:771480, 2021. doi: 10.3389/fnins. 2021.771480

work page doi:10.3389/fnins 2021
[50]

Opportunities for neuromorphic computing algorithms and applications.Nature Computational Science, 2(1):10–19, 2022

Catherine D Schuman, Shruti R Kulkarni, Maryam Parsa, J Parker Mitchell, Bill Kay, and Prasanna Date. Opportunities for neuromorphic computing algorithms and applications.Nature Computational Science, 2(1):10–19, 2022. doi: 10.1038/s43588-021-00184-y

work page doi:10.1038/s43588-021-00184-y 2022
[51]

Recurrent Neural Networks Hardware Implementation on FPGA

Andre Xian Ming Chang, Berin Martini, and Eugenio Culurciello. Recurrent neural networks hardware implementation on FPGA, 2015. URLhttps://arxiv.org/abs/1511.05552

work page internal anchor Pith review Pith/arXiv arXiv 2015
[52]

Noise-mitigation strategies in physical feedforward neural networks.Chaos: An Interdisciplinary Journal of Nonlinear Science, 32(6):061106,

Nadezhda Semenova and Daniel Brunner. Noise-mitigation strategies in physical feedforward neural networks.Chaos: An Interdisciplinary Journal of Nonlinear Science, 32(6):061106,

work page
[53]

doi: 10.1063/5.0096637

work page doi:10.1063/5.0096637
[54]

Kendall, R

Jack Kendall, Ross Pantone, Kalpana Manickavasagam, Yoshua Bengio, and Benjamin Scellier. Training end-to-end analog neural networks with equilibrium propagation, 2020. URL https: //arxiv.org/abs/2006.01981

work page arXiv 2020
[55]

MIT press, 2002

Shih-Chii Liu.Analog VLSI: Circuits and principles. MIT press, 2002

work page 2002
[56]

AML100 - near-zero power analogml processor

Aspinity. AML100 - near-zero power analogml processor. https://www.aspinity.com/ aml100, 2022. Product information page, accessed: 2026-05-05

work page 2022
[57]

NDP120 neural decision processor

Syntiant. NDP120 neural decision processor. https://www.syntiant.com/s/ Syntiant-Product_Brief_NDP120.pdf, 2021. Product brief, accessed: 2026-05-05

work page 2021
[58]

Parallelizable memory recurrent units

Florent De Geeter, Gaspard Lambrechts, Damien Ernst, and Guillaume Drion. Parallelizable memory recurrent units, 2026. URLhttps://arxiv.org/abs/2601.09495

work page internal anchor Pith review Pith/arXiv arXiv 2026
[59]

Pararnn: Unlocking parallel training of nonlinear rnns for large language models, 2025

Federico Danieli, Pau Rodríguez, Miguel Sarabia, Xavier Suau, and Luca Zappella. ParaRNN: Unlocking parallel training of nonlinear RNNs for large language models, 2025. URL https: //arxiv.org/abs/2510.21450

work page arXiv 2025
[60]

Parallelizing linear recurrent neural nets over sequence length,

Eric Martin and Chris Cundy. Parallelizing linear recurrent neural nets over sequence length,

work page
[61]

URLhttps://arxiv.org/abs/1709.04057

work page internal anchor Pith review Pith/arXiv arXiv
[62]

Speech Commands: A Dataset for Limited-Vocabulary Speech Recognition

Pete Warden. Speech commands: A dataset for limited-vocabulary speech recognition, 2018. URLhttps://arxiv.org/abs/1804.03209. 13

work page internal anchor Pith review Pith/arXiv arXiv 2018
[63]

Resurrecting recurrent neural networks for long sequences, 2023

Antonio Orvieto, Samuel L Smith, Albert Gu, Anushan Fernando, Caglar Gulcehre, Razvan Pascanu, and Soham De. Resurrecting recurrent neural networks for long sequences, 2023. URLhttps://arxiv.org/abs/2303.06349

work page arXiv 2023
[64]

Were RNNs All We Needed?

Leo Feng, Frederick Tung, Mohamed Osama Ahmed, Yoshua Bengio, and Hossein Hajimir- sadeghi. Were RNNs all we needed?, 2024. URLhttps://arxiv.org/abs/2410.01201

work page arXiv 2024
[65]

On the Importance of Multistability for Horizon Generalization in Reinforcement Learning

Asad Bakija, Florent De Geeter, Julien Brandoit, Pierre Sacré, and Guillaume Drion. On the importance of multistability for horizon generalization in reinforcement learning, 2026. URL https://arxiv.org/abs/2605.12206

work page internal anchor Pith review Pith/arXiv arXiv 2026
[66]

Matching properties of MOS transistors.IEEE Journal of Solid-State Circuits, 24(5):1433–1439, 1989

Marcel J M Pelgrom, Aad C J Duinmaijer, and Anton P G Welbers. Matching properties of MOS transistors.IEEE Journal of Solid-State Circuits, 24(5):1433–1439, 1989. doi: 10.1109/JSSC.1989.572629

work page doi:10.1109/jssc.1989.572629 1989
[67]

Device mismatch and tradeoffs in the design of analog circuits.IEEE Journal of Solid-State Circuits, 40(6):1212–1224, 2005

Peter R Kinget. Device mismatch and tradeoffs in the design of analog circuits.IEEE Journal of Solid-State Circuits, 40(6):1212–1224, 2005. doi: 10.1109/JSSC.2005.848021

work page doi:10.1109/jssc.2005.848021 2005
[68]

Silicon diode temperature sensors—a review of applications.Sensors and Actuators A: Physical, 232:63–74, 2015

Mohtashim Mansoor, Ibraheem Haneef, Suhail Akhtar, Andrea De Luca, and Florin Udrea. Silicon diode temperature sensors—a review of applications.Sensors and Actuators A: Physical, 232:63–74, 2015. doi: 10.1016/j.sna.2015.04.022

work page doi:10.1016/j.sna.2015.04.022 2015
[69]

Long range arena: A benchmark for efficient transformers.arXiv preprint arXiv:2011.04006,

Yi Tay, Mostafa Dehghani, Samira Abnar, Yikang Shen, Dara Bahri, Philip Pham, Jinfeng Rao, Liu Yang, Sebastian Ruder, and Donald Metzler. Long range arena: A benchmark for efficient transformers, 2021. URLhttps://arxiv.org/abs/2011.04006

work page arXiv 2021
[70]

nanoGPT, 2022

Andrej Karpathy. nanoGPT, 2022. URL https://github.com/karpathy/nanoGPT. GitHub repository

work page 2022
[71]

Digital twin: Enabling technologies, challenges and open research.IEEE Access, 8:108952–108971, 2020

Iván López-Espejo, Zheng-Hua Tan, John H L Hansen, and Jesper Jensen. Deep spoken keyword spotting: An overview.IEEE Access, 10:4169–4199, 2022. doi: 10.1109/ACCESS. 2021.3139508

work page doi:10.1109/access 2022
[72]

The mnist database of handwritten digit images for machine learning research [best of the web].IEEE Signal Processing Magazine, 29(6):141–142, 2012

Geoffrey Hinton, Li Deng, Dong Yu, George E Dahl, Abdel-rahman Mohamed, Navdeep Jaitly, Andrew Senior, Vincent Vanhoucke, Patrick Nguyen, Tara N Sainath, et al. Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups.IEEE Signal Processing Magazine, 29(6):82–97, 2012. doi: 10.1109/MSP.2012. 2205597

work page doi:10.1109/msp.2012 2012
[73]

Small-footprint keyword spotting using deep neural networks

Guoguo Chen, Carolina Parada, and Georg Heigold. Small-footprint keyword spotting using deep neural networks. InIEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pages 4087–4091, Florence, Italy, 2014. IEEE. doi: 10.1109/ICASSP. 2014.6854370

work page doi:10.1109/icassp 2014
[74]

Convolutional Recurrent Neural Networks for Small-Footprint Keyword Spotting

Sercan O Arik, Markus Kliegl, Rewon Child, Joel Hestness, Andrew Gibiansky, Chris Fougner, Ryan Prenger, and Adam Coates. Convolutional recurrent neural networks for small-footprint keyword spotting, 2017. URLhttps://arxiv.org/abs/1703.05390

work page internal anchor Pith review Pith/arXiv arXiv 2017
[75]

Steven Davis and Paul Mermelstein. Comparison of parametric representations for monosyl- labic word recognition in continuously spoken sentences.IEEE Transactions on Acoustics, Speech, and Signal Processing, 28(4):357–366, 1980. doi: 10.1109/TASSP.1980.1163420

work page doi:10.1109/tassp.1980.1163420 1980
[76]

An 800 nW switched-capacitor feature extraction filterbank for sound classification.IEEE Transactions on Circuits and Systems I: Regular Papers, 68(4):1578–1588, 2021

Daniel Augusto Villamizar, Dante Gabriel Muratore, James B Wieser, and Boris Murmann. An 800 nW switched-capacitor feature extraction filterbank for sound classification.IEEE Transactions on Circuits and Systems I: Regular Papers, 68(4):1578–1588, 2021. doi: 10. 1109/tcsi.2020.3047035

work page arXiv 2021
[77]

Heejin Yang, Ji-Hwan Seol, Rohit Rothe, Zichen Fan, Qirui Zhang, Hun-Seok Kim, David Blaauw, and Dennis Sylvester. A 1.5- µw fully-integrated keyword spotting SoC in 28-nm CMOS with skip-RNN and fast-settling analog frontend for adaptive frame skipping.IEEE Journal of Solid-State Circuits, 59(1):29–39, 2023. doi: 10.1109/jssc.2023.3316648

work page doi:10.1109/jssc.2023.3316648 2023
[78]

Cadence Virtuoso Platform

Cadence Design Systems. Cadence Virtuoso Platform. https://www.cadence.com/en_ US/home/tools/custom-ic-analog-rf-design/virtuoso-studio.html, 2023. 14

work page 2023
[79]

Oxford University Press, 3rd edition, 2011

Phillip E Allen and Douglas R Holberg.CMOS Analog Circuit Design. Oxford University Press, 3rd edition, 2011

work page 2011
[80]

McGraw-Hill, 2001

Behzad Razavi.Design of Analog CMOS Integrated Circuits. McGraw-Hill, 2001

work page 2001

Showing first 80 references.

[1] [1]

LLMCarbon: Modeling the end-to-end carbon footprint of large language models, 2024

Ahmad Faiz, Sotaro Kaneda, Ruhan Wang, Rita Osi, Prateek Sharma, Fan Chen, and Lei Jiang. LLMCarbon: Modeling the end-to-end carbon footprint of large language models, 2024. URL https://arxiv.org/abs/2309.14393

work page arXiv 2024

[2] [2]

Toward green AI: A methodological survey of the scientific literature.IEEE Access, 12:23989–24013, 2024

Enrico Barbierato and Alice Gatti. Toward green AI: A methodological survey of the scientific literature.IEEE Access, 12:23989–24013, 2024. doi: 10.1109/ACCESS.2024.3360705

work page doi:10.1109/access.2024.3360705 2024

[3] [3]

James Kirkpatrick, Razvan Pascanu, Neil Rabinowitz, Joel Veness, Guillaume Desjardins, Andrei A

Mark Horowitz. Computing’s energy problem (and what we can do about it). InIEEE International Solid-State Circuits Conference Digest of Technical Papers, pages 10–14, San Francisco, CA, USA, 2014. IEEE. doi: 10.1109/ISSCC.2014.6757323

work page doi:10.1109/isscc.2014.6757323 2014

[4] [4]

Efficient processing of deep neural networks: A tutorial and survey.Proceedings of the IEEE, 105(12):2295–2329, 2017

Vivienne Sze, Yu-Hsin Chen, Tien-Ju Yang, and Joel S Emer. Efficient processing of deep neural networks: A tutorial and survey.Proceedings of the IEEE, 105(12):2295–2329, 2017. doi: 10.1109/JPROC.2017.2761740

work page doi:10.1109/jproc.2017.2761740 2017

[5] [5]

Neural record- ing and stimulation using wireless networks of microimplants.Nature Electronics, 4(8): 604–614, 2021

Jihun Lee, Vincent Leung, Ah-Hyoung Lee, Jiannan Huang, Peter Asbeck, Patrick P Mercier, Stephen Shellhammer, Lawrence Larson, Farah Laiwalla, and Arto Nurmikko. Neural record- ing and stimulation using wireless networks of microimplants.Nature Electronics, 4(8): 604–614, 2021. doi: 10.1038/s41928-021-00631-8

work page doi:10.1038/s41928-021-00631-8 2021

[6] [6]

Neural Dust: An Ultrasonic, Low Power Solution for Chronic Brain-Machine Interfaces

Dongjin Seo, Jose M Carmena, Jan M Rabaey, Elad Alon, and Michel M Maharbiz. Neural dust: An ultrasonic, low power solution for chronic brain-machine interfaces, 2013. URL https://arxiv.org/abs/1307.2196. 10

work page internal anchor Pith review Pith/arXiv arXiv 2013

[7] [7]

An electronic neuromorphic system for real-time detection of high frequency oscillations (HFO) in intracranial EEG.Nature Communications, 12(1):3095, 2021

Mohammadali Sharifshazileh, Karla Burelo, Johannes Sarnthein, and Giacomo Indiveri. An electronic neuromorphic system for real-time detection of high frequency oscillations (HFO) in intracranial EEG.Nature Communications, 12(1):3095, 2021. doi: 10.1038/ s41467-021-23342-2

work page 2021

[8] [8]

Analog versus digital: Extrapolating from electronics to neurobiology

Rahul Sarpeshkar. Analog versus digital: Extrapolating from electronics to neurobiology. Neural Computation, 10(7):1601–1638, 1998. doi: 10.1162/089976698300017052

work page doi:10.1162/089976698300017052 1998

[9] [9]

Memory devices and applications for in-memory computing.Nature Nanotechnology, 15(7): 529–544, 2020

Abu Sebastian, Manuel Le Gallo, Riduan Khaddam-Aljameh, and Evangelos Eleftheriou. Memory devices and applications for in-memory computing.Nature Nanotechnology, 15(7): 529–544, 2020. doi: 10.1038/s41565-020-0655-z

work page doi:10.1038/s41565-020-0655-z 2020

[10] [10]

In-memory computing with resistive switching devices

Daniele Ielmini and H-S Philip Wong. In-memory computing with resistive switching devices. Nature Electronics, 1(6):333–343, 2018. doi: 10.1038/s41928-018-0092-2

work page doi:10.1038/s41928-018-0092-2 2018

[11] [11]

Attention is all you need

Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N Gomez, Łukasz Kaiser, and Illia Polosukhin. Attention is all you need. InAdvances in Neural Information Processing Systems, volume 30, Long Beach, CA, USA, 2017

work page 2017

[12] [12]

The end of transformers? On challenging attention and the rise of sub-quadratic architectures, 2025

Alexander M Fichtl, Jeremias Bohn, Josefin Kelber, Edoardo Mosca, and Georg Groh. The end of transformers? On challenging attention and the rise of sub-quadratic architectures, 2025. URLhttps://arxiv.org/abs/2510.05364

work page arXiv 2025

[13] [13]

Efficient transformers: A survey

Yi Tay, Mostafa Dehghani, Dara Bahri, and Donald Metzler. Efficient transformers: A survey. ACM Computing Surveys, 55(6):1–28, 2022. doi: 10.1145/3530811

work page doi:10.1145/3530811 2022

[14] [14]

Efficiently Modeling Long Sequences with Structured State Spaces

Albert Gu, Karan Goel, and Christopher Ré. Efficiently modeling long sequences with structured state spaces, 2022. URLhttps://arxiv.org/abs/2111.00396

work page internal anchor Pith review Pith/arXiv arXiv 2022

[15] [15]

Mamba: Linear-time sequence modeling with selective state spaces,

Albert Gu and Tri Dao. Mamba: Linear-time sequence modeling with selective state spaces,

work page

[16] [16]

URLhttps://arxiv.org/abs/2312.00752

work page internal anchor Pith review Pith/arXiv arXiv

[17] [17]

Transformers are SSMs: Generalized Models and Efficient Algorithms Through Structured State Space Duality

Tri Dao and Albert Gu. Transformers are SSMs: Generalized models and efficient algorithms through structured state space duality, 2024. URL https://arxiv.org/abs/2405.21060

work page internal anchor Pith review Pith/arXiv arXiv 2024

[18] [18]

Linear recurrent units for sequential recommendation

Zhenrui Yue, Yueqi Wang, Zhankui He, Huimin Zeng, Julian McAuley, and Dong Wang. Linear recurrent units for sequential recommendation. InProceedings of the 17th ACM International Conference on Web Search and Data Mining, pages 930–938, Merida, Mexico,

work page

[19] [19]

doi: 10.1145/3616855.3635760

work page doi:10.1145/3616855.3635760

[20] [20]

The Mamba in the Llama: Distilling and accelerating hybrid models

Junxiong Wang, Daniele Paliotta, Avner May, Alexander Rush, and Tri Dao. The Mamba in the Llama: Distilling and accelerating hybrid models. InAdvances in Neural Information Processing Systems, volume 37, pages 62432–62457, Vancouver, Canada, 2024

work page 2024

[21] [21]

On the parameterization and initialization of diagonal state space models

Albert Gu, Ankit Gupta, Karan Goel, and Christopher Ré. On the parameterization and initialization of diagonal state space models. InAdvances in Neural Information Processing Systems, volume 35, pages 35971–35983, New Orleans, LA, USA, 2022

work page 2022

[22] [22]

Fading memory and the problem of approximating nonlinear operators with Volterra series.IEEE Transactions on Circuits and Systems, 32(11):1150–1161,

Stephen Boyd and Leon Chua. Fading memory and the problem of approximating nonlinear operators with Volterra series.IEEE Transactions on Circuits and Systems, 32(11):1150–1161,

work page

[23] [23]

doi: 10.1109/TCS.1985.1085649

work page doi:10.1109/tcs.1985.1085649 1985

[24] [24]

A bio-inspired bistable recurrent cell allows for long-lasting memory.PLOS ONE, 16(6):e0252676, 2021

Nicolas Vecoven, Damien Ernst, and Guillaume Drion. A bio-inspired bistable recurrent cell allows for long-lasting memory.PLOS ONE, 16(6):e0252676, 2021. doi: 10.1371/journal. pone.0252676

work page doi:10.1371/journal 2021

[25] [25]

Warming up recurrent neural networks to maximise reachable multistability greatly improves learning.Neural Networks, 166:645–669, 2023

Gaspard Lambrechts, Florent De Geeter, Nicolas Vecoven, Damien Ernst, and Guillaume Drion. Warming up recurrent neural networks to maximise reachable multistability greatly improves learning.Neural Networks, 166:645–669, 2023. doi: 10.1016/j.neunet.2023.07.023

work page doi:10.1016/j.neunet.2023.07.023 2023

[26] [26]

Memory from the dynamics of intrinsic membrane currents.Proceedings of the National Academy of Sciences, 93(24):13481–13486, 1996

Eve Marder, L F Abbott, Gina G Turrigiano, Zhengyu Liu, and Jorge Golowasch. Memory from the dynamics of intrinsic membrane currents.Proceedings of the National Academy of Sciences, 93(24):13481–13486, 1996. doi: 10.1073/pnas.93.24.13481. 11

work page doi:10.1073/pnas.93.24.13481 1996

[27] [27]

Neural Computation 9, 1735–1780

Sepp Hochreiter and Jürgen Schmidhuber. Long short-term memory.Neural Computation, 9 (8):1735–1780, 1997. doi: 10.1162/neco.1997.9.8.1735

work page doi:10.1162/neco.1997.9.8.1735 1997

[28] [28]

Learning phrase representations using RNN encoder- decoder for statistical machine translation, 2014

Kyunghyun Cho, Bart Van Merriënboer, Caglar Gulcehre, Dzmitry Bahdanau, Fethi Bougares, Holger Schwenk, and Yoshua Bengio. Learning phrase representations using RNN encoder- decoder for statistical machine translation, 2014. URL https://arxiv.org/abs/1406. 1078

work page 2014

[29] [29]

Bengio, P

Yoshua Bengio, Patrice Simard, and Paolo Frasconi. Learning long-term dependencies with gradient descent is difficult.IEEE Transactions on Neural Networks, 5(2):157–166, 1994. doi: 10.1109/72.279181

work page doi:10.1109/72.279181 1994

[30] [30]

On the difficulty of training recurrent neural networks

Razvan Pascanu, Tomas Mikolov, and Yoshua Bengio. On the difficulty of training recurrent neural networks. InInternational Conference on Machine Learning, pages 1310–1318, Atlanta, GA, USA, 2013. PMLR

work page 2013

[31] [31]

Review of ASIC accelerators for deep neural network.Microprocessors and Microsystems, 89:104441, 2022

Raju Machupalli, Masum Hossain, and Mrinal Mandal. Review of ASIC accelerators for deep neural network.Microprocessors and Microsystems, 89:104441, 2022. doi: 10.1016/j.micpro. 2022.104441

work page doi:10.1016/j.micpro 2022

[32] [32]

Eyeriss v2: A flexible accelerator for emerging deep neural networks on mobile devices.IEEE Journal on Emerging and Selected Topics in Circuits and Systems, 9(2):292–308, 2019

Yu-Hsin Chen, Tien-Ju Yang, Joel Emer, and Vivienne Sze. Eyeriss v2: A flexible accelerator for emerging deep neural networks on mobile devices.IEEE Journal on Emerging and Selected Topics in Circuits and Systems, 9(2):292–308, 2019. doi: 10.1109/JETCAS.2019.2910232

work page doi:10.1109/jetcas.2019.2910232 2019

[33] [33]

Tiny machine learning and on-device inference: A survey of applications, challenges, and future directions.Sensors, 25(10):3191, 2025

Soroush Heydari and Qusay H Mahmoud. Tiny machine learning and on-device inference: A survey of applications, challenges, and future directions.Sensors, 25(10):3191, 2025. doi: 10.3390/s25103191

work page doi:10.3390/s25103191 2025

[34] [34]

MLPerf Tiny Benchmark

Colby Banbury, Vijay Janapa Reddi, Peter Torelli, Jeremy Holleman, Nat Jeffries, Csaba Kiraly, Pietro Montino, David Kanter, Sebastian Ahmed, Danilo Pau, et al. MLPerf tiny benchmark, 2021. URLhttps://arxiv.org/abs/2106.07597

work page Pith review arXiv 2021

[35] [35]

O’Reilly Media, 2019

Pete Warden and Daniel Situnayake.TinyML: Machine Learning with TensorFlow Lite on Arduino and Ultra-Low-Power Microcontrollers. O’Reilly Media, 2019

work page 2019

[36] [36]

MCUNet: Tiny deep learning on IoT devices

Ji Lin, Wei-Ming Chen, Yujun Lin, John Cohn, Chuang Gan, and Song Han. MCUNet: Tiny deep learning on IoT devices. InAdvances in Neural Information Processing Systems, volume 33, pages 11711–11722, Virtual, 2020

work page 2020

[37] [37]

SpArSe: Sparse architecture search for CNNs on resource-constrained microcontrollers

Igor Fedorov, Ryan P Adams, Matthew Mattina, and Paul N Whatmough. SpArSe: Sparse architecture search for CNNs on resource-constrained microcontrollers. InAdvances in Neural Information Processing Systems, volume 32, Vancouver, Canada, 2019

work page 2019

[38] [38]

Real-Time Com- puting Without Stable States: A New Framework for Neural Compu- tation Based on Perturbations

Wolfgang Maass, Thomas Natschläger, and Henry Markram. Real-time computing without stable states: A new framework for neural computation based on perturbations.Neural Computation, 14(11):2531–2560, 2002. doi: 10.1162/089976602760407955

work page doi:10.1162/089976602760407955 2002

[39] [39]

Loihi: A neuromorphic manycore processor with on-chip learning.IEEE Micro, 38(1):82–99, 2018

Mike Davies, Narayan Srinivasa, Tsung-Han Lin, Gautham Chinya, Yongqiang Cao, Sri Harsha Choday, Georgios Dimou, Prasad Joshi, Nabil Imam, Shweta Jain, et al. Loihi: A neuromorphic manycore processor with on-chip learning.IEEE Micro, 38(1):82–99, 2018. doi: 10.1109/ MM.2018.112130359

work page arXiv 2018

[40] [40]

Efficient neuromorphic signal processing with Loihi 2

Garrick Orchard, E Paxon Frady, Daniel Ben Dayan Rubin, Sophia Sanborn, Sumit Bam Shrestha, Friedrich T Sommer, and Mike Davies. Efficient neuromorphic signal processing with Loihi 2. InIEEE Workshop on Signal Processing Systems (SiPS), pages 254–259, Coimbra, Portugal, 2021. IEEE. doi: 10.1109/SiPS52927.2021.00053

work page doi:10.1109/sips52927.2021.00053 2021

[41] [41]

Neuromorphic electronic systems.Proceedings of the IEEE, 78(10):1629–1636,

Carver Mead. Neuromorphic electronic systems.Proceedings of the IEEE, 78(10):1629–1636,

work page

[42] [42]

doi: 10.1109/5.58356

work page doi:10.1109/5.58356

[43] [43]

Memristive crossbar arrays for brain-inspired computing

Qiangfei Xia and J Joshua Yang. Memristive crossbar arrays for brain-inspired computing. Nature Materials, 18(4):309–323, 2019. doi: 10.1038/s41563-019-0291-x. 12

work page doi:10.1038/s41563-019-0291-x 2019

[44] [44]

Training and operation of an integrated neuromorphic network based on metal-oxide memristors.Nature, 521(7550):61–64, 2015

Mirko Prezioso, Farnood Merrikh-Bayat, Brian D Hoskins, Gina C Adam, Konstantin K Likharev, and Dmitri B Strukov. Training and operation of an integrated neuromorphic network based on metal-oxide memristors.Nature, 521(7550):61–64, 2015. doi: 10.1038/nature14441

work page doi:10.1038/nature14441 2015

[45] [45]

Hardware implementation of deep network accelerators towards healthcare and biomedical applications.IEEE Transactions on Biomedical Circuits and Systems, 14(6):1138–1159, 2020

Mostafa Rahimi Azghadi, Corey Lammie, Jason K Eshraghian, Melika Payvand, Elisa Donati, Bernabe Linares-Barranco, and Giacomo Indiveri. Hardware implementation of deep network accelerators towards healthcare and biomedical applications.IEEE Transactions on Biomedical Circuits and Systems, 14(6):1138–1159, 2020. doi: 10.1109/TBCAS.2020.3036081

work page doi:10.1109/tbcas.2020.3036081 2020

[46] [46]

Long short-term memory networks in memristor crossbar arrays.Nature Machine Intelligence, 1(1):49–57, 2019

Can Li, Miao Hu, Yunning Li, Hao Jiang, Ning Ge, Eric Montgomery, Jiaming Zhang, Wenhao Song, Noraica Dávila, Catherine E Graves, et al. Long short-term memory networks in memristor crossbar arrays.Nature Machine Intelligence, 1(1):49–57, 2019. doi: 10.1038/ s42256-018-0001-4

work page 2019

[47] [47]

A review of computing with spiking neural networks.Computers, Materials & Continua, 78(3):2909, 2024

Jiadong Wu, Yinan Wang, Zhiwei Li, Lun Lu, and Qingjiang Li. A review of computing with spiking neural networks.Computers, Materials & Continua, 78(3):2909, 2024. doi: 10.32604/cmc.2024.047240

work page doi:10.32604/cmc.2024.047240 2024

[48] [48]

Towards spike-based machine intelligence with neuromorphic computing.Nature, 575(7784):607–617, 2019

Kaushik Roy, Akhilesh Jaiswal, and Priyadarshini Panda. Towards spike-based machine intelligence with neuromorphic computing.Nature, 575(7784):607–617, 2019. doi: 10.1038/ s41586-019-1677-2

work page 2019

[49] [49]

Real-time edge neuromorphic tasting from chemical microsensor arrays.Frontiers in Neuroscience, 15:771480, 2021

Nicholas LeBow, Bodo Rueckauer, Pengfei Sun, Meritxell Rovira, Cecilia Jiménez-Jorquera, Shih-Chii Liu, and Josep Maria Margarit-Taulé. Real-time edge neuromorphic tasting from chemical microsensor arrays.Frontiers in Neuroscience, 15:771480, 2021. doi: 10.3389/fnins. 2021.771480

work page doi:10.3389/fnins 2021

[50] [50]

Opportunities for neuromorphic computing algorithms and applications.Nature Computational Science, 2(1):10–19, 2022

Catherine D Schuman, Shruti R Kulkarni, Maryam Parsa, J Parker Mitchell, Bill Kay, and Prasanna Date. Opportunities for neuromorphic computing algorithms and applications.Nature Computational Science, 2(1):10–19, 2022. doi: 10.1038/s43588-021-00184-y

work page doi:10.1038/s43588-021-00184-y 2022

[51] [51]

Recurrent Neural Networks Hardware Implementation on FPGA

Andre Xian Ming Chang, Berin Martini, and Eugenio Culurciello. Recurrent neural networks hardware implementation on FPGA, 2015. URLhttps://arxiv.org/abs/1511.05552

work page internal anchor Pith review Pith/arXiv arXiv 2015

[52] [52]

Noise-mitigation strategies in physical feedforward neural networks.Chaos: An Interdisciplinary Journal of Nonlinear Science, 32(6):061106,

Nadezhda Semenova and Daniel Brunner. Noise-mitigation strategies in physical feedforward neural networks.Chaos: An Interdisciplinary Journal of Nonlinear Science, 32(6):061106,

work page

[53] [53]

doi: 10.1063/5.0096637

work page doi:10.1063/5.0096637

[54] [54]

Kendall, R

Jack Kendall, Ross Pantone, Kalpana Manickavasagam, Yoshua Bengio, and Benjamin Scellier. Training end-to-end analog neural networks with equilibrium propagation, 2020. URL https: //arxiv.org/abs/2006.01981

work page arXiv 2020

[55] [55]

MIT press, 2002

Shih-Chii Liu.Analog VLSI: Circuits and principles. MIT press, 2002

work page 2002

[56] [56]

AML100 - near-zero power analogml processor

Aspinity. AML100 - near-zero power analogml processor. https://www.aspinity.com/ aml100, 2022. Product information page, accessed: 2026-05-05

work page 2022

[57] [57]

NDP120 neural decision processor

Syntiant. NDP120 neural decision processor. https://www.syntiant.com/s/ Syntiant-Product_Brief_NDP120.pdf, 2021. Product brief, accessed: 2026-05-05

work page 2021

[58] [58]

Parallelizable memory recurrent units

Florent De Geeter, Gaspard Lambrechts, Damien Ernst, and Guillaume Drion. Parallelizable memory recurrent units, 2026. URLhttps://arxiv.org/abs/2601.09495

work page internal anchor Pith review Pith/arXiv arXiv 2026

[59] [59]

Pararnn: Unlocking parallel training of nonlinear rnns for large language models, 2025

Federico Danieli, Pau Rodríguez, Miguel Sarabia, Xavier Suau, and Luca Zappella. ParaRNN: Unlocking parallel training of nonlinear RNNs for large language models, 2025. URL https: //arxiv.org/abs/2510.21450

work page arXiv 2025

[60] [60]

Parallelizing linear recurrent neural nets over sequence length,

Eric Martin and Chris Cundy. Parallelizing linear recurrent neural nets over sequence length,

work page

[61] [61]

URLhttps://arxiv.org/abs/1709.04057

work page internal anchor Pith review Pith/arXiv arXiv

[62] [62]

Speech Commands: A Dataset for Limited-Vocabulary Speech Recognition

Pete Warden. Speech commands: A dataset for limited-vocabulary speech recognition, 2018. URLhttps://arxiv.org/abs/1804.03209. 13

work page internal anchor Pith review Pith/arXiv arXiv 2018

[63] [63]

Resurrecting recurrent neural networks for long sequences, 2023

Antonio Orvieto, Samuel L Smith, Albert Gu, Anushan Fernando, Caglar Gulcehre, Razvan Pascanu, and Soham De. Resurrecting recurrent neural networks for long sequences, 2023. URLhttps://arxiv.org/abs/2303.06349

work page arXiv 2023

[64] [64]

Were RNNs All We Needed?

Leo Feng, Frederick Tung, Mohamed Osama Ahmed, Yoshua Bengio, and Hossein Hajimir- sadeghi. Were RNNs all we needed?, 2024. URLhttps://arxiv.org/abs/2410.01201

work page arXiv 2024

[65] [65]

On the Importance of Multistability for Horizon Generalization in Reinforcement Learning

Asad Bakija, Florent De Geeter, Julien Brandoit, Pierre Sacré, and Guillaume Drion. On the importance of multistability for horizon generalization in reinforcement learning, 2026. URL https://arxiv.org/abs/2605.12206

work page internal anchor Pith review Pith/arXiv arXiv 2026

[66] [66]

Matching properties of MOS transistors.IEEE Journal of Solid-State Circuits, 24(5):1433–1439, 1989

Marcel J M Pelgrom, Aad C J Duinmaijer, and Anton P G Welbers. Matching properties of MOS transistors.IEEE Journal of Solid-State Circuits, 24(5):1433–1439, 1989. doi: 10.1109/JSSC.1989.572629

work page doi:10.1109/jssc.1989.572629 1989

[67] [67]

Device mismatch and tradeoffs in the design of analog circuits.IEEE Journal of Solid-State Circuits, 40(6):1212–1224, 2005

Peter R Kinget. Device mismatch and tradeoffs in the design of analog circuits.IEEE Journal of Solid-State Circuits, 40(6):1212–1224, 2005. doi: 10.1109/JSSC.2005.848021

work page doi:10.1109/jssc.2005.848021 2005

[68] [68]

Silicon diode temperature sensors—a review of applications.Sensors and Actuators A: Physical, 232:63–74, 2015

Mohtashim Mansoor, Ibraheem Haneef, Suhail Akhtar, Andrea De Luca, and Florin Udrea. Silicon diode temperature sensors—a review of applications.Sensors and Actuators A: Physical, 232:63–74, 2015. doi: 10.1016/j.sna.2015.04.022

work page doi:10.1016/j.sna.2015.04.022 2015

[69] [69]

Long range arena: A benchmark for efficient transformers.arXiv preprint arXiv:2011.04006,

Yi Tay, Mostafa Dehghani, Samira Abnar, Yikang Shen, Dara Bahri, Philip Pham, Jinfeng Rao, Liu Yang, Sebastian Ruder, and Donald Metzler. Long range arena: A benchmark for efficient transformers, 2021. URLhttps://arxiv.org/abs/2011.04006

work page arXiv 2021

[70] [70]

nanoGPT, 2022

Andrej Karpathy. nanoGPT, 2022. URL https://github.com/karpathy/nanoGPT. GitHub repository

work page 2022

[71] [71]

Digital twin: Enabling technologies, challenges and open research.IEEE Access, 8:108952–108971, 2020

Iván López-Espejo, Zheng-Hua Tan, John H L Hansen, and Jesper Jensen. Deep spoken keyword spotting: An overview.IEEE Access, 10:4169–4199, 2022. doi: 10.1109/ACCESS. 2021.3139508

work page doi:10.1109/access 2022

[72] [72]

The mnist database of handwritten digit images for machine learning research [best of the web].IEEE Signal Processing Magazine, 29(6):141–142, 2012

Geoffrey Hinton, Li Deng, Dong Yu, George E Dahl, Abdel-rahman Mohamed, Navdeep Jaitly, Andrew Senior, Vincent Vanhoucke, Patrick Nguyen, Tara N Sainath, et al. Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups.IEEE Signal Processing Magazine, 29(6):82–97, 2012. doi: 10.1109/MSP.2012. 2205597

work page doi:10.1109/msp.2012 2012

[73] [73]

Small-footprint keyword spotting using deep neural networks

Guoguo Chen, Carolina Parada, and Georg Heigold. Small-footprint keyword spotting using deep neural networks. InIEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pages 4087–4091, Florence, Italy, 2014. IEEE. doi: 10.1109/ICASSP. 2014.6854370

work page doi:10.1109/icassp 2014

[74] [74]

Convolutional Recurrent Neural Networks for Small-Footprint Keyword Spotting

Sercan O Arik, Markus Kliegl, Rewon Child, Joel Hestness, Andrew Gibiansky, Chris Fougner, Ryan Prenger, and Adam Coates. Convolutional recurrent neural networks for small-footprint keyword spotting, 2017. URLhttps://arxiv.org/abs/1703.05390

work page internal anchor Pith review Pith/arXiv arXiv 2017

[75] [75]

Steven Davis and Paul Mermelstein. Comparison of parametric representations for monosyl- labic word recognition in continuously spoken sentences.IEEE Transactions on Acoustics, Speech, and Signal Processing, 28(4):357–366, 1980. doi: 10.1109/TASSP.1980.1163420

work page doi:10.1109/tassp.1980.1163420 1980

[76] [76]

An 800 nW switched-capacitor feature extraction filterbank for sound classification.IEEE Transactions on Circuits and Systems I: Regular Papers, 68(4):1578–1588, 2021

Daniel Augusto Villamizar, Dante Gabriel Muratore, James B Wieser, and Boris Murmann. An 800 nW switched-capacitor feature extraction filterbank for sound classification.IEEE Transactions on Circuits and Systems I: Regular Papers, 68(4):1578–1588, 2021. doi: 10. 1109/tcsi.2020.3047035

work page arXiv 2021

[77] [77]

Heejin Yang, Ji-Hwan Seol, Rohit Rothe, Zichen Fan, Qirui Zhang, Hun-Seok Kim, David Blaauw, and Dennis Sylvester. A 1.5- µw fully-integrated keyword spotting SoC in 28-nm CMOS with skip-RNN and fast-settling analog frontend for adaptive frame skipping.IEEE Journal of Solid-State Circuits, 59(1):29–39, 2023. doi: 10.1109/jssc.2023.3316648

work page doi:10.1109/jssc.2023.3316648 2023

[78] [78]

Cadence Virtuoso Platform

Cadence Design Systems. Cadence Virtuoso Platform. https://www.cadence.com/en_ US/home/tools/custom-ic-analog-rf-design/virtuoso-studio.html, 2023. 14

work page 2023

[79] [79]

Oxford University Press, 3rd edition, 2011

Phillip E Allen and Douglas R Holberg.CMOS Analog Circuit Design. Oxford University Press, 3rd edition, 2011

work page 2011

[80] [80]

McGraw-Hill, 2001

Behzad Razavi.Design of Analog CMOS Integrated Circuits. McGraw-Hill, 2001

work page 2001