Trojan-Resilient NTT: Protecting Against Control Flow and Timing Faults on Reconfigurable Platforms
Pith reviewed 2026-05-16 09:31 UTC · model grok-4.3
The pith
A modified Number Theoretic Transform detects and corrects control-flow disruptions and timing faults from hardware Trojans in post-quantum cryptography.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The secure NTT architecture detects unconventional delays, control-flow disruptions, and soft analytical side-channel attacks induced by hardware Trojans on control signals, then applies adaptive fault correction to mitigate them, achieving high success rates in simulation and FPGA tests while keeping area and time overheads modest.
What carries the argument
Fault detection and adaptive correction modules integrated into the NTT pipeline that monitor control signals for disruptions and timing anomalies.
If this is right
- PQC implementations on reconfigurable hardware can resist both accidental faults and intentional Trojan attacks on control paths.
- The same detection approach applies across different Kyber parameter sets without major redesign.
- Modest overhead allows the secure NTT to be used in resource-constrained post-quantum devices.
Where Pith is reading between the lines
- The method could be adapted to protect other lattice-based primitives such as Dilithium or NTRU against similar control-flow Trojans.
- Real silicon deployments would need additional validation against Trojans that adapt to the detection logic itself.
- Layering this technique with existing data-path countermeasures would raise the bar for combined side-channel and fault attacks.
Load-bearing premise
The detection mechanisms can reliably identify Trojan-induced delays and control-flow changes without excessive false positives or missing adaptive attacks.
What would settle it
Insertion of a control-signal Trojan into the NTT on an Artix-7 FPGA followed by a run of Kyber that produces undetected or uncorrected faults in more than a small fraction of trials.
Figures
read the original abstract
Number Theoretic Transform (NTT) is the most essential component for polynomial multiplications used in lattice-based Post-Quantum Cryptography (PQC) algorithms such as Kyber, Dilithium, NTRU etc. However, side-channel attacks (SCA) and hardware vulnerabilities in the form of hardware Trojans may alter control signals to disrupt the circuit's control flow and introduce unconventional delays in the critical hardware of PQC. Hardware Trojans, especially on control signals, are more low cost and impactful than data signals because a single corrupted control signal can disrupt or bypass entire computation sequences, whereas data faults usually cause only localized errors. On the other hand, adversaries can perform Soft Analytical Side Channel Attacks (SASCA) on the design using the inserted hardware Trojan. In this paper, we present a secure NTT architecture capable of detecting unconventional delays, control-flow disruptions, and SASCA, while providing an adaptive fault-correction methodology for their mitigation. Extensive simulations and implementations of our Secure NTT on Artix-7 FPGA with different Kyber variants show that our fault detection and correction modules can efficiently detect and correct faults whether caused unintentionally or intentionally by hardware Trojans with a high success rate, while introducing only modest area and time overheads.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper proposes a Trojan-resilient NTT architecture for lattice-based PQC (e.g., Kyber variants) that detects and corrects control-flow disruptions, unconventional timing delays, and SASCA induced by hardware Trojans on control signals. The design incorporates dedicated detection modules and an adaptive fault-correction methodology, evaluated via simulations and Artix-7 FPGA implementations that reportedly achieve high success rates with modest area and latency overheads.
Significance. If the detection and correction mechanisms prove reliable, the work would address a practical vulnerability in hardware PQC accelerators where low-cost control-signal Trojans can bypass entire computations or enable SASCA. The emphasis on reconfigurable platforms and combined protection against both unintentional faults and intentional Trojans is relevant to ongoing PQC standardization and deployment. However, the absence of quantitative metrics, baseline comparisons, and adversarial testing currently limits the assessed impact.
major comments (3)
- [Abstract] Abstract: the claim of 'high success rate' from 'extensive simulations and implementations on Artix-7 FPGA' is unsupported by any numerical detection rates, false-positive rates, error rates, or baseline comparisons against unprotected NTT designs, preventing assessment of the modules' actual effectiveness.
- [Evaluation] Evaluation section: no explicit fault model, Trojan placement strategy, or test coverage is described, and there is no evaluation against adaptive adversaries that deliberately tune delays or control signals to remain within the detection thresholds, leaving the central resilience claim untested.
- [Proposed architecture] Proposed architecture: the detection thresholds for 'unconventional delays' and control-signal anomalies are introduced as free parameters without a parameter-free derivation or sensitivity analysis, which directly affects the claimed ability to distinguish Trojan-induced faults from normal timing variation.
minor comments (1)
- [Abstract] The abstract and introduction could include a brief comparison table of area/latency overheads versus prior NTT hardening techniques.
Simulated Author's Rebuttal
We thank the referee for the insightful comments. We address each major comment below and indicate the revisions we will make to strengthen the manuscript.
read point-by-point responses
-
Referee: [Abstract] Abstract: the claim of 'high success rate' from 'extensive simulations and implementations on Artix-7 FPGA' is unsupported by any numerical detection rates, false-positive rates, error rates, or baseline comparisons against unprotected NTT designs, preventing assessment of the modules' actual effectiveness.
Authors: We agree with this observation. While the evaluation section provides detailed numerical results on detection rates, false-positive rates, and overhead comparisons from our Artix-7 FPGA implementations, the abstract does not summarize them. We will revise the abstract to include key quantitative metrics supporting the high success rate claim and baseline comparisons. revision: yes
-
Referee: [Evaluation] Evaluation section: no explicit fault model, Trojan placement strategy, or test coverage is described, and there is no evaluation against adaptive adversaries that deliberately tune delays or control signals to remain within the detection thresholds, leaving the central resilience claim untested.
Authors: We will revise the evaluation section to include an explicit description of the fault model, Trojan placement strategy on control signals, and test coverage details from our simulations. Regarding adaptive adversaries, while we evaluated a range of fault scenarios, we did not specifically test against adversaries that tune parameters to evade detection thresholds. We will add a discussion of this limitation and include sensitivity analysis to address the resilience claim more thoroughly. revision: partial
-
Referee: [Proposed architecture] Proposed architecture: the detection thresholds for 'unconventional delays' and control-signal anomalies are introduced as free parameters without a parameter-free derivation or sensitivity analysis, which directly affects the claimed ability to distinguish Trojan-induced faults from normal timing variation.
Authors: We will add a sensitivity analysis for the detection thresholds in the proposed architecture section. The thresholds are set based on timing analysis to distinguish normal variations from faults, and we will provide the derivation method and show how performance varies with different settings. This will strengthen the distinction between Trojan-induced faults and normal timing variation. revision: yes
Circularity Check
No circularity: design validated by direct FPGA experiments
full rationale
The paper proposes a hardware architecture for Trojan-resilient NTT with fault detection and correction modules. Its central claims rest on explicit FPGA implementations and simulations on Artix-7 with Kyber variants that report measured success rates and overheads. No equations, fitted parameters, self-citations, or uniqueness theorems are invoked; the derivation chain consists of standard hardware design steps followed by empirical measurement rather than any reduction of outputs to inputs by construction.
Axiom & Free-Parameter Ledger
free parameters (1)
- delay and control-signal detection thresholds
axioms (1)
- domain assumption Hardware Trojans on control signals are low-cost and high-impact compared to data-path faults
Lean theorems connected to this paper
-
IndisputableMonolith/Foundation/RealityFromDistinction.leanreality_from_one_distinction unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
lightweight shift-register–based backup CSR... Clock Cycle Counter (CCC)... Local Mask (LM) Unit... risk factor (Ri) for each PR bitstream
-
IndisputableMonolith/Cost/FunctionalEquation.leanwashburn_uniqueness_aczel unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
adaptive fault-correction... three measures (Repeat Previous Loop, Reload PR Bit, Relocate PR Bit)
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
-
[1]
Reconfigurable and high-efficiency polynomial multiplication accelerator for crystals-kyber
Li et al. Reconfigurable and high-efficiency polynomial multiplication accelerator for crystals-kyber. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems , 42(8):2540–2551, 2023
work page 2023
-
[2]
Bin Li, Y unfei Y an, Y uanxin Wei, and Heru Han. Scal- able and parallel optimization of the number theoretic 11 TABLE III: Trojan Scenarios for NTT with Local Mask (LM) Unit , Control-Flow Integrity ( CF I) Checker, and Clock Cycle Counter ( CCC ) Trojan Location Trojan Type / Payload Insertion Phase Impact on NTT LM CFI CCC Remarks SASCA-style leak- ag...
-
[3]
/ side- channel [7], [8] Secret key/ plaintext leakage ✓ × × Input data of NTT is masked by ω r in LM Outside Upstream [23] Pre-mask tap (copies inputs before masking) RTL / synthesis Secret leakage (mask bypass) × × × Not possible as LM is placed inside NTT Clock glitch/ inser- tion, Reset spoof- ing, DoS, loss of state Post-deployment
-
[4]
/ fault injection Integrity fault, tim- ing errors × ∼ ∼ Trojan in the N T T main clk or rst introduces ambiguity in control flow and may cause delays, which can be detected by CSR and CCC respec- tively Data leak after mask removal (tap internal nets) RTL [10] / syn- thesis [11] Secret leakage inside datapath × × × If a hardware Trojan inside the NTT remo...
work page 2024
-
[5]
An extensive study of flexible design methods for the number theoretic trans- form
Ahmet Can Mert, Emre Karabulut, Erdinç Öztürk, Erkay Sava¸ s, and A ydin A ysu. An extensive study of flexible design methods for the number theoretic trans- form. IEEE Transactions on Computers , 71(11):2829– 2843, 2022
work page 2022
-
[6]
Single-trace side-channel attacks on masked lattice- based encryption
Robert Primas, Peter Pessl, and Stefan Mangard. Single-trace side-channel attacks on masked lattice- based encryption. In Cryptographic Hardware and Embedded Systems – CHES 2017 , pages 513–533, Cham, 2017. Springer International Publishing
work page 2017
-
[7]
A configurable crystals- kyber hardware implementation with side-channel pro- tection
Arpan Jati, Naina Gupta, Anupam Chattopadhyay, and Somitra Kumar Sanadhya. A configurable crystals- kyber hardware implementation with side-channel pro- tection. ACM Trans. Embed. Comput. Syst. , 23(2), March 2024. 12 Designs Platform # of Butterfly Unit SEC* BRAM Energy CCs Protection Prevent
work page 2024
-
[8]
Artix-7 FPGA 1 280 1 NR 570 Random Memory Access, Clock Cycle Count SASCA, Un- conventional
-
[9]
ARM Cortex M4 - - - NR 178 × 103 Local Masking, Random Memory Access SASCA
-
[10]
Cost(SEC)=LUTs × 0.25 + FFs × 0.25 + DSPs × 100 [27]
Artix-7 FPGA 4 1492 6 NR 237 Partial Local Masking SASCA Ours baseline Artix-7 FPGA 1 273 1 2991 nJ 1028 Nil Nil Ours protected (only detection) Artix-7 FPGA 1 297 (↑8.7%) 1 3021 nJ (↑1%) 1028 (0%) Back up & Independent CSR, Local Masking, Clock Cycle Counter CF I , Delays, SASCA Ours protected (detec- tion+correction) Artix-7 FPGA 1 327 (↑19.7%) 1 3094 n...
-
[11]
Xilinx Zynq- 7000 Secure crypto IP updates Crypto module bit- streams PR with Authenticated Encryp- tion with Associated Data (AEAD) authentication using PUF-derived keys ∼ 7–10% area increase for PR con- troller; negligible time overhead during normal operation
-
[12]
Xilinx Kintex-7 FFT processors (analogous to NTT) Faulty FFT cores due to SEUs PR to replace faulty cores (on- demand scrubbing) Recovery latency per core swap ≈ 20–50 ms; area overhead for spare logic ≈ 5–8%
-
[13]
Reliability evaluated as a function of reconfigu- ration time
Generic Xilinx FPGA Radiation-prone environments, e.g., space) Fault tolerance for FPGA-based systems under SEUs (Single Event Upsets) Fault detection with TMR + diag- nostic logic, partial dynamic recon- figuration of only the faulty module via a Reconfiguration Controller Simulation-based; no measured slice or energy overheads reported. Reliability evalua...
-
[14]
Generic FPGA Reconfiguration security (DPA attack countermeasures) PR controller logic PR with ReCoFuse container for formal violation detection 9% LUT increase, 6% FF increase re- ported; reconfiguration time 30 ms per region
-
[15]
Modern SRAM- based FPGAs Safety-critical dynamically & PR systems (e.g. space, avionics) Single and multiple errors caused by harsh conditions (SEUs) Self-repairing method using run- time partial and dynamic reconfig- uration at fine-grain granularity TMR consumes on average 3.64× more hardware resources than the original circuit
-
[16]
SRAM- based FPGAs FPGA lifetime extension under permanent SEE (space missions) Permanent config faults (SEEs) PR deleting faulty frames; scrub- bing only faulty frames Extends device lifetime significantly; reduced reconfiguration traffic vs full scrubbing (quantitative area/time over- head not detailed) Our Artix-7 NTT, Security Processor Faults in control s...
-
[17]
Side-channel hardware trojan for provably-secure sca-protected implementations
Samaneh Ghandali, Thorben Moos, Amir Moradi, and Christof Paar. Side-channel hardware trojan for provably-secure sca-protected implementations. IEEE Transactions on V ery Large Scale Integration (VLSI) Systems, 28(6):1435–1448, 2020
work page 2020
-
[18]
Prasanna Ravi, Romain Poussier, Shivam Bhasin, and Anupam Chattopadhyay. On configurable sca coun- termeasures against single trace attacks for the ntt: A performance evaluation study over kyber and dilithium on the arm cortex-m4. In Proceedings of the Inter- national Conference on Cryptographic Hardware and Embedded Systems , page 123–146, Berlin, Heidelberg,
-
[19]
Hardware implementa- tion and security analysis of local-masked NTT for CRYSTALS-kyber
Rafael Carrera Rodriguez, Emanuele V alea, Florent Bruguier, and Pascal Benoit. Hardware implementa- tion and security analysis of local-masked NTT for CRYSTALS-kyber. Cryptology ePrint Archive, Paper 2024/1194, 2024
work page 2024
-
[20]
Field programmable gate ar- ray market size & share analysis – growth trends & forecasts (2025–2030)
Mordor Intelligence. Field programmable gate ar- ray market size & share analysis – growth trends & forecasts (2025–2030). Mordor Intelligence Industry Report, 2025. Market size: USD 10.08 billion (2025), projected to expand to USD 16.23 billion by 2030 at 10.00% CAGR
work page 2025
-
[21]
Dtjrtl: A configurable frame- 13 work for automated hardware trojan insertion at rtl
Ruochen Dai, Zhaoxiang Liu, Orlando Arias, Xiaolong Guo, and Tuba Y avuz. Dtjrtl: A configurable frame- 13 work for automated hardware trojan insertion at rtl. In Proceedings of the Great Lakes Symposium on VLSI 2024, GLSVLSI ’24, page 465–470, New Y ork, NY , USA, 2024. Association for Computing Machinery
work page 2024
-
[22]
Rijoy Mukherjee, Archisman Ghosh, and Rajat Subhra Chakraborty. Hls-irt: Hardware trojan insertion through modification of intermediate representation during high-level synthesis. ACM Trans. Des. Autom. Electron. Syst., 29(5), September 2024
work page 2024
-
[23]
Fpga trojans through detecting and weakening of cryptographic primitives
Pawel Swierczynski, Marc Fyrbiak, Philipp Koppe, and Christof Paar. Fpga trojans through detecting and weakening of cryptographic primitives. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems , 34(8):1236–1249, 2015
work page 2015
-
[24]
Wu, Karthik Ganesan, Y unqing Alexander Hu, H.-S
Tony F. Wu, Karthik Ganesan, Y unqing Alexander Hu, H.-S. Philip Wong, Simon Wong, and Subhasish Mitra. Tpad: Hardware trojan prevention and detection for trusted integrated circuits. Trans. Comp.-Aided Des. Integ. Cir . Sys., 35(4):521–534, April 2016
work page 2016
-
[25]
Zhiming Zhang, Laurent Njilla, Charles A. Kamhoua, and Qiaoyan Y u. Thwarting security threats from malicious fpga tools with novel fpga-oriented moving target defense. IEEE Transactions on V ery Large Scale Integration (VLSI) Systems , 27(3):665–678, 2019
work page 2019
-
[26]
Cad-base: An attack vector into the electronics supply chain
Kanad Basu, Samah Mohamed Saeed, Christian Pilato, Mohammed Ashraf, Mohammed Thari Nabeel, Krish- nendu Chakrabarty, and Ramesh Karri. Cad-base: An attack vector into the electronics supply chain. ACM Trans. Des. Autom. Electron. Syst. , 24(4), April 2019
work page 2019
-
[27]
Ausmita et al., Sarker and Canto. Error detection ar- chitectures for hardware/software co-design approaches of number-theoretic transform. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, 42(7):2418–2422, 2023
work page 2023
-
[28]
Lightweight fault detection architecture for ntt on fpga, 2025
Rourab Paul, Paresh Baidya, and Krishnendu Guha. Lightweight fault detection architecture for ntt on fpga, 2025
work page 2025
-
[29]
Saeed Aghapour, Kasra Ahmadi, Mehran Mozaffari Kermani, and Reza Azarderakhsh. Partial recomputa- tion fault detection architecture for multiple-precision montgomery modular multiplication. IEEE Transac- tions on Computer-Aided Design of Integrated Circuits and Systems , pages 1–1, 2025
work page 2025
-
[30]
A fault-resistant NTT by polyno- mial evaluation and interpolation
Sven Bauer, Fabrizio De Santis, Kristjane Koleci, and Anita Aghaie. A fault-resistant NTT by polyno- mial evaluation and interpolation. Cryptology ePrint Archive, Paper 2024/788, 2024
work page 2024
-
[31]
Generic side-channel attacks on cca- secure lattice-based pke and kems
Prasanna et al. Generic side-channel attacks on cca- secure lattice-based pke and kems. IACR Transactions on Cryptographic Hardware and Embedded Systems , 2020(3):307–335, Jun. 2020
work page 2020
-
[32]
Generating efficient context-switch capable circuits through autonomous design flow
Alban Bourge, Olivier Muller, and Frédéric Rousseau. Generating efficient context-switch capable circuits through autonomous design flow. ACM Trans. Recon- figurable T echnol. Syst. , 10(1), December 2016
work page 2016
-
[33]
Vivado Design Suite User Guide: Partial Reconfiguration (UG909) , 2015
Xilinx, Inc. Vivado Design Suite User Guide: Partial Reconfiguration (UG909) , 2015. V ersion 2015.4; Xil- inx Vivado
work page 2015
-
[34]
Burin Amornpaisannon, Andreas Diavastos, Li-Shiuan Peh, and Trevor E. Carlson. Secure run-time hard- ware trojan detection using lightweight analytical models. Trans. Comp.-Aided Des. Integ. Cir . Sys. , 43(2):431–441, February 2024
work page 2024
- [35]
-
[36]
Hardware trojan detection using machine learning: A tutorial
Kevin Immanuel Gubbi, Banafsheh Saber Lati- bari, Anirudh Srikanth, Tyler Sheaves, Sayed Arash Beheshti-Shirazi, Sai Manoj PD, Satareh Rafatirad, Avesta Sasan, Houman Homayoun, and Soheil Salehi. Hardware trojan detection using machine learning: A tutorial. ACM Trans. Embed. Comput. Syst. , 22(3), April 2023
work page 2023
-
[37]
Post-configuration activation of hardware trojans in fpgas
Qazi Arbab Ahmed, Tobias Wiersema, and Marco Platzner. Post-configuration activation of hardware trojans in fpgas. Journal of Hardware and Systems Security, 8(2):79–93, 2024
work page 2024
-
[38]
Optimized schoolbook polynomial multipli- cation for compact lattice-based cryptography on fpga
Liu et al. Optimized schoolbook polynomial multipli- cation for compact lattice-based cryptography on fpga. IEEE Transactions on V ery Large Scale Integration (VLSI) Systems , 27(10):2459–2463, 2019
work page 2019
-
[39]
Secure update of fpga-based secure elements using partial reconfiguration
Manuel Unterstein, Arne Schäffers, Maksym Skorobo- gatiy, Anh Dang, Felix Stumpf, and Axel Poschmann. Secure update of fpga-based secure elements using partial reconfiguration. Technical Report 2020/833, IACR ePrint Archive, 2020
work page 2020
-
[40]
Dynamic partial reconfiguration scheme for fault-tolerant fft pro- cessor based on fpga
Xin Wei, Yi Z Xie, Y u Xie, and He Chen. Dynamic partial reconfiguration scheme for fault-tolerant fft pro- cessor based on fpga. The Journal of Engineering , 2019:7424–7427, 2019
work page 2019
-
[41]
Partial dynamic reconfiguration in an fpga-based fault-tolerant system: Simulation-based evaluation
Richard Panek, Jakub Lojda, Jakub Podivinsky, and Zdenek Kotasek. Partial dynamic reconfiguration in an fpga-based fault-tolerant system: Simulation-based evaluation. In 2018 IEEE East-W est Design & T est Symposium (EWDTS) , pages 1–6, 2018
work page 2018
-
[42]
Recofused partial reconfiguration for secure moving-target countermeasures on fpgas
Buse Ustaoglu, Kenneth Schmitz, Daniel Große, and Rolf Drechsler. Recofused partial reconfiguration for secure moving-target countermeasures on fpgas. SN Applied Sciences , 2:1363, 2020
work page 2020
-
[43]
An error-detection and self-repairing method for dy- namically and partially reconfigurable systems
Matteo Sonza Reorda, Luca Sterpone, and Anees Ullah. An error-detection and self-repairing method for dy- namically and partially reconfigurable systems. IEEE Transactions on Computers , 66(6):1022–1033, 2017
work page 2017
-
[44]
Optimal partial reconfiguration for permanent fault recovery on sram-based fpgas
Jose Luna, Luis Entrena, Mario Garcia-V alderas, and Celia Lopez-Ongil. Optimal partial reconfiguration for permanent fault recovery on sram-based fpgas. ISRN Electronics, 2013:Article ID 783673, 2013
work page 2013
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.