pith. sign in

arxiv: 1906.10817 · v1 · pith:FKTM7ZYPnew · submitted 2019-06-26 · 💻 cs.IT · cs.CR· cs.DC· math.IT

Coded State Machine -- Scaling State Machine Execution under Byzantine Faults

Pith reviewed 2026-05-25 15:38 UTC · model grok-4.3

classification 💻 cs.IT cs.CRcs.DCmath.IT
keywords Coded State MachineByzantine faultsState Machine ReplicationLagrange codingverifiable computationdistributed systemsinformation-theoretic security
0
0 comments X

The pith

Coded State Machine achieves linear scaling in storage, throughput, and security simultaneously with network size for state machine execution under Byzantine faults.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper introduces an information-theoretic framework called Coded State Machine to execute multiple state machines securely on networks containing some Byzantine nodes. Standard state machine replication provides security but scales poorly in efficiency. CSM encodes states and input commands using Lagrange coding so that each coded version occupies the same storage as its uncoded origin yet still supports recovery and verification. A delegation procedure called INTERMIX lets nodes offload matrix-vector multiplications to a single worker while a small set of auditors verify correctness information-theoretically. The result is simultaneous linear improvement in storage, speed, and fault tolerance as the network grows.

Core claim

We propose CSM, which achieves the optimal linear scaling in storage efficiency, throughput, and security simultaneously with the size of the network. The storage efficiency is scaled via the design of Lagrange coded states and coded input commands that require the same storage size as their origins. The computational efficiency is scaled using a novel delegation algorithm, called INTERMIX, which is an information-theoretically verifiable matrix-vector multiplication algorithm of independent interest. Using INTERMIX, the network nodes securely delegate their coding operations to a single worker node, and a small group of randomly selected auditor nodes verify its correctness, so that the per

What carries the argument

Lagrange coded states together with the INTERMIX verifiable delegation algorithm for matrix-vector multiplication.

If this is right

  • Storage per state machine stays constant rather than growing with replication factor.
  • Throughput grows linearly with the number of nodes while security level is preserved.
  • A single worker can perform the coding operations for the entire network with verification by a small random auditor set.
  • The same INTERMIX primitive can be reused for other verifiable linear computations beyond state-machine updates.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The same coding-plus-delegation pattern could be applied to other linear operations that dominate distributed ledgers or consensus protocols.
  • If the storage claim holds, existing replication-based systems could be replaced by CSM without increasing total memory footprint.
  • Real-world deployment would require checking whether the random auditor selection remains secure when the network size is moderate rather than asymptotic.

Load-bearing premise

Lagrange coded states and coded input commands occupy the same storage as their uncoded originals while still allowing secure recovery and verification in the presence of Byzantine faults.

What would settle it

A demonstration that recovering an original state or command from its Lagrange coded version requires strictly more than the original storage amount, or that the INTERMIX verification procedure fails to detect an incorrect result produced by a Byzantine worker.

Figures

Figures reproduced from arXiv: 1906.10817 by Mingchao Yu, Pramod Viswanath, Saeid Sahraei, Salman Avestimehr, Songze Li, Sreeram Kannan.

Figure 1
Figure 1. Figure 1: A block diagram illustration of main operations in [PITH_FULL_IMAGE:figures/full_fig_p002_1.png] view at source ↗
Figure 2
Figure 2. Figure 2: Illustration of operating K = 2 state machines over N = 3 nodes, one of which (node 2) is malicious. Malicious nodes can compromise the system security in the consensus phase, in the execution phase when honest nodes try to decode the computation results, and when delivering the decoded outputs to the clients. 2.1 Network and Failure Models We consider a fully connected network between the clients and the … view at source ↗
Figure 3
Figure 3. Figure 3: Illustration of the key components of Coded State M [PITH_FULL_IMAGE:figures/full_fig_p007_3.png] view at source ↗
Figure 4
Figure 4. Figure 4: We note that the correctness of the results in this s [PITH_FULL_IMAGE:figures/full_fig_p009_4.png] view at source ↗
Figure 4
Figure 4. Figure 4: A block-diagram of the proposed centralized compu [PITH_FULL_IMAGE:figures/full_fig_p010_4.png] view at source ↗
Figure 5
Figure 5. Figure 5: Illustration of INTERMIX for verifiable matrix vec [PITH_FULL_IMAGE:figures/full_fig_p010_5.png] view at source ↗
read the original abstract

We introduce an information-theoretic framework, named Coded State Machine (CSM), to securely and efficiently execute multiple state machines on untrusted network nodes, some of which are Byzantine. The standard method of solving this problem is using State Machine Replication, which achieves high security at the cost of low efficiency. We propose CSM, which achieves the optimal linear scaling in storage efficiency, throughput, and security simultaneously with the size of the network. The storage efficiency is scaled via the design of Lagrange coded states and coded input commands that require the same storage size as their origins. The computational efficiency is scaled using a novel delegation algorithm, called INTERMIX, which is an information-theoretically verifiable matrix-vector multiplication algorithm of independent interest. Using INTERMIX, the network nodes securely delegate their coding operations to a single worker node, and a small group of randomly selected auditor nodes verify its correctness, so that computational efficiency can scale almost linearly with the network size, without compromising on security.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 0 minor

Summary. The manuscript introduces the Coded State Machine (CSM) framework for securely executing multiple state machines across untrusted nodes subject to Byzantine faults. It claims to achieve optimal linear scaling simultaneously in storage efficiency, throughput, and security with network size, via Lagrange coded states and coded input commands that match the storage size of their uncoded origins, together with the INTERMIX algorithm for information-theoretically verifiable delegation of matrix-vector multiplications.

Significance. If the central constructions hold, the result would be significant for distributed systems and coding theory, as it proposes a route to linear scaling in all three metrics without the usual redundancy overhead of classical state-machine replication. The introduction of INTERMIX as a standalone verifiable computation primitive is noted as potentially of independent interest.

major comments (2)
  1. [Abstract] Abstract: The headline claim of linear scaling in storage efficiency rests on the assertion that 'Lagrange coded states and coded input commands that require the same storage size as their origins' while still enabling information-theoretic recovery and verification under Byzantine faults. Standard information-theoretic Byzantine tolerance requires explicit redundancy; the manuscript must supply the explicit encoding, recovery threshold, and security proof showing that dimension or metadata overhead is avoided.
  2. [Abstract] Abstract: The subsequent claim that INTERMIX enables 'computational efficiency [to] scale almost linearly with the network size, without compromising on security' by delegating to a single worker and a small auditor group is load-bearing for the throughput claim. The security reduction from the matrix-vector verification to the overall CSM security must be stated explicitly.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the careful reading and constructive comments on the abstract. We address the points below and will revise the manuscript to make the claims more explicit while preserving the technical content.

read point-by-point responses
  1. Referee: [Abstract] Abstract: The headline claim of linear scaling in storage efficiency rests on the assertion that 'Lagrange coded states and coded input commands that require the same storage size as their origins' while still enabling information-theoretic recovery and verification under Byzantine faults. Standard information-theoretic Byzantine tolerance requires explicit redundancy; the manuscript must supply the explicit encoding, recovery threshold, and security proof showing that dimension or metadata overhead is avoided.

    Authors: The full paper defines the Lagrange coded state as the evaluation of a polynomial of degree at most m-1 (m state machines) at a distinct point per node; each coded state is therefore a single vector of the same dimension as an uncoded state, with no auxiliary metadata. Recovery succeeds from any set of at least m + f + 1 honest nodes (f Byzantine), which is the standard threshold for information-theoretic Byzantine tolerance under this coding. The security argument appears in the main body via the properties of Reed-Solomon-like codes over a sufficiently large field. We will add a single sentence to the abstract that states the recovery threshold and points to the encoding definition and proof. revision: yes

  2. Referee: [Abstract] Abstract: The subsequent claim that INTERMIX enables 'computational efficiency [to] scale almost linearly with the network size, without compromising on security' by delegating to a single worker and a small auditor group is load-bearing for the throughput claim. The security reduction from the matrix-vector verification to the overall CSM security must be stated explicitly.

    Authors: INTERMIX provides an information-theoretic soundness guarantee for each matrix-vector multiplication: any incorrect result is detected by the auditor set except with probability exponentially small in the field size. Because every state update in CSM is realized by such a multiplication, the per-operation soundness composes directly with the existing Byzantine tolerance of the coded states; the random selection of auditors ensures that the probability of an undetected fault remains negligible over polynomially many steps. We will insert an explicit one-paragraph reduction statement in both the abstract and the security section of the revision. revision: yes

Circularity Check

0 steps flagged

No circularity; construction-based claims are self-contained

full rationale

The paper introduces CSM as a novel framework whose linear scaling claims follow directly from the explicit design of Lagrange coded states (defined to match original storage size) and the INTERMIX verifiable delegation algorithm. No load-bearing step reduces a claimed result to a fitted parameter, self-citation chain, or input by construction; the storage and verification properties are asserted as properties of the new coding scheme itself rather than derived from prior fitted quantities. The derivation is therefore self-contained.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 2 invented entities

Only the abstract is available, so the ledger is limited to concepts explicitly named. No numerical free parameters are stated. Standard information-theoretic assumptions for Byzantine tolerance are implicit. Two new entities are introduced without external evidence.

axioms (1)
  • domain assumption Information-theoretic security model for Byzantine faults
    Invoked when the abstract claims security is preserved while scaling.
invented entities (2)
  • Coded State Machine (CSM) no independent evidence
    purpose: Framework for efficient secure state-machine execution
    Newly named construction in the abstract
  • INTERMIX no independent evidence
    purpose: Information-theoretically verifiable matrix-vector multiplication for delegation
    Novel algorithm introduced in the abstract

pith-pipeline@v0.9.0 · 5724 in / 1277 out tokens · 28042 ms · 2026-05-25T15:38:26.289941+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

52 extracted references · 52 canonical work pages · 1 internal anchor

  1. [1]

    K., Janakiraman, R., and Xu, L

    Aguilera, M. K., Janakiraman, R., and Xu, L. Using erasure codes efficiently for storage in a distributed system. In 2005 International Conference on Dependable Systems and Networks (DSN’05) (2005), IEEE, pp. 336–345

  2. [2]

    A., and Day, J

    Alsberg, P. A., and Day, J. D. A principle for resilient sharing of distributed resources . In Proceedings of the 2nd international conference on Softwar e engineering (1976), IEEE Com- puter Society Press, pp. 562–570

  3. [3]

    Balasubramanian, B., and Garg, V. K. Fault tolerance in distributed systems using fused data structures. IEEE transactions on parallel and distributed systems 24 , 4 (2013), 701–715

  4. [4]

    Balasubramanian, B., and Garg, V. K. Fault tolerance in distributed systems using fused state machines. Distributed computing 27 , 4 (2014), 287–311

  5. [5]

    Scalable, transparent, and post-quantum secure computational integrity

    Ben-Sasson, E., Bentov, I., Horesh, Y., and Riabzev, M. Scalable, transparent, and post-quantum secure computational integrity. Cryptol. ePrint Arch., Tech. Rep 46 (2018), 2018

  6. [6]

    Succinct non-interactive zero knowledge for a von neumann architecture

    Ben-Sasson, E., Chiesa, A., Tromer, E., and Virza, M. Succinct non-interactive zero knowledge for a von neumann architecture. In USENIX Security Symposium (2014), pp. 781– 796

  7. [7]

    From extractable collision resistance to succinct non-interactive arguments of knowl edge, and back again

    Bitansky, N., Canetti, R., Chiesa, A., and Tromer, E. From extractable collision resistance to succinct non-interactive arguments of knowl edge, and back again. In Proceedings of the 3rd Innovations in Theoretical Computer Science Conf erence (2012), ACM, pp. 326–349

  8. [8]

    Optimal resilience for erasure-coded byzantine distribut ed storage

    Cachin, C., and Tessaro, S. Optimal resilience for erasure-coded byzantine distribut ed storage. In International Conference on Dependable Systems and Networ ks (DSN’06) (2006), IEEE, pp. 115–124

  9. [9]

    R., W ang, Z., and Lynch, N

    Cadambe, V. R., W ang, Z., and Lynch, N. Information-theoretic lower bounds on the storage cost of shared memory emulation. In Proceedings of the 2016 ACM Symposium on Principles of Distributed Computing (2016), ACM, pp. 305–313

  10. [10]

    Practical byzantine fault tolerance and proactive recover y

    Castro, M., and Liskov, B. Practical byzantine fault tolerance and proactive recover y. ACM Transactions on Computer Systems (TOCS) 20 , 4 (2002), 398–461

  11. [11]

    Practical byzantine fault tolerance

    Castro, M., Liskov, B., et al. Practical byzantine fault tolerance. In OSDI (1999), vol. 99, pp. 173–186

  12. [12]

    G., Ramchandran, K., Wu, Y., and Suh, C

    Dimakis, A. G., Ramchandran, K., Wu, Y., and Suh, C. A survey on network codes for distributed storage. Proceedings of the IEEE 99 , 3 (2011), 476–489

  13. [13]

    Pow- erstore: proofs of writing for efficient and robust storage

    Dobre, D., Karame, G., Li, W., Majuntke, M., Suri, N., and Vuk olić, M. Pow- erstore: proofs of writing for efficient and robust storage. I n Proceedings of the 2013 ACM SIGSAC conference on Computer & communications security (2013), ACM, pp. 285–298

  14. [14]

    A verifiable random function with short proofs and keys

    Dodis, Y., and Yampolskiy, A. A verifiable random function with short proofs and keys. In International Workshop on Public Key Cryptography (2005), Springer, pp. 416–431. 16

  15. [15]

    On the minimal synchronism needed for distributed consensus

    Dolev, D., Dwork, C., and Stockmeyer, L. On the minimal synchronism needed for distributed consensus. Journal of the ACM (JACM) 34 , 1 (1987), 77–97

  16. [16]

    Short-dot: Computing large linear transforms distributedly using coded short dot products

    Dutta, S., Cadambe, V., and Grover, P. Short-dot: Computing large linear transforms distributedly using coded short dot products. In NIPS (2016), pp. 2100–2108

  17. [17]

    Consensus in the presence of partial syn- chrony

    Dwork, C., Lynch, N., and Stockmeyer, L. Consensus in the presence of partial syn- chrony. Journal of the ACM (JACM) 35 , 2 (1988), 288–323

  18. [18]

    Efficient techniques for publicly verifiable delegation of computation

    Elkhiyaoui, K., Önen, M., Azraoui, M., and Mol v a, R. Efficient techniques for publicly verifiable delegation of computation. In Proceedings of the 11th ACM on Asia Conference on Computer and Communications Security (2016), ACM, pp. 119–128

  19. [19]

    Publicly verifiable delegation of large polynomials and mat rix computations, with applications

    Fiore, D., and Gennaro, R. Publicly verifiable delegation of large polynomials and mat rix computations, with applications. In Proceedings of the 2012 ACM conference on Computer and communications security (2012), ACM, pp. 501–512

  20. [20]

    Garg, V. K. Implementing fault-tolerant services using state machine s: Beyond replication. In International Symposium on Distributed Computing (2010), Springer, pp. 450–464

  21. [21]

    Non-interactive verifiable computing: Out- sourcing computation to untrusted workers

    Gennaro, R., Gentry, C., and Parno, B. Non-interactive verifiable computing: Out- sourcing computation to untrusted workers. In Annual Cryptology Conference (2010), Springer, pp. 465–482

  22. [22]

    Algebraic Structure Theory of Sequential Machines (Prenti ce-Hall Interna- tional Series in Applied Mathematics)

    Hartmanis, J. Algebraic Structure Theory of Sequential Machines (Prenti ce-Hall Interna- tional Series in Applied Mathematics) . Prentice-Hall, Inc., Upper Saddle River, NJ, USA, 1966

  23. [23]

    R., and Reiter, M

    Hendricks, J., Ganger, G. R., and Reiter, M. K. Low-overhead byzantine fault-tolerant storage. In ACM SIGOPS Operating Systems Review (2007), vol. 41, ACM, pp. 73–86

  24. [24]

    S., and Umans, C

    Kedlaya, K. S., and Umans, C. Fast polynomial factorization and modular composition. SIAM Journal on Computing 40 , 6 (2011), 1767–1802

  25. [25]

    Omniledger: A secure, scale-out, decentralized ledger

    Kokoris-Kogias, E., Jov anovic, P., Gasser, L., Gailly, N., and Ford, B. Omniledger: A secure, scale-out, decentralized ledger. IACR Cryptology ePrint Archive 2017 (2017), 406

  26. [26]

    The implementation of reliable distributed multiprocess s ystems

    Lamport, L. The implementation of reliable distributed multiprocess s ystems. Computer networks 2 , 2 (1978), 95–114

  27. [27]

    The part-time parliament

    Lamport, L. The part-time parliament. ACM Transactions on Computer Systems (TOCS) 16, 2 (1998), 133–169

  28. [28]

    The byzantine generals problem

    Lamport, L., Shostak, R., and Pease, M. The byzantine generals problem. ACM Trans- actions on Programming Languages and Systems (TOPLAS) 4 , 3 (1982), 382–401

  29. [29]

    Closed partition lattice and machine decomposition

    Lee, D., and Yannakakis, M. Closed partition lattice and machine decomposition. IEEE Transactions on Computers 51 , 2 (2002), 216–228

  30. [30]

    Speed- ing up distributed machine learning using codes

    Lee, K., Lam, M., Pedarsani, R., Papailiopoulos, D., and Ram chandran, K. Speed- ing up distributed machine learning using codes. IEEE Transactions on Information Theory 64, 3 (2018), 1514–1529. 17

  31. [31]

    A., and A vestimehr, A

    Li, S., Maddah-Ali, M. A., and A vestimehr, A. S. A unified coding framework for distributed computing with straggling servers. IEEE Workshop on Network Coding and Ap- plications (Sept. 2016)

  32. [32]

    A., Yu, Q., and A vestimehr, A

    Li, S., Maddah-Ali, M. A., Yu, Q., and A vestimehr, A. S. A fundamental tradeoff between computation and communication in distributed comp uting. IEEE Transactions on Information Theory 64 , 1 (Jan. 2018)

  33. [33]

    A secure sharding protocol for open blockchains

    Luu, L., Narayanan, V., Zheng, C., Ba weja, K., Gilbert, S., a nd Saxena, P. A secure sharding protocol for open blockchains. In Proceedings of the 2016 ACM SIGSAC Conference on Computer and Communications Security (2016), ACM, pp. 17–30

  34. [34]

    J., and Sloane, N

    MacWilliams, F. J., and Sloane, N. J. A. The theory of error-correcting codes . Elsevier, 1977

  35. [35]

    Verifiable random functions

    Micali, S., Rabin, M., and V adhan, S. Verifiable random functions. In 40th Annual Sympo- sium on Foundations of Computer Science (Cat. No. 99CB37039 ) (1999), IEEE, pp. 120–130

  36. [36]

    A simple fault tolerant distributed hash table

    Naor, M., and Wieder, U. A simple fault tolerant distributed hash table. In International Workshop on Peer-to-Peer Systems (2003), Springer, pp. 88–97

  37. [37]

    M., and Liskov, B

    Oki, B. M., and Liskov, B. H. Viewstamped replication: A new primary copy method to support highly-available distributed systems. In Proceedings of the seventh annual ACM Symposium on Principles of distributed computing (1988), ACM, pp. 8–17

  38. [38]

    Pinocchio: Nearly practical verifiable computation

    Parno, B., Howell, J., Gentry, C., and Raykov a, M. Pinocchio: Nearly practical verifiable computation. Communications of the ACM 59 , 2 (2016), 103–112

  39. [39]

    On a decentralized trustless pseudo-random number generat ion algorithm

    Popov, S. On a decentralized trustless pseudo-random number generat ion algorithm. Journal of Mathematical Cryptology 11 , 1 (2017), 37–43

  40. [40]

    V., Shah, N

    Rashmi, K. V., Shah, N. B., and Kumar, P. V. Optimal exact-regenerating codes for distributed storage at the msr and mbr points via a product-m atrix construction. IEEE Transactions on Information Theory 57 , 8 (2011), 5227–5239

  41. [41]

    Introduction to coding theory

    Roth, R. Introduction to coding theory . Cambridge University Press, 2006

  42. [42]

    Sahraei, S., and A vestimehr, A. S. Interpol: Information theoretically verifiable polyno- mial evaluation. arXiv preprint arXiv:1901.03379 (2019)

  43. [43]

    Schneider, F. B. Implementing fault-tolerant services using the state mach ine approach: A tutorial. ACM Computing Surveys (CSUR) 22 , 4 (1990), 299–319

  44. [44]

    Unsolved problems in sharding

    Skidanov, A. Unsolved problems in sharding. https: // medium. com/ nearprotocol/ unsolved-problems-in-blockchain-sharding-2327d6517f43

  45. [45]

    K., Gailly, N., Gasser, L ., Khoffi, I., Fischer, M

    Syta, E., Jov anovic, P., Kogias, E. K., Gailly, N., Gasser, L ., Khoffi, I., Fischer, M. J., and Ford, B. Scalable bias-resistant distributed randomness. In IEEE Symposium on Security and Privacy (SP) (2017), IEEE, pp. 444–460

  46. [46]

    G., and Karampatziakis, N

    Tandon, R., Lei, Q., Dimakis, A. G., and Karampatziakis, N. Gradient coding: A void- ing stragglers in distributed learning. In Proceedings of the 34th International Conference on Machine Learning (Aug. 2017), pp. 3368–3376. 18

  47. [47]

    W ang, Z., and Cadambe, V. R. Multi-version coding—an information-theoretic perspect ive of consistent distributed storage. IEEE Transactions on Information Theory 64 , 6 (2018), 4540–4561

  48. [48]

    Yu, Q., Li, S., Ra viv, N., Kalan, S. M. M., Soltanolkotabi, M. , and A vestimehr, A. S. Lagrange coded computing: Optimal design for resiliency, s ecurity, and privacy. In NIPS Systems for ML Workshop (2018)

  49. [49]

    A., and A vestimehr, A

    Yu, Q., Maddah-Ali, M. A., and A vestimehr, A. S. Polynomial codes: an optimal design for high-dimensional coded matrix multiplication. In NIPS (2017), pp. 4406–4416

  50. [50]

    New publicly verifiable computation for batch matrix multiplication

    Zhang, X., Jiang, T., Li, K.-C., Castiglione, A., and Chen, X . New publicly verifiable computation for batch matrix multiplication. Information Sciences (2017)

  51. [51]

    Efficient secure and verifiable outsourcing of matrix multipl i- cations

    Zhang, Y., and Blanton, M. Efficient secure and verifiable outsourcing of matrix multipl i- cations. In International Conference on Information Security (2014), Springer, pp. 158–178

  52. [52]

    Zou, Y. M. Representing boolean functions using polynomials: more ca n offer less. In International Symposium on Neural Networks (2011), Springer, pp. 290–296. A Field Extension for General Boolean Functions Using the construction of [52, Theorem 2], we can represent a ny arbitrary Boolean function f : {0, 1}n → { 0, 1} whose inputs are n binary variables ...