Explicit Fuzzy Logic in the Feed-Forward Layer: Self-Forgetting Quantifiers Discover Legible Grammatical-Licensing Detectors

Mark Oskin

arxiv: 2606.31845 · v1 · pith:JRMD2534new · submitted 2026-06-30 · 💻 cs.CL · cs.LG

Explicit Fuzzy Logic in the Feed-Forward Layer: Self-Forgetting Quantifiers Discover Legible Grammatical-Licensing Detectors

Mark Oskin This is my paper

Pith reviewed 2026-07-01 05:41 UTC · model grok-4.3

classification 💻 cs.CL cs.LG

keywords feed-forward layerfuzzy logicgrammatical licensingtransformer interpretabilitysequence quantifiersnegationlanguage modeling

0 comments

The pith

Explicit fuzzy logic and self-forgetting quantifiers in the feed-forward layer turn units into legible grammatical licensing detectors.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper replaces the standard transformer feed-forward sublayer with a negation-capable version built from fuzzy intersection and set-difference operations on [0,1] memberships. It adds a small set of soft existential and proportion quantifiers, each carrying its own learned forgetting rate started from a sticky initialization. At 125M scale this keeps perplexity identical to the GELU baseline while fixing an early grammatical deficit in licensing and quantifiers. The resulting units fire on licensors such as negatives or comparatives and carry the memory forward to predict the licensed continuation, all without any post-hoc dictionary learning.

Core claim

A parameter-neutral NC-FFN using intersection A*B and bounded negation A*(1-B), augmented by soft sequence quantifiers with per-unit forgetting rates, recovers the licensing deficit at epoch one, lets logical structure migrate into depth, and produces units that read directly as grammatical licensing detectors while matching baseline language-model quality.

What carries the argument

The NC-FFN (negation-capable feed-forward network) of explicit fuzzy set operations combined with soft existential and soft proportion quantifiers each equipped with a learned per-unit forgetting rate from sticky initialization.

If this is right

Grammatical structure migrates from layer zero into deeper semantic layers.
Units become readable as licensing detectors for comparatives, passives, and negative-polarity items without auxiliary analysis.
The wider performance gap at epoch two is halved while LAMBADA scores modestly improve.
A fully Boolean version of the same layer diverges, showing that the fuzzy partition is required for stable training.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same quantifier-plus-forgetting construction could be tested on other long-range syntactic dependencies such as agreement or binding.
Building explicit logical forms into the architecture may reduce reliance on post-training interpretability methods across other sequence tasks.
The observed median forgetting half-life of roughly 1.5 tokens suggests a natural mechanism for controlling memory span in deeper layers.

Load-bearing premise

Two-operand logic stays localized at layer zero and erodes during training, and adding sequence quantifiers with forgetting rates will keep the logic intact, move it deeper, and preserve overall training dynamics.

What would settle it

If the added quantifiers fail to produce units that selectively activate on licensors and improve prediction of the corresponding licensed words, or if the model diverges when the quantifiers are removed, the central claim does not hold.

Figures

Figures reproduced from arXiv: 2606.31845 by Mark Oskin.

**Figure 2.** Figure 2: The logical content is task-shaped. Left: on a product-rewarding task (parity), the fraction of units doing two-operand logic snaps from ≈ 0 to ≈ 0.48 at the grok step (grok-aligned mean over 15 runs, band = s.d.). Right: under language-model training, the layer-0 two-operand signal erodes monotonically across two epochs. Same Boolean block, same probe, opposite trajectories. output costs 148× the loss of … view at source ↗

**Figure 3.** Figure 3: A·B intersection magnitude per layer across epoch-1 training (+decay+gate). Unlike NC-FFN’s L0 erosion, the intersection signal migrates out of layer 0 into the deep layers rather than collapsing—genuine two-operand logic is held and pushed into depth [PITH_FULL_IMAGE:figures/full_fig_p018_3.png] view at source ↗

**Figure 4.** Figure 4: Learned existential (∃) token half-life per layer (median ∼1.5 tokens). From a sticky γ0 ≈ 0.99 initialization, every unit learned a short, local memory; zero units remain near-permanent [PITH_FULL_IMAGE:figures/full_fig_p019_4.png] view at source ↗

**Figure 5.** Figure 5: Four grammatical licensing detectors (layer 9), each a predictive operator. Membership fires crisply on the grammatical licensor (left spike), the ∼1.5-token existential decay carries the signal forward over the licensing window, and the unit writes the licensed function word it predicts. None fires on the function word it names. token’s representation, and intersection and set-difference are propositional… view at source ↗

read the original abstract

A transformer's feed-forward (FFN) sublayer materializes the distinctions attention gathers, yet gives no account of what it computes. In a parameter-neutral replacement, each hidden unit is an explicit fuzzy set operation on sigmoid-bounded [0,1] memberships: intersection A*B and set-difference A*(1-B), the latter a bounded positive negation ("A but not B") that gated/bilinear units lack -- a negation-capable FFN (NC-FFN). On N-bit parity they are the most parameter-efficient reasoning basis at shallow depth; at scale (125M, OpenWebText) NC-FFN ties the GELU baseline's perplexity, every unit carrying explicit logical form. Two limits share one cause: two-operand logic localizes to layer 0 and erodes under training, and the one robust grammatical deficit concentrates in licensing and quantifiers, beyond within-token operators. We resolve both with a small block of sequence quantifiers: a soft existential and a soft proportion, each with a per-unit learned forgetting rate from a sticky init. This recovers the deficit at epoch one (halving the wider epoch-two gap), modestly leads on LAMBADA, and makes the FFN legible: the structure now holds and migrates into depth; the decay un-learns its stickiness (median half-life ~1.5 tokens; zero latch units); and at the semantic layers the units read, without dictionary learning, as grammatical licensing detectors: each fires on a licensor (a comparative, a passive participle, a negative-polarity item) and carries its memory forward to predict the licensed word (than, by, nor). This legibility is localized and free only up to a partition (a fully Boolean FFN diverges in training), but the result is a parameter-neutral, language-model-quality transformer with a readable, interpretable-by-construction grammatical mechanism -- an account not just of what a feed-forward layer represents but how it licenses.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

The paper builds an explicit fuzzy-logic FFN with negation and adds per-unit forgetting quantifiers so that grammatical licensing detectors emerge without post-hoc analysis, while matching baseline perplexity.

read the letter

The core move is replacing the standard FFN with a negation-capable version that uses bounded fuzzy intersection and set-difference on [0,1] values, then layering in soft existential and proportion quantifiers that each carry a learned forgetting rate. This combination is meant to stop two-operand logic from collapsing to layer 0, let structure move deeper, and produce units that fire on licensors and carry the signal forward to the licensed token.

The approach is new in tying the fuzzy construction directly to sequence-level quantifiers with sticky initialization and decay. On the parity task it is parameter-efficient at shallow depth, and at 125M scale it reportedly ties GELU perplexity while making the FFN units readable as licensing detectors. The early recovery of the grammatical deficit and the modest LAMBADA lead are the concrete payoffs claimed.

The main limitation visible from the abstract is that every performance and legibility claim rests on results that are not shown here. Without the equations, training curves, ablation tables, or exact definition of the quantifier block, it is impossible to tell whether the forgetting rates add effective degrees of freedom or whether the reported parity with GELU survives stricter controls. The note that a fully Boolean version diverges also suggests the fuzzy relaxation is load-bearing, which needs explicit measurement.

The work is aimed at researchers who want interpretability built into the layer rather than recovered afterward. It is worth sending to peer review because the mechanism is distinct from existing MLP or attention variants and the grammatical-licensing observation is specific enough to be checked. A referee can verify whether the legibility holds across seeds and whether the quantifiers preserve training dynamics at larger scales.

Referee Report

2 major / 1 minor

Summary. The paper proposes replacing the standard FFN with a negation-capable version (NC-FFN) that implements explicit fuzzy-set operations (intersection A*B and bounded set-difference A*(1-B)) on sigmoid-bounded memberships, yielding per-unit logical forms. At 125M scale on OpenWebText this matches GELU perplexity; on N-bit parity it is parameter-efficient at shallow depth. The authors identify that two-operand logic localizes to layer 0 and erodes, while a grammatical deficit appears in licensing/quantifiers. They add a small block of soft sequence quantifiers (existential and proportion) each equipped with a per-unit learned forgetting rate initialized stickily; this recovers the deficit at epoch one (halving the epoch-two gap), modestly improves LAMBADA, allows logical structure to migrate into depth, and produces units that, without dictionary learning, read as grammatical licensing detectors (firing on licensors and carrying memory to predict licensed tokens). The final model remains parameter-neutral and training-stable, though a fully Boolean FFN diverges.

Significance. If the experimental claims hold, the work supplies a parameter-neutral, interpretable-by-construction account of how an FFN can implement grammatical licensing via explicit fuzzy logic and sequence quantifiers whose forgetting dynamics are learned. The migration of structure into depth and the emergence of legible detectors without post-hoc analysis would be a concrete advance for mechanistic interpretability of feed-forward sublayers while preserving language-model quality.

major comments (2)

[Abstract] Abstract and § on sequence quantifiers: the central claim that the added quantifiers recover the grammatical deficit at epoch one while tying perplexity rests on experimental outcomes, yet the provided text supplies no equations defining the soft existential/proportion operators, no training curves, no ablation tables, and no statistical details on the halving of the gap or the LAMBADA lead; without these the result cannot be verified or reproduced.
[Abstract] Abstract: the assertion that NC-FFN plus quantifiers is 'parameter-neutral' requires explicit accounting of the extra parameters introduced by the per-unit forgetting rates and the quantifier block; if these are non-negligible the neutrality claim is load-bearing for the scaling argument.

minor comments (1)

[Abstract] The abstract refers to 'median half-life ~1.5 tokens' and 'zero latch units' without defining how half-life is computed from the learned forgetting rates.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for highlighting issues of verifiability and parameter accounting. Both points are addressable by expanding the manuscript with the requested details; we outline the planned revisions below.

read point-by-point responses

Referee: [Abstract] Abstract and § on sequence quantifiers: the central claim that the added quantifiers recover the grammatical deficit at epoch one while tying perplexity rests on experimental outcomes, yet the provided text supplies no equations defining the soft existential/proportion operators, no training curves, no ablation tables, and no statistical details on the halving of the gap or the LAMBADA lead; without these the result cannot be verified or reproduced.

Authors: We agree that the abstract and the section on sequence quantifiers omit the defining equations for the soft existential and proportion operators as well as supporting experimental evidence. The current text therefore does not allow direct verification or reproduction of the reported recovery of the grammatical deficit, the halving of the epoch-two gap, or the LAMBADA improvement. In the revised version we will insert the explicit equations for both quantifiers, include training curves and ablation tables that isolate their contribution, and report statistical details (means, standard deviations, and any significance tests) for the performance deltas. These additions will appear in the main text or as a dedicated supplementary section. revision: yes
Referee: [Abstract] Abstract: the assertion that NC-FFN plus quantifiers is 'parameter-neutral' requires explicit accounting of the extra parameters introduced by the per-unit forgetting rates and the quantifier block; if these are non-negligible the neutrality claim is load-bearing for the scaling argument.

Authors: The manuscript asserts parameter neutrality, yet does not supply a line-item count of the additional parameters arising from the per-unit forgetting rates and the quantifier block. We accept that an explicit accounting is required to support the claim at scale. The revision will include a table or paragraph that enumerates the exact parameter overhead of the forgetting rates (one scalar per unit) and the quantifier block, demonstrating that the net increase remains negligible relative to the 125 M baseline and does not alter the scaling argument. revision: yes

Circularity Check

0 steps flagged

No significant circularity detected

full rationale

The paper introduces an architectural change (NC-FFN using explicit intersection and bounded set-difference on [0,1] memberships, plus soft sequence quantifiers with per-unit learned forgetting rates) and reports empirical outcomes: parity efficiency, tying GELU perplexity at 125M scale, recovery of grammatical licensing at epoch one, and emergence of legible detectors. These are training results on external benchmarks (OpenWebText, LAMBADA) rather than quantities defined by the model's own fitted parameters or equations. No self-definitional loop, fitted-input-as-prediction, or load-bearing self-citation chain appears in the provided text; the central claims remain falsifiable against held-out data and baselines.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Abstract provides no explicit free parameters, axioms, or invented entities; the fuzzy operations and quantifiers are introduced as architectural choices whose justification rests on the reported experiments.

pith-pipeline@v0.9.1-grok · 5902 in / 1155 out tokens · 26556 ms · 2026-07-01T05:41:40.661162+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

31 extracted references · 3 canonical work pages

[1]

Artificial Intelligence , volume=

Tensor Product Variable Binding and the Representation of Symbolic Structures in Connectionist Systems , author=. Artificial Intelligence , volume=
[2]

Tensor Logic: The Language of

Domingos, Pedro , journal=. Tensor Logic: The Language of
[3]

International Conference on Machine Learning (ICML) , year=

Language Modeling with Gated Convolutional Networks , author=. International Conference on Machine Learning (ICML) , year=
[4]

Shazeer, Noam , journal=
[5]

and Dooms, Thomas and Rigg, Alice and Oramas, Jose and Sharkey, Lee , journal=

Pearce, Michael T. and Dooms, Thomas and Rigg, Alice and Oramas, Jose and Sharkey, Lee , journal=. Bilinear
[6]

and Dooms, Thomas and Rigg, Alice , journal=

Pearce, Michael T. and Dooms, Thomas and Rigg, Alice , journal=. Weight-based Decomposition: A Case for Bilinear
[7]

Advances in Neural Information Processing Systems (NeurIPS) , year=

Attention Is All You Need , author=. Advances in Neural Information Processing Systems (NeurIPS) , year=
[8]

Advances in Neural Information Processing Systems (NeurIPS) , year=

Neural Arithmetic Logic Units , author=. Advances in Neural Information Processing Systems (NeurIPS) , year=
[9]

Artificial Intelligence , volume=

Analyzing Differentiable Fuzzy Logic Operators , author=. Artificial Intelligence , volume=
[10]

Artificial Intelligence , volume=

Logic Tensor Networks , author=. Artificial Intelligence , volume=
[11]

arXiv preprint arXiv:2006.13155 , year=

Logical Neural Networks , author=. arXiv preprint arXiv:2006.13155 , year=

work page arXiv 2006
[12]

Advances in Neural Information Processing Systems (NeurIPS) , year=

Deep Differentiable Logic Gate Networks , author=. Advances in Neural Information Processing Systems (NeurIPS) , year=
[13]

Advances in Neural Information Processing Systems (NeurIPS) , year=

Convolutional Differentiable Logic Gate Networks , author=. Advances in Neural Information Processing Systems (NeurIPS) , year=
[14]

Advances in Neural Information Processing Systems (NeurIPS) , year=

Beta Embeddings for Multi-Hop Logical Reasoning in Knowledge Graphs , author=. Advances in Neural Information Processing Systems (NeurIPS) , year=
[15]

International Conference on Learning Representations (ICLR) , year=

Query2box: Reasoning over Knowledge Graphs in Vector Space using Box Embeddings , author=. International Conference on Learning Representations (ICLR) , year=
[16]

Zhang, Zhanqiu and Wang, Jie and Chen, Jiajun and Ji, Shuiwang and Wu, Feng , booktitle=
[17]

AAAI Conference on Artificial Intelligence , year=

Fuzzy Logic Based Logical Query Answering on Knowledge Graphs , author=. AAAI Conference on Artificial Intelligence , year=
[18]

Transformer Circuits Thread , year=

Softmax Linear Units , author=. Transformer Circuits Thread , year=
[19]

arXiv preprint arXiv:2310.17230 , year=

Codebook Features: Sparse and Discrete Interpretability for Neural Networks , author=. arXiv preprint arXiv:2310.17230 , year=

work page arXiv
[20]

Transformer Circuits Thread , year=

Toy Models of Superposition , author=. Transformer Circuits Thread , year=
[21]

Transformer Circuits Thread , year=

Towards Monosemanticity: Decomposing Language Models with Dictionary Learning , author=. Transformer Circuits Thread , year=
[22]

International Conference on Learning Representations (ICLR) , year=

Sparse Autoencoders Find Highly Interpretable Features in Language Models , author=. International Conference on Learning Representations (ICLR) , year=
[23]

Empirical Methods in Natural Language Processing (EMNLP) , year=

Transformer Feed-Forward Layers Are Key-Value Memories , author=. Empirical Methods in Natural Language Processing (EMNLP) , year=
[24]

arXiv preprint arXiv:2509.13357 , year=

Semantic Fusion with Fuzzy-Membership Features for Controllable Language Modelling , author=. arXiv preprint arXiv:2509.13357 , year=

work page arXiv
[25]

, journal=

Warstadt, Alex and Parrish, Alicia and Liu, Haokun and Mohananey, Anhad and Peng, Wei and Wang, Sheng-Fu and Bowman, Samuel R. , journal=
[26]

Paperno, Denis and Kruszewski, Germ. The. Association for Computational Linguistics (ACL) , year=
[27]

Zellers, Rowan and Holtzman, Ari and Bisk, Yonatan and Farhadi, Ali and Choi, Yejin , booktitle=
[28]

Sakaguchi, Keisuke and Le Bras, Ronan and Bhagavatula, Chandra and Choi, Yejin , journal=
[29]

Think You Have Solved Question Answering? Try

Clark, Peter and Cowhey, Isaac and Etzioni, Oren and Khot, Tushar and Sabharwal, Ashish and Schoenick, Carissa and Tafjord, Oyvind , journal=. Think You Have Solved Question Answering? Try
[30]

A Framework for Few-Shot Language Model Evaluation , author=
[31]

and Schmidhuber, J

Gers, Felix A. and Schmidhuber, J. Learning to Forget: Continual Prediction with. Neural Computation , volume=

[1] [1]

Artificial Intelligence , volume=

Tensor Product Variable Binding and the Representation of Symbolic Structures in Connectionist Systems , author=. Artificial Intelligence , volume=

[2] [2]

Tensor Logic: The Language of

Domingos, Pedro , journal=. Tensor Logic: The Language of

[3] [3]

International Conference on Machine Learning (ICML) , year=

Language Modeling with Gated Convolutional Networks , author=. International Conference on Machine Learning (ICML) , year=

[4] [4]

Shazeer, Noam , journal=

[5] [5]

and Dooms, Thomas and Rigg, Alice and Oramas, Jose and Sharkey, Lee , journal=

Pearce, Michael T. and Dooms, Thomas and Rigg, Alice and Oramas, Jose and Sharkey, Lee , journal=. Bilinear

[6] [6]

and Dooms, Thomas and Rigg, Alice , journal=

Pearce, Michael T. and Dooms, Thomas and Rigg, Alice , journal=. Weight-based Decomposition: A Case for Bilinear

[7] [7]

Advances in Neural Information Processing Systems (NeurIPS) , year=

Attention Is All You Need , author=. Advances in Neural Information Processing Systems (NeurIPS) , year=

[8] [8]

Advances in Neural Information Processing Systems (NeurIPS) , year=

Neural Arithmetic Logic Units , author=. Advances in Neural Information Processing Systems (NeurIPS) , year=

[9] [9]

Artificial Intelligence , volume=

Analyzing Differentiable Fuzzy Logic Operators , author=. Artificial Intelligence , volume=

[10] [10]

Artificial Intelligence , volume=

Logic Tensor Networks , author=. Artificial Intelligence , volume=

[11] [11]

arXiv preprint arXiv:2006.13155 , year=

Logical Neural Networks , author=. arXiv preprint arXiv:2006.13155 , year=

work page arXiv 2006

[12] [12]

Advances in Neural Information Processing Systems (NeurIPS) , year=

Deep Differentiable Logic Gate Networks , author=. Advances in Neural Information Processing Systems (NeurIPS) , year=

[13] [13]

Advances in Neural Information Processing Systems (NeurIPS) , year=

Convolutional Differentiable Logic Gate Networks , author=. Advances in Neural Information Processing Systems (NeurIPS) , year=

[14] [14]

Advances in Neural Information Processing Systems (NeurIPS) , year=

Beta Embeddings for Multi-Hop Logical Reasoning in Knowledge Graphs , author=. Advances in Neural Information Processing Systems (NeurIPS) , year=

[15] [15]

International Conference on Learning Representations (ICLR) , year=

Query2box: Reasoning over Knowledge Graphs in Vector Space using Box Embeddings , author=. International Conference on Learning Representations (ICLR) , year=

[16] [16]

Zhang, Zhanqiu and Wang, Jie and Chen, Jiajun and Ji, Shuiwang and Wu, Feng , booktitle=

[17] [17]

AAAI Conference on Artificial Intelligence , year=

Fuzzy Logic Based Logical Query Answering on Knowledge Graphs , author=. AAAI Conference on Artificial Intelligence , year=

[18] [18]

Transformer Circuits Thread , year=

Softmax Linear Units , author=. Transformer Circuits Thread , year=

[19] [19]

arXiv preprint arXiv:2310.17230 , year=

Codebook Features: Sparse and Discrete Interpretability for Neural Networks , author=. arXiv preprint arXiv:2310.17230 , year=

work page arXiv

[20] [20]

Transformer Circuits Thread , year=

Toy Models of Superposition , author=. Transformer Circuits Thread , year=

[21] [21]

Transformer Circuits Thread , year=

Towards Monosemanticity: Decomposing Language Models with Dictionary Learning , author=. Transformer Circuits Thread , year=

[22] [22]

International Conference on Learning Representations (ICLR) , year=

Sparse Autoencoders Find Highly Interpretable Features in Language Models , author=. International Conference on Learning Representations (ICLR) , year=

[23] [23]

Empirical Methods in Natural Language Processing (EMNLP) , year=

Transformer Feed-Forward Layers Are Key-Value Memories , author=. Empirical Methods in Natural Language Processing (EMNLP) , year=

[24] [24]

arXiv preprint arXiv:2509.13357 , year=

Semantic Fusion with Fuzzy-Membership Features for Controllable Language Modelling , author=. arXiv preprint arXiv:2509.13357 , year=

work page arXiv

[25] [25]

, journal=

Warstadt, Alex and Parrish, Alicia and Liu, Haokun and Mohananey, Anhad and Peng, Wei and Wang, Sheng-Fu and Bowman, Samuel R. , journal=

[26] [26]

Paperno, Denis and Kruszewski, Germ. The. Association for Computational Linguistics (ACL) , year=

[27] [27]

Zellers, Rowan and Holtzman, Ari and Bisk, Yonatan and Farhadi, Ali and Choi, Yejin , booktitle=

[28] [28]

Sakaguchi, Keisuke and Le Bras, Ronan and Bhagavatula, Chandra and Choi, Yejin , journal=

[29] [29]

Think You Have Solved Question Answering? Try

Clark, Peter and Cowhey, Isaac and Etzioni, Oren and Khot, Tushar and Sabharwal, Ashish and Schoenick, Carissa and Tafjord, Oyvind , journal=. Think You Have Solved Question Answering? Try

[30] [30]

A Framework for Few-Shot Language Model Evaluation , author=

[31] [31]

and Schmidhuber, J

Gers, Felix A. and Schmidhuber, J. Learning to Forget: Continual Prediction with. Neural Computation , volume=