A Mechanistic Study of Tabular Foundation Models

Anderson Schneider; James T. Wilson; Marin Bilo\v{s}; Yuriy Nevmyvaka

arxiv: 2605.21288 · v1 · pith:UQN2SR56new · submitted 2026-05-20 · 💻 cs.LG

A Mechanistic Study of Tabular Foundation Models

Marin Bilo\v{s} , James T. Wilson , Anderson Schneider , Yuriy Nevmyvaka This is my paper

Pith reviewed 2026-05-21 06:10 UTC · model grok-4.3

classification 💻 cs.LG

keywords tabular foundation modelsmechanistic interpretabilityin-context learningsimilarity-based readoutscausal interventionspermutation invariancetabular classificationrobustness

0 comments

The pith

Tabular foundation models use distinct similarity-based readouts for in-context predictions that causal interventions can isolate.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

Tabular foundation models with different architectures reach similar accuracy on classification and regression tasks, but this leaves open whether they follow the same internal steps. The work probes the models directly to identify how they compute predictions from context examples. It finds that the families employ qualitatively different similarity-based readouts, ranging from attention-weighted votes over context labels to class-conditional mean calculations. These differences are verified by causal interventions that alter behavior in the predicted directions. The analysis also locates the source of each model's permutation invariances in specific positional parameters and shows that targeted perturbations reproduce the expected failure patterns for each readout.

Core claim

The model families realize qualitatively distinct similarity-based readouts: from an attention-weighted vote over context labels to a class-conditional mean readout, each confirmed by causal intervention. Representation collapse is not a practical concern for them. Each model's permutation invariances trace to specific positional parameters whose removal preserves accuracy and makes approximate invariance exact. Perturbations engineered against each readout reproduce predicted failure modes; hub and rank attacks isolate them from refit baselines.

What carries the argument

Causal interventions that target and confirm distinct similarity-based readout mechanisms, such as attention-weighted voting versus class-conditional mean computation.

If this is right

Hub and rank attacks will isolate each model's readout from refit baselines by reproducing its specific failure modes.
Removing the identified positional parameters will turn approximate permutation invariance into exact invariance while keeping accuracy intact.
Representation collapse will not impair practical performance across the model families.
Each architecture will exhibit characteristic breakdowns under perturbations designed for its readout.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

Model choice for a given tabular task could be guided by matching the readout mechanism to the structure of the data rather than accuracy numbers alone.
Hybrid architectures that combine elements of multiple readouts might improve robustness to the attacks that currently target one mechanism at a time.
The same intervention approach could be applied to test whether the identified readouts remain stable after fine-tuning on new tabular domains.
These distinctions suggest that scaling laws for tabular models may need to account for readout type in addition to parameter count.

Load-bearing premise

The causal interventions isolate the readout mechanism without altering other computation paths or representations that could produce the same behavioral change.

What would settle it

Perform the causal intervention meant to disrupt an attention-weighted vote and observe whether predictions shift exactly as expected for that mechanism but not for models using a class-conditional mean readout.

Figures

Figures reproduced from arXiv: 2605.21288 by Anderson Schneider, James T. Wilson, Marin Bilo\v{s}, Yuriy Nevmyvaka.

**Figure 1.** Figure 1: Subset of results. Illustration of readout mechanisms: (a) attention-weighted vote for TabPFNv2 and Mitra, both on layer 9; (b) TabPFNv2 L9 puts most of its attention mass on same-class context rows; (c) nearest class prototype for TabICLv2, at L11. Causal evidence: (d) each backbone’s native readout stays within a few pp of end-to-end accuracy, while cross-backbone readouts collapse; (e) five mechanism-gr… view at source ↗

**Figure 2.** Figure 2: Per-layer profiles on the 49-dataset classification suite. Thin lines: seed-averaged perdataset curves; thick black lines: means. Left: TabPFNv2 linear-probe accuracy converges late with a sharp jump at L8 → L9. Middle: Pearson correlation between an attention-weighted vote read off layer L and TabPFNv2’s predicted probabilities; the vote rule (§3.1) becomes faithful only at L9. Right: TabICLv2 cosine-kNN… view at source ↗

**Figure 4.** Figure 4: Hand-crafted models that isolate the three symmetry-breaking devices. M0– M3 are single attention layers over m=3 binary features (M0: shared-cell mean-pool; M1: column identifier; M2: per-column attention mask; M3: pair tokens). Transductive accuracy on Task A (y=x1), B (y=x1 ⊕ x2), C (y=maj(x)). M0 is capped at the 0.75 multiset bound; M1, M2, M3 reach 1.0 on A; only M3 (pair tokens) breaks the bound … view at source ↗

**Figure 5.** Figure 5: Per-dataset accuracy spread under random column permutations on the 49-dataset bench [PITH_FULL_IMAGE:figures/full_fig_p040_5.png] view at source ↗

read the original abstract

Tabular foundation models with different architectures converge in accuracy across a range of classification and regression tasks. This raises questions a leaderboard cannot answer: (i) whether the models execute the same in-context algorithm, (ii) where row, column, and class-permutation invariances originate, and (iii) how robust they are under perturbations engineered against the inferred mechanism. We characterize all three. The model families realize qualitatively distinct similarity-based readouts: from an attention-weighted vote over context labels to a class-conditional mean readout, each confirmed by causal intervention. We find that the representation collapse highlighted in prior work is not a practical concern for them. Each model's permutation invariances trace to specific positional parameters whose removal preserves accuracy and makes approximate invariance exact. Perturbations engineered against each readout reproduce predicted failure modes; hub and rank attacks isolate them from refit baselines. Together these results give a mechanistic account of contemporary tabular foundation models and identify which inductive biases govern both their accuracy and characteristic failures.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

Tabular foundation models use distinct internal readouts like attention-weighted voting versus class-conditional means, backed by interventions, though the isolation of those interventions could be tighter.

read the letter

The main takeaway is that these models reach similar accuracies but run different internal algorithms. Some families appear to compute predictions via an attention-weighted vote over context labels, while others use a class-conditional mean readout. The authors support the distinction with causal interventions and then connect each model's permutation invariances to specific positional parameters whose removal keeps accuracy intact and turns approximate invariance into exact invariance. They also show that targeted perturbations reproduce the failure modes expected from each mechanism and separate the models from refit baselines. Representation collapse from earlier papers does not seem to matter much in practice here.

Referee Report

1 major / 2 minor

Summary. The manuscript claims that tabular foundation models with different architectures achieve similar accuracies but implement qualitatively distinct similarity-based readouts for in-context classification and regression: an attention-weighted vote over context labels versus a class-conditional mean readout. These mechanisms are confirmed via causal interventions. The work further shows that permutation invariances (row, column, class) originate from specific positional parameters whose removal preserves accuracy and renders approximate invariance exact. Targeted perturbations against each readout reproduce predicted failure modes and isolate the models from refit baselines. Representation collapse is argued not to be a practical concern.

Significance. If the causal interventions cleanly isolate the hypothesized readout mechanisms without confounding other pathways, this provides a substantive mechanistic account of tabular foundation models that goes beyond accuracy leaderboards. The explicit tracing of invariances to removable positional parameters and the reproduction of failure modes via engineered attacks would constitute a clear contribution to understanding inductive biases in this domain. The absence of circular derivations or invented entities in the analysis is a positive feature.

major comments (1)

[Section 4] Section 4 and associated intervention figures: the central claim that interventions (ablating attention heads, masking context labels) isolate distinct readout mechanisms is load-bearing, yet the manuscript does not report controls that hold positional encodings, column embeddings, and other shared representation layers fixed while only modifying the hypothesized similarity computation path. Without such isolation, behavioral changes could arise from unintended alterations to permutation-related parameters that the paper itself later links to invariance.

minor comments (2)

The methods section lacks sufficient detail on dataset characteristics, number of independent runs, statistical reporting (e.g., error bars, significance tests), and exact intervention hyperparameters; these omissions hinder reproducibility and assessment of intervention specificity.
[Section 4] Figures showing intervention results should include quantitative measures of effect size and variability across seeds or datasets to support the qualitative distinction between model families.

Simulated Author's Rebuttal

1 responses · 0 unresolved

We thank the referee for the constructive and detailed feedback. We address the major comment on the isolation of readout mechanisms below.

read point-by-point responses

Referee: [Section 4] Section 4 and associated intervention figures: the central claim that interventions (ablating attention heads, masking context labels) isolate distinct readout mechanisms is load-bearing, yet the manuscript does not report controls that hold positional encodings, column embeddings, and other shared representation layers fixed while only modifying the hypothesized similarity computation path. Without such isolation, behavioral changes could arise from unintended alterations to permutation-related parameters that the paper itself later links to invariance.

Authors: We agree that explicit controls isolating the similarity computation path would strengthen the causal claims. Our ablations target attention heads that implement the weighted-vote readout and mask labels that feed the class-conditional mean; these operations do not directly edit the positional parameters later shown to control invariance. Nevertheless, to rule out indirect effects, the revised manuscript will add control experiments that freeze positional encodings, column embeddings, and other shared layers while repeating the head ablations and label masking. These controls will demonstrate that the observed behavioral shifts remain attributable to the distinct readout mechanisms. We view this addition as a straightforward strengthening of the existing analysis. revision: yes

Circularity Check

0 steps flagged

No significant circularity; empirical interventions form independent evidence

full rationale

The paper advances its central claims through direct causal interventions, ablation studies, and perturbation experiments that test hypothesized readout mechanisms and permutation invariances. No equations or derivations are presented that reduce a 'prediction' to a fitted parameter by construction, nor does any load-bearing premise rest on a self-citation chain whose validity is internal to the present work. The analysis of distinct similarity-based readouts, representation collapse, and engineered failure modes is grounded in observable behavioral changes under controlled modifications rather than self-referential definitions or ansatzes imported from prior author work. This structure keeps the derivation chain self-contained and falsifiable against external model behaviors.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

The work relies on standard assumptions from mechanistic interpretability that interventions can isolate functional components.

axioms (1)

domain assumption Causal interventions on model activations can isolate the prediction readout mechanism without side effects on other computations
Invoked when confirming distinct readouts via intervention

pith-pipeline@v0.9.0 · 5705 in / 1149 out tokens · 40990 ms · 2026-05-21T06:10:45.435074+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

20 extracted references · 20 canonical work pages

[1]

arXiv , year =

Hu, Jingnan and Ghelichi, Sina , title =. arXiv , year =

work page
[2]

European Signal Processing Conference (EUSIPCO) , year =

Roy, Olivier and Vetterli, Martin , title =. European Signal Processing Conference (EUSIPCO) , year =

work page
[3]

bioRxiv , year =

Gao, Peiran and Trautmann, Eric and Yu, Byron and Santhanam, Gopal and Ryu, Stephen and Shenoy, Krishna and Ganguli, Surya , title =. bioRxiv , year =

work page
[4]

Accurate predictions on small data with a tabular foundation model , journal =

Hollmann, Noah and M. Accurate predictions on small data with a tabular foundation model , journal =. 2025 , publisher =

work page 2025
[5]

Proceedings of the 17th ACM International Conference on Web Search and Data Mining , pages=

Table meets llm: Can large language models understand structured table data? a benchmark and empirical study , author=. Proceedings of the 17th ACM International Conference on Web Search and Data Mining , pages=

work page
[6]

arXiv , year=

TabPFN Through The Looking Glass: An interpretability study of TabPFN and its internal representations , author=. arXiv , year=

work page
[7]

Advances in Neural Information Processing Systems , year=

When do neural nets outperform boosted trees on tabular data? , author=. Advances in Neural Information Processing Systems , year=

work page
[8]

International Conference on Machine Learning , year =

Qu, Jingang and Holzm. International Conference on Machine Learning , year =

work page
[9]

International Conference on Machine Learning , year=

Transformers learn in-context by gradient descent , author=. International Conference on Machine Learning , year=

work page
[10]

What learning algorithm is in-context learning?

Aky. What learning algorithm is in-context learning?. International Conference on Learning Representations , year =

work page
[11]

Advances in Neural Information Processing Systems , year =

Garg, Shivam and Tsipras, Dimitris and Liang, Percy S and Valiant, Gregory , title =. Advances in Neural Information Processing Systems , year =

work page
[12]

Advances in Neural Information Processing Systems , year =

Tabdpt: Scaling tabular foundation models on real data , author=. Advances in Neural Information Processing Systems , year =

work page
[13]

arXiv , year=

ScoringBench: A Benchmark for Evaluating Tabular Foundation Models with Proper Scoring Rules , author=. arXiv , year=

work page
[14]

Transformers Can Do

M. Transformers Can Do. International Conference on Learning Representations , year =

work page
[15]

Scientific Reports , volume=

Estimating the intrinsic dimension of datasets by a minimal neighborhood information , author=. Scientific Reports , volume=. 2017 , publisher=

work page 2017
[16]

ICML Workshop on Theoretical Foundations and Applications of Deep Generative Models , year =

Garnelo, Marta and Schwarz, Jonathan and Rosenbaum, Dan and Viola, Fabio and Rezende, Danilo J and Eslami, SM and Teh, Yee Whye , title =. ICML Workshop on Theoretical Foundations and Applications of Deep Generative Models , year =

work page
[17]

Advances in Neural Information Processing Systems , year =

Zaheer, Manzil and Kottur, Satwik and Ravanbakhsh, Siamak and Poczos, Barnabas and Salakhutdinov, Russ R and Smola, Alexander J , title =. Advances in Neural Information Processing Systems , year =

work page
[18]

Proceedings of the 2024 conference on empirical methods in natural language processing , year=

A survey on in-context learning , author=. Proceedings of the 2024 conference on empirical methods in natural language processing , year=

work page 2024
[19]

and Yin, Junming and Erickson, Nick and Ansari, Abdul Fatir and Han, Boran and Zhang, Shuai and Akoglu, Leman and Faloutsos, Christos and Mahoney, Michael W

Zhang, Xiyuan and Maddix, Danielle C. and Yin, Junming and Erickson, Nick and Ansari, Abdul Fatir and Han, Boran and Zhang, Shuai and Akoglu, Leman and Faloutsos, Christos and Mahoney, Michael W. and Hu, Cuixiong and Rangwala, Huzefa and Karypis, George and Wang, Bernie , title =. Advances in Neural Information Processing Systems , year =

work page
[20]

and Kaiser,

Vaswani, Ashish and Shazeer, Noam and Parmar, Niki and Uszkoreit, Jakob and Jones, Llion and Gomez, Aidan N. and Kaiser,. Attention is All You Need , booktitle =

work page

[1] [1]

arXiv , year =

Hu, Jingnan and Ghelichi, Sina , title =. arXiv , year =

work page

[2] [2]

European Signal Processing Conference (EUSIPCO) , year =

Roy, Olivier and Vetterli, Martin , title =. European Signal Processing Conference (EUSIPCO) , year =

work page

[3] [3]

bioRxiv , year =

Gao, Peiran and Trautmann, Eric and Yu, Byron and Santhanam, Gopal and Ryu, Stephen and Shenoy, Krishna and Ganguli, Surya , title =. bioRxiv , year =

work page

[4] [4]

Accurate predictions on small data with a tabular foundation model , journal =

Hollmann, Noah and M. Accurate predictions on small data with a tabular foundation model , journal =. 2025 , publisher =

work page 2025

[5] [5]

Proceedings of the 17th ACM International Conference on Web Search and Data Mining , pages=

Table meets llm: Can large language models understand structured table data? a benchmark and empirical study , author=. Proceedings of the 17th ACM International Conference on Web Search and Data Mining , pages=

work page

[6] [6]

arXiv , year=

TabPFN Through The Looking Glass: An interpretability study of TabPFN and its internal representations , author=. arXiv , year=

work page

[7] [7]

Advances in Neural Information Processing Systems , year=

When do neural nets outperform boosted trees on tabular data? , author=. Advances in Neural Information Processing Systems , year=

work page

[8] [8]

International Conference on Machine Learning , year =

Qu, Jingang and Holzm. International Conference on Machine Learning , year =

work page

[9] [9]

International Conference on Machine Learning , year=

Transformers learn in-context by gradient descent , author=. International Conference on Machine Learning , year=

work page

[10] [10]

What learning algorithm is in-context learning?

Aky. What learning algorithm is in-context learning?. International Conference on Learning Representations , year =

work page

[11] [11]

Advances in Neural Information Processing Systems , year =

Garg, Shivam and Tsipras, Dimitris and Liang, Percy S and Valiant, Gregory , title =. Advances in Neural Information Processing Systems , year =

work page

[12] [12]

Advances in Neural Information Processing Systems , year =

Tabdpt: Scaling tabular foundation models on real data , author=. Advances in Neural Information Processing Systems , year =

work page

[13] [13]

arXiv , year=

ScoringBench: A Benchmark for Evaluating Tabular Foundation Models with Proper Scoring Rules , author=. arXiv , year=

work page

[14] [14]

Transformers Can Do

M. Transformers Can Do. International Conference on Learning Representations , year =

work page

[15] [15]

Scientific Reports , volume=

Estimating the intrinsic dimension of datasets by a minimal neighborhood information , author=. Scientific Reports , volume=. 2017 , publisher=

work page 2017

[16] [16]

ICML Workshop on Theoretical Foundations and Applications of Deep Generative Models , year =

Garnelo, Marta and Schwarz, Jonathan and Rosenbaum, Dan and Viola, Fabio and Rezende, Danilo J and Eslami, SM and Teh, Yee Whye , title =. ICML Workshop on Theoretical Foundations and Applications of Deep Generative Models , year =

work page

[17] [17]

Advances in Neural Information Processing Systems , year =

Zaheer, Manzil and Kottur, Satwik and Ravanbakhsh, Siamak and Poczos, Barnabas and Salakhutdinov, Russ R and Smola, Alexander J , title =. Advances in Neural Information Processing Systems , year =

work page

[18] [18]

Proceedings of the 2024 conference on empirical methods in natural language processing , year=

A survey on in-context learning , author=. Proceedings of the 2024 conference on empirical methods in natural language processing , year=

work page 2024

[19] [19]

and Yin, Junming and Erickson, Nick and Ansari, Abdul Fatir and Han, Boran and Zhang, Shuai and Akoglu, Leman and Faloutsos, Christos and Mahoney, Michael W

Zhang, Xiyuan and Maddix, Danielle C. and Yin, Junming and Erickson, Nick and Ansari, Abdul Fatir and Han, Boran and Zhang, Shuai and Akoglu, Leman and Faloutsos, Christos and Mahoney, Michael W. and Hu, Cuixiong and Rangwala, Huzefa and Karypis, George and Wang, Bernie , title =. Advances in Neural Information Processing Systems , year =

work page

[20] [20]

and Kaiser,

Vaswani, Ashish and Shazeer, Noam and Parmar, Niki and Uszkoreit, Jakob and Jones, Llion and Gomez, Aidan N. and Kaiser,. Attention is All You Need , booktitle =

work page