pith. sign in

arxiv: 2601.23169 · v2 · pith:ASQEG52Nnew · submitted 2026-01-30 · 💻 cs.LG · cs.LO· cs.SC

Names Don't Matter: Symbol-Invariant Transformer for Open-Vocabulary Learning

classification 💻 cs.LG cs.LOcs.SC
keywords interchangeablesymbolsmechanismnovelopen-vocabularystreamstokensacross
0
0 comments X
read the original abstract

Current neural architectures lack a principled way to handle interchangeable tokens, i.e., symbols that are semantically equivalent yet distinguishable, such as bound variables. As a result, models trained on fixed vocabularies often struggle to generalize to unseen symbols, even when the underlying semantics remain unchanged. We propose a novel Transformer-based mechanism that is provably invariant to the renaming of interchangeable tokens. Our approach employs parallel embedding streams to isolate the contribution of each interchangeable token in the input, combined with an aggregated attention mechanism that enables structured information sharing across streams. Experimental results confirm the theoretical guarantees of our method and demonstrate substantial performance gains on open-vocabulary tasks that require generalization to novel symbols. Project page: https://bu-depend-lab.github.io/Symbol-Invariant-Transformer/

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. When Symbol Names Should Not Matter: A Logistic Theory of Fresh-Symbol Classification

    cs.LG 2026-05 unverdicted novelty 7.0

    Regularized kernel logistic classifiers decompose into an ideal template classifier plus a perturbation from token overlaps modeled by a colored collision graph, yielding high-probability margin-transfer guarantees fo...