Jorge P´erez, Pablo Barcel ´o, and Javier Marinkovic

· 2024 · arXiv 2407.17686

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

representative citing papers

Transformers Learn Latent Mixture Models In-Context via Mirror Descent

cs.LG · 2026-04-12 · unverdicted · novelty 8.0

A three-layer transformer exactly implements one step of mirror descent on latent mixture weights for next-token prediction, yielding a first-order approximation to the Bayes-optimal estimator.

Pre-trained Large Language Models Learn Hidden Markov Models In-context

cs.LG · 2025-06-08 · unverdicted · novelty 7.0

Pre-trained LLMs learn to predict HMM-generated sequences via in-context learning, approaching theoretical optimum on synthetic HMMs and matching expert models on real animal decision data.

The LZ78 Source

cs.IT · 2025-03-13 · unverdicted · novelty 6.0

LZ78 sources are almost stationary ergodic processes satisfying a Shannon-McMillan-Breiman property and local i.i.d. convergence, yet their finite-state compressibility exceeds the entropy rate by a Jensen gap.

citing papers explorer

Showing 3 of 3 citing papers.

Transformers Learn Latent Mixture Models In-Context via Mirror Descent cs.LG · 2026-04-12 · unverdicted · none · ref 1
A three-layer transformer exactly implements one step of mirror descent on latent mixture weights for next-token prediction, yielding a first-order approximation to the Bayes-optimal estimator.
Pre-trained Large Language Models Learn Hidden Markov Models In-context cs.LG · 2025-06-08 · unverdicted · none · ref 42
Pre-trained LLMs learn to predict HMM-generated sequences via in-context learning, approaching theoretical optimum on synthetic HMMs and matching expert models on real animal decision data.
The LZ78 Source cs.IT · 2025-03-13 · unverdicted · none · ref 17
LZ78 sources are almost stationary ergodic processes satisfying a Shannon-McMillan-Breiman property and local i.i.d. convergence, yet their finite-state compressibility exceeds the entropy rate by a Jensen gap.

Jorge P´erez, Pablo Barcel ´o, and Javier Marinkovic

fields

years

verdicts

representative citing papers

citing papers explorer