Byt5: Towards a token-free future with pre-trained byte-to-byte models

· 2022

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

browse 3 citing papers

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

The Query Channel: Information-Theoretic Limits of Masking-Based Explanations

cs.AI · 2026-04-17 · unverdicted · novelty 8.0

Masking-based explanations are governed by the information capacity of the query channel, with reliable recovery achievable below capacity via sparse maximum-likelihood decoding but impossible above it.

LLM-Viterbi: Semantic-Aware Decoding for Convolutional Codes

cs.IT · 2026-04-21 · unverdicted · novelty 7.0

An LLM-enhanced Viterbi decoder achieves roughly 1.5 dB extra coding gain in block error rate and over 50% better semantic similarity than conventional Viterbi for constraint-length-3 convolutional codes on AWGN channels.

MambaNetBurst: Direct Byte-level Network Traffic Classification without Tokenization or Pretraining

cs.CR · 2026-05-11 · unverdicted · novelty 6.0

A compact Mamba-2 model performs end-to-end byte-level network traffic classification without tokenization or pre-training and remains competitive with substantially larger pre-trained systems.

citing papers explorer

Showing 3 of 3 citing papers.

The Query Channel: Information-Theoretic Limits of Masking-Based Explanations cs.AI · 2026-04-17 · unverdicted · none · ref 31
Masking-based explanations are governed by the information capacity of the query channel, with reliable recovery achievable below capacity via sparse maximum-likelihood decoding but impossible above it.
LLM-Viterbi: Semantic-Aware Decoding for Convolutional Codes cs.IT · 2026-04-21 · unverdicted · none · ref 11
An LLM-enhanced Viterbi decoder achieves roughly 1.5 dB extra coding gain in block error rate and over 50% better semantic similarity than conventional Viterbi for constraint-length-3 convolutional codes on AWGN channels.
MambaNetBurst: Direct Byte-level Network Traffic Classification without Tokenization or Pretraining cs.CR · 2026-05-11 · unverdicted · none · ref 14
A compact Mamba-2 model performs end-to-end byte-level network traffic classification without tokenization or pre-training and remains competitive with substantially larger pre-trained systems.

Byt5: Towards a token-free future with pre-trained byte-to-byte models

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer