Glyph-ByT5: A Customized Text Encoder for Accurate Visual Text Rendering

Zeyu Liu, Weicong Liang, Zhanhao Liang, Chong Luo, Ji Li, Gao Huang, Yuhui Yuan · 2024 · arXiv 2403.09622

2 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

StyleTextGen: Style-Conditioned Multilingual Scene Text Generation

cs.CV · 2026-05-14 · unverdicted · novelty 6.0

StyleTextGen proposes a dual-branch style encoder, text style consistency loss, and mask-guided inference to achieve superior style consistency and cross-lingual performance in multilingual scene text generation on a new bilingual benchmark.

HunyuanVideo 1.5 Technical Report

cs.CV · 2025-11-24 · unverdicted · novelty 6.0

HunyuanVideo 1.5 delivers state-of-the-art open-source text-to-video and image-to-video generation with an 8.3B parameter DiT model featuring SSTA attention, glyph-aware encoding, and progressive training.

citing papers explorer

StyleTextGen: Style-Conditioned Multilingual Scene Text Generation · cs.CV · 2026-05-14 · unverdicted · none · ref 26
HunyuanVideo 1.5 Technical Report · cs.CV · 2025-11-24 · unverdicted · none · ref 15