Thaiocrbench: A task-diverse benchmark for vision-language understanding in thai

Nonesung, S · arXiv 2511.04479

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

representative citing papers

Self-Prompting Diffusion Transformer for Open-Vocabulary Scene Text Editing via In-Context Learning

cs.CV · 2026-05-15 · unverdicted · novelty 6.0

A self-prompting MM-DiT model performs open-vocabulary scene text editing by extracting style and glyph information from the original image without extra encoders.

citing papers explorer

Showing 1 of 1 citing paper.

Self-Prompting Diffusion Transformer for Open-Vocabulary Scene Text Editing via In-Context Learning cs.CV · 2026-05-15 · unverdicted · none · ref 6
A self-prompting MM-DiT model performs open-vocabulary scene text editing by extracting style and glyph information from the original image without extra encoders.

Thaiocrbench: A task-diverse benchmark for vision-language understanding in thai

fields

years

verdicts

representative citing papers

citing papers explorer