Textcot: Zoom-in for enhanced multimodal text-rich image understanding.ACM Transactions on Multimedia Computing, Communications and Applications, 22(4):1–19, 2026
1 Pith paper cite this work. Polarity classification is still indexing.
1
Pith paper citing it