An image is worth 1/2 tokens after layer 2: Plug-and-play inference acceleration for large vision-language models

7 Pith papers cite this work. Polarity classification is still indexing.

citation-role summary

background 3

citation-polarity summary

roles: background 3

polarities: background 2, unclear 1

representative citing papers

UIPress: Bringing Optical Token Compression to UI-to-Code Generation

cs.CL · 2026-04-10 · unverdicted · novelty 7.0

UIPress is the first encoder-side learned optical compression method for UI-to-Code. It compresses visual tokens to 256, outperforms the uncompressed baseline by 7.5% and the best inference-time baseline by 4.6% in CLIP score, and delivers a 9.1x time-to-first-token (TTFT) speedup.
