pith. sign in

An image is worth 1/2 tokens after layer 2: Plug-and-play inference acceleration for large vision-language models

9 Pith papers cite this work. Polarity classification is still indexing.

9 Pith papers citing it

citation-role summary

background 3

citation-polarity summary

roles

background 3

polarities

background 2 unclear 1

clear filters

representative citing papers

UIPress: Bringing Optical Token Compression to UI-to-Code Generation

cs.CL · 2026-04-10 · unverdicted · novelty 7.0

UIPress is the first encoder-side learned optical compression method for UI-to-Code that compresses visual tokens to 256, outperforming the uncompressed baseline by 7.5% CLIP score and the best inference-time baseline by 4.6% while delivering 9.1x TTFT speedup.

citing papers explorer

Showing 1 of 1 citing paper after filters.