PixelRAG shows that operating RAG entirely over web screenshots outperforms text-based retrieval on NQ, SimpleQA, MMSearch, LiveVQA, and MoNaCo, with up to 18.1% accuracy gains and 3x token savings via image compression.
Leann: A low-storage vector index.arXiv preprint arXiv:2506.08276, 2025
2 Pith papers cite this work. Polarity classification is still indexing.
2
Pith papers citing it
years
2026 2verdicts
UNVERDICTED 2representative citing papers
TileFuse introduces fused kernels and data layouts for W4A16/W8A16 on AMD XDNA2 NPUs, reporting up to 2.0x lower LLM prefilling latency and 64.6% lower energy versus baselines.
citing papers explorer
-
PIXELRAG: Web Screenshots Beat Text for Retrieval-Augmented Generation
PixelRAG shows that operating RAG entirely over web screenshots outperforms text-based retrieval on NQ, SimpleQA, MMSearch, LiveVQA, and MoNaCo, with up to 18.1% accuracy gains and 3x token savings via image compression.
-
TileFuse: A Fused Mixed-Precision Kernel Library for Efficient Quantized LLM Inference on AMD NPUs
TileFuse introduces fused kernels and data layouts for W4A16/W8A16 on AMD XDNA2 NPUs, reporting up to 2.0x lower LLM prefilling latency and 64.6% lower energy versus baselines.