pith. sign in

Guangxuan Xiao

Identifiers

  • name variant Guangxuan Xiao 0.60 · backfill

Papers (8)

  1. SparDA: Sparse Decoupled Attention for Efficient Long-Context LLM Inference cs.CL · 2026 · author #2
  2. BLASST: Dynamic BLocked Attention Sparsity via Softmax Thresholding cs.CL · 2025 · author #6
  3. Four Over Six: More Accurate NVFP4 Quantization with Adaptive Block Scaling cs.CL · 2025 · author #3
  4. StreamingVLM: Real-Time Understanding for Infinite Video Streams cs.CV · 2025 · author #2
  5. DuoAttention: Efficient Long-Context LLM Inference with Retrieval and Streaming Heads cs.CL · 2024 · author #1
  6. Quest: Query-Aware Sparsity for Efficient Long-Context LLM Inference cs.CL · 2024 · author #4
  7. Efficient Streaming Language Models with Attention Sinks cs.CL · 2023 · author #1
  8. AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration cs.CL · 2023 · author #7

Mentions

  • 2606.04511 #2 · arxiv_oai · confidence 0.70 Guangxuan Xiao
  • 2410.10819 #1 · arxiv_oai · confidence 0.70 Guangxuan Xiao
  • 2510.09608 #2 · arxiv_oai · confidence 0.70 Guangxuan Xiao

Frequent Coauthors