Guangxuan Xiao
Identifiers
- name variant Guangxuan Xiao 0.60 · backfill
Papers (8)
- SparDA: Sparse Decoupled Attention for Efficient Long-Context LLM Inference cs.CL · 2026 · author #2
- BLASST: Dynamic BLocked Attention Sparsity via Softmax Thresholding cs.CL · 2025 · author #6
- Four Over Six: More Accurate NVFP4 Quantization with Adaptive Block Scaling cs.CL · 2025 · author #3
- StreamingVLM: Real-Time Understanding for Infinite Video Streams cs.CV · 2025 · author #2
- DuoAttention: Efficient Long-Context LLM Inference with Retrieval and Streaming Heads cs.CL · 2024 · author #1
- Quest: Query-Aware Sparsity for Efficient Long-Context LLM Inference cs.CL · 2024 · author #4
- Efficient Streaming Language Models with Attention Sinks cs.CL · 2023 · author #1
- AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration cs.CL · 2023 · author #7
Mentions
- 2606.04511 #2 · arxiv_oai · confidence 0.70 Guangxuan Xiao
- 2410.10819 #1 · arxiv_oai · confidence 0.70 Guangxuan Xiao
- 2510.09608 #2 · arxiv_oai · confidence 0.70 Guangxuan Xiao
Frequent Coauthors
- Song Han 8 shared papers
- Jiaming Tang 3 shared papers
- Haotian Tang 2 shared papers
- Junxian Guo 2 shared papers
- Shang Yang 2 shared papers
- Asit Mishra 1 shared papers
- Baris Kasikci 1 shared papers
- Beidi Chen 1 shared papers
- Bo Li 1 shared papers
- Cameron Shinn 1 shared papers
- Carlo del Mundo 1 shared papers
- Chuang Gan 1 shared papers
- Dominic Brown 1 shared papers
- George Klimiashvili 1 shared papers
- Huizi Mao 1 shared papers
- Jack Cook 1 shared papers
- Jiayi Yuan 1 shared papers
- Ji Lin 1 shared papers
- Jingwei Zuo 1 shared papers
- Jingze Cui 1 shared papers