Ho Fai Leung
Identifiers
No identifiers captured yet.
Papers (2)
- FairyFuse: Multiplication-Free LLM Inference on CPUs via Fused Ternary Kernels cs.LG · 2026 · author #5
- RateQuant: Optimal Mixed-Precision KV Cache Quantization via Rate-Distortion Theory cs.LG · 2026 · author #5
Mentions
No mention provenance yet.
Frequent Coauthors
- Fei Zuo 2 shared papers
- Xiaoyan Xi 2 shared papers
- Feiyu Wang 1 shared papers
- Hao Cong 1 shared papers
- Quanyi Zeng 1 shared papers
- Zikang Zhou 1 shared papers