pith. sign in

Ion Stoica

Identifiers

  • name variant Ion Stoica 0.60 · backfill

Papers (71)

  1. Arena-T2I Hard: Benchmarking and Improving Faithfulness with Dependency-Aware Checklist cs.AI · 2026 · author #8
  2. DualEval: Joint Model-Item Calibration for Unified LLM Evaluation cs.LG · 2026 · author #9
  3. HERALD: High-Throughput Block Diffusion LLM Serving via CPU-GPU Cooperative KV Cache Retrieval cs.LG · 2026 · author #5
  4. Playful Agentic Robot Learning cs.RO · 2026 · author #16
  5. BenchEvolver: Frontier Task Synthesis via Solution-Centric Evolution cs.SE · 2026 · author #12
  6. Idleness is Relative: Exploiting Tool-Call Idle Windows for Offloading in Agentic Systems with MORI cs.OS · 2026 · author #8
  7. Inference Time Context Sparsity: Illusion or Opportunity? cs.AI · 2026 · author #6
  8. The Time is Here for Just-in-Time Systems: Challenges and Opportunities cs.DB · 2026 · author #11
  9. Inductive Deductive Synthesis: Enabling AI to Generate Formally Verified Systems cs.AI · 2026 · author #12
  10. optimize_anything: A Universal API for Optimizing any Text Parameter cs.CL · 2026 · author #10
  11. AstraFlow: Dataflow-Oriented Reinforcement Learning for Agentic LLMs cs.LG · 2026 · author #8
  12. Uncovering Intra-expert Activation Sparsity for Efficient Mixture-of-Expert Model Execution cs.LG · 2026 · author #4
  13. Unleashing Scalable Context Parallelism for Foundation Models Pre-Training via FCP cs.DC · 2026 · author #9
  14. ClawEnvKit: Automatic Environment Generation for Claw-Like Agents cs.AI · 2026 · author #3
  15. UCCL-Zip: Lossless Compression Supercharged GPU Communication cs.DC · 2026 · author #9
  16. Foundry: Template-Based CUDA Graph Context Materialization for Fast LLM Serving Cold Start cs.DC · 2026 · author #5
  17. AI-Driven Research for Databases cs.DB · 2026 · author #8
  18. Combee: Scaling Prompt Learning for Self-Improving Language Model Agents cs.AI · 2026 · author #13
  19. The Price Reversal Phenomenon: When Cheaper Reasoning Models Cost More cs.CL · 2026 · author #4
  20. M$^2$RNN: Non-Linear RNNs with Matrix-Valued States for Scalable Language Modeling cs.LG · 2026 · author #3
  21. Flash-KMeans: Fast and Memory-Efficient Exact K-Means cs.DC · 2026 · author #13
  22. Quant VideoGen: Auto-Regressive Long Video Generation via 2-Bit KV-Cache Quantization cs.LG · 2026 · author #14
  23. Qrita: High-performance Top-k and Top-p using Pivot-based Truncation and Selection cs.AI · 2026 · author #4
  24. Measuring Agents in Production cs.CY · 2025 · author #23
  25. Continuum: Efficient and Robust Multi-Turn LLM Agent Scheduling with KV Cache Time-to-Live cs.OS · 2025 · author #10
  26. RLBoost: Harvesting Preemptible Resources for Cost-Efficient Reinforcement Learning on LLMs cs.DC · 2025 · author #8
  27. vAttention: Verified Sparse Attention cs.LG · 2025 · author #8
  28. GEPA: Reflective Prompt Evolution Can Outperform Reinforcement Learning cs.CL · 2025 · author #14
  29. Sparse VideoGen2: Accelerate Video Generation with Sparse Attention via Semantic-Aware Permutation cs.CV · 2025 · author #13
  30. Why Do Multi-Agent LLM Systems Fail? cs.AI · 2025 · author #13
  31. BlendServe: Optimizing Offline Inference for Auto-regressive Large Models with Resource-aware Batching cs.LG · 2024 · author #8
  32. JudgeBench: A Benchmark for Evaluating LLM-based Judges cs.AI · 2024 · author #8
  33. RouteLLM: Learning to Route LLMs with Preference Data cs.LG · 2024 · author #8
  34. From Crowdsourced Data to High-Quality Benchmarks: Arena-Hard and BenchBuilder Pipeline cs.LG · 2024 · author #8
  35. LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for Code cs.SE · 2024 · author #10
  36. Chatbot Arena: An Open Platform for Evaluating LLMs by Human Preference cs.AI · 2024 · author #11
  37. SGLang: Efficient Execution of Structured Language Model Programs cs.AI · 2023 · author #9
  38. MemGPT: Towards LLMs as Operating Systems cs.AI · 2023 · author #6
  39. Efficient Memory Management for Large Language Model Serving with PagedAttention cs.LG · 2023 · author #9
  40. Judging LLM-as-a-Judge with MT-Bench and Chatbot Arena cs.CL · 2023 · author #13
  41. Population Based Augmentation: Efficient Learning of Augmentation Policy Schedules cs.CV · 2019 · author #3
  42. Harmonia: Near-Linear Scalability for Replicated Storage with In-Network Conflict Detection cs.DC · 2019 · author #6
  43. Neural Packet Classification cs.NI · 2019 · author #4
  44. Cloud Programming Simplified: A Berkeley View on Serverless Computing cs.OS · 2019 · author #13
  45. The OoO VLIW JIT Compiler for GPU Inference cs.DC · 2019 · author #6
  46. DistCache: Provable Load Balancing for Large-Scale Storage Systems with Distributed Caching cs.DC · 2019 · author #8
  47. AutoPhase: Compiler Phase-Ordering for High Level Synthesis with Deep Reinforcement Learning cs.PL · 2019 · author #5
  48. Dynamic Space-Time Scheduling for GPU Inference cs.DC · 2018 · author #8
  49. numpywren: serverless linear algebra cs.DC · 2018 · author #6
  50. Learning to Optimize Join Queries With Deep Reinforcement Learning cs.DB · 2018 · author #5
  51. Tune: A Research Platform for Distributed Model Selection and Training cs.LG · 2018 · author #6
  52. Model-Based Value Estimation for Efficient Model-Free Reinforcement Learning cs.LG · 2018 · author #3
  53. NetChain: Scale-Free Sub-RTT Coordination (Extended Version) cs.DC · 2018 · author #8
  54. RLlib: Abstractions for Distributed Reinforcement Learning cs.AI · 2017 · author #9
  55. Ray: A Distributed Framework for Emerging AI Applications cs.DC · 2017 · author #11
  56. A Berkeley View of Systems Challenges for AI cs.AI · 2017 · author #1
  57. DDCO: Discovery of Deep Continuous Options for Robot Learning from Demonstrations cs.RO · 2017 · author #3
  58. Multi-Level Discovery of Deep Options cs.LG · 2017 · author #3
  59. Real-Time Machine Learning: The Missing Pieces cs.DC · 2017 · author #10
  60. Occupy the Cloud: Distributed Computing for the 99% cs.DC · 2017 · author #4
  61. Clipper: A Low-Latency Online Prediction Serving System cs.DC · 2016 · author #6
  62. Fast and Accurate Performance Analysis of LTE Radio Access Networks cs.DC · 2016 · author #2
  63. SparkNet: Training Deep Networks in Spark stat.ML · 2015 · author #3
  64. Asynchronous Complex Analytics in a Distributed Dataflow Architecture cs.DB · 2015 · author #7
  65. GraphX: Unifying Data-Parallel and Graph-Parallel Analytics cs.DB · 2014 · author #6
  66. Coordination Avoidance in Database Systems (Extended Version) cs.DB · 2014 · author #6
  67. Highly Available Transactions: Virtues and Limitations (Extended Version) cs.DB · 2013 · author #6
  68. Shark: SQL and Rich Analytics at Scale cs.DB · 2012 · author #6
  69. Probabilistically Bounded Staleness for Practical Partial Quorums cs.DB · 2012 · author #5
  70. BlinkDB: Queries with Bounded Errors and Bounded Response Times on Very Large Data cs.DB · 2012 · author #5
  71. Faster and More Accurate Sequence Alignment with SNAP cs.DS · 2011 · author #7

Mentions

  • 2606.31711 #8 · arxiv_oai · confidence 0.70 Ion Stoica
  • 2606.26429 #9 · arxiv_oai · confidence 0.70 Ion Stoica
  • 2606.21633 #5 · arxiv_oai · confidence 0.70 Ion Stoica
  • 2606.19419 #16 · arxiv_oai · confidence 0.70 Ion Stoica
  • 2604.18543 #3 · arxiv_oai · confidence 0.70 Ion Stoica
  • 2411.16102 #8 · arxiv_oai · confidence 0.70 Ion Stoica
  • 2512.04123 #23 · arxiv_oai · confidence 0.70 Ion Stoica
  • 1511.06051 #3 · backfill · confidence 0.70 Ion Stoica
  • 1510.07092 #7 · backfill · confidence 0.70 Ion Stoica
  • 2606.01286 #12 · arxiv_oai · confidence 0.70 Ion Stoica
  • 2606.00866 #8 · arxiv_oai · confidence 0.70 Ion Stoica
  • 1402.2394 #6 · backfill · confidence 0.70 Ion Stoica
  • 1402.2237 #6 · backfill · confidence 0.70 Ion Stoica
  • 2603.23971 #4 · arxiv_oai · confidence 0.70 Ion Stoica
  • 2602.01518 #4 · arxiv_oai · confidence 0.70 Ion Stoica
  • 2511.02230 #10 · arxiv_oai · confidence 0.70 Ion Stoica
  • 2510.05688 #8 · arxiv_oai · confidence 0.70 Ion Stoica
  • 2605.24168 #6 · arxiv_oai · confidence 0.70 Ion Stoica
  • 2605.24096 #11 · arxiv_oai · confidence 0.70 Ion Stoica
  • 1302.0309 #6 · backfill · confidence 0.70 Ion Stoica
  • 1211.6176 #6 · backfill · confidence 0.70 Ion Stoica
  • 2605.23109 #12 · arxiv_oai · confidence 0.70 Ion Stoica
  • 1204.6082 #5 · backfill · confidence 0.70 Ion Stoica
  • 1203.5485 #5 · backfill · confidence 0.70 Ion Stoica
  • 1111.5572 #7 · backfill · confidence 0.70 Ion Stoica
  • 2605.19633 #10 · arxiv_oai · confidence 0.70 Ion Stoica
  • 2605.15565 #8 · arxiv_oai · confidence 0.70 Ion Stoica
  • 2410.12784 #8 · arxiv_oai · confidence 0.70 Ion Stoica
  • 2406.11939 #8 · arxiv_oai · confidence 0.70 Ion Stoica

Frequent Coauthors