pith. sign in

archive

Every paper Pith has read. Search by title, abstract, or pith.

89 papers in cs.OS · page 2

  1. cs.DC 2026-04-08 reviewed
    CPU-free LLM serving cuts P99 latency up to 8x

    Blink: CPU-Free LLM Inference by Delegating the Serving Stack to GPU and SmartNIC

    Mohammad Siavashi +4

  2. cs.DC 2026-04-08 reviewed
    Client scheduler hits 100% LLM deadlines at 4.2 requests per second

    Scheduling the Unschedulable: Taming Black-Box LLM Inference at Scale

    Renzhong Yuan +5

  3. cs.DC 2026-04-08 reviewed
    Nexus cuts serverless CPU use 44% by offloading I/O from VMs

    Nexus: Transparent I/O Offloading for High-Density Serverless Computing

    JooYoung Park +6

  4. quant-ph 2026-04-07 reviewed
    Scheduler cuts quantum queue times 30-75% at high load

    Qurator: Scheduling Hybrid Quantum-Classical Workflows Across Heterogeneous Cloud Providers

    Sinan Pehlivanoglu +3

  5. cs.CL 2026-04-06 reviewed
    Single GPU trains 120B-parameter models at full precision

    MegaTrain: Full Precision Training of 100B+ Parameter Large Language Models on a Single GPU

    Zhengqing Yuan +3

  6. cs.OS 2026-04-02 reviewed
    Migratable actors on CXL SSDs dodge thermal cliffs

    WIO: Upload-Enabled Computational Storage on CXL SSDs

    Yiwei Yang +6

  7. cs.OS 2026-03-26 reviewed
    Scheduler pointer faults crash FreeRTOS far more often than TCB changes

    Experimental Analysis of FreeRTOS Dependability through Targeted Fault Injection Campaigns

    Luca Mannella +2

  8. cs.DC 2026-03-16 reviewed
    CoGPU shares GPUs spatially with zero token drift

    Performance Isolation and Semantic Determinism in Efficient GPU Spatial Sharing

    Zhenyuan Yang +3

  9. cs.NI 2026-03-14 reviewed
    CATS transport cuts first paint time by 78% in worst-case web load

    A Case for CATS: A Conductor-driven Asymmetric Transport Scheme for Semantic Prioritization

    Syed Muhammad Aqdas Rizvi

  10. cs.DC 2026-03-12 reviewed
    NCCLbpf adds verified eBPF policies to NCCL plugins with 130 ns overhead

    NCCLbpf: Verified, Composable Policy Execution for GPU Collective Communication

    Yusheng Zheng

  11. cs.CR 2026-03-10 reviewed
    Flexible mode switching speeds secure mobile LLM inference 10x

    FlexServe: A Fast and Secure LLM Serving System for Mobile Devices with Flexible Resource Isolation

    Yinpeng Wu +5

  12. cs.OS 2026-03-08 reviewed
    LLM agents run as native POSIX processes

    Quine: Realizing LLM Agents as Native POSIX Processes

    Hao Ke

  13. cs.DC 2026-03-04 reviewed
    Unified objects automate IoT edge-cloud apps with 9 nines availability

    EdgeWeaver: Accelerating IoT Application Development Across Edge-Cloud Continuum

    Pawissanutt Lertpongrujikorn +3

  14. cs.CR 2026-02-26 reviewed
    TEE architecture secures continuous attestation against platform control

    A TEE-Based Architecture for Confidential and Dependable Process Attestation in Authorship Verification

    David Condrey

  15. cs.LG 2026-02-20 reviewed
    Slack-tokenized Transformer meets more real-time deadlines

    TempoNet: Slack-Quantized Transformer-Guided Reinforcement Scheduler for Adaptive Deadline-Centric Real-Time Dispatchs

    Rong Fu +9

  16. cs.OS 2026-02-17 reviewed
    Graph engine keeps semantic state stable at microsecond speeds

    The Compute ICE-AGE: Invariant Compute Envelope under Addressable Graph Evolution

    R. Jay Martin II

  17. cs.OS 2026-02-12 reviewed
    Local generators keep update cost constant as system grows

    Bounded Local Generator Classes for Deterministic State Evolution

    R. Jay Martin II

  18. cs.DC 2026-02-11 reviewed
    The paper describes an integrated methodology combining hardware modeling

    Interferences within a certifiable design methodology for high-performance multi-core platforms

    Mohamed Amine Khelassi (LECA) +11

  19. cs.OS 2026-02-09 reviewed
    Equilibria enforces CXL fairness and raises performance 52 percent

    Equilibria: Fair Multi-Tenant CXL Memory Tiering At Scale

    Kaiyang Zhao +9

  20. cs.DC 2026-02-09 reviewed
    Original papers outperform tutorials for system design mastery

    The Computer System Trail

    Sushant Kumar Gupta

  21. cs.OS 2026-02-04 reviewed
    Host RAM enables single-GPU training of 120B LLMs

    Horizon-LM: A RAM-Centric Architecture for LLM Training

    Zhengqing Yuan +2

  22. cs.DC 2026-01-15 reviewed
    Beta metric delivers 96.5% optimal edge AI performance

    Mitigating GIL Bottlenecks in Edge AI Systems

    Mridankan Mandal +1

  23. cs.OS 2025-12-20 reviewed
    LLM agents finish over 80% of Rust system proofs

    VeruSAGE: A Study of Agent-Based Verification for Rust Systems

    Chenyuan Yang +4

  24. cs.DC 2025-12-17 reviewed
    Data movement bottlenecks sit outside the network core

    Reexamining Paradigms of End-to-End Data Movement

    Chin Fang +3

  25. cs.CR 2025-12-01 reviewed
    CAEC lets secure VMs share memory without encryption

    CAEC: Confidential, Attestable, and Efficient Inter-CVM Communication with Arm CCA

    Sina Abdollahi +4

  26. cs.OS 2025-11-04 reviewed
    KV cache TTL cuts multi-turn agent job times by over 8x

    Continuum: Efficient and Robust Multi-Turn LLM Agent Scheduling with KV Cache Time-to-Live

    Hanchen Li +9

  27. cs.CR 2025-10-31 reviewed
    Sockeye formalizes hardware manuals into provable security models

    Sockeye: a language for analyzing hardware documentation

    Ben Fiedler +2

  28. cs.OS 2025-09-25 reviewed
    NetCAS boosts remote storage speed 174% via dynamic I/O splits

    NetCAS: Dynamic Cache and Backend Device Management in Networked Environments

    Joon Yong Hwang +2

  29. cs.CR 2025-07-16 reviewed
    Tyche turns isolation into a composable cloud primitive

    Tyche: Composable Isolation as a Foundation to Manage Trust in the Cloud

    Adrien Ghosn +5

  30. cs.AI 2025-06-19 reviewed
    Best agents need 2.7-4.3x more steps than humans

    OSWorld-Human: Benchmarking the Efficiency of Computer-Use Agents

    Reyna Abhyankar +2

  31. cs.OS 2025-03-05 reviewed
    90% of Linux radiation failures route through one eMMC path

    Where Linux Breaks Under Radiation: A Cross-Architecture Kernel-Level Characterization of Proton-Induced Failures in COTS SoCs

    Saad Memon +7

  32. cs.CR 2025-01-08 reviewed
    Type-1 hypervisor matches Docker speed with stronger isolation

    Goldilocks Isolation: High Performance VMs with Edera

    Marina Moore +1

  33. cs.CR 2024-11-15 reviewed
    Review yields security framework for software-defined vehicles

    Contextualizing Security and Privacy of Software-Defined Vehicles: A Literature Review and Industry Perspectives

    Marco De Vincenzi +8

  34. cs.OS 2024-03-31 reviewed
    FPGA scheduler lifts fairness 24-98% by adding time and energy rules

    THEMIS: Time, Heterogeneity, and Energy Minded Scheduling for Fair Multi-Tenant Use in FPGAs

    Emre Karabulut +3

  35. cs.NI 2023-09-25 reviewed
    CPU-time budgets isolate tail latency in shared datapaths

    Tail Contagion: Sub-microsecond Time Protection in Shared Software Network Datapaths

    Matheus Stolet +3

  36. cs.OS 2019-07-27 reviewed
    New file system design reduces SSD write amplification without GC

    SSDFS: Towards LFS Flash-Friendly File System without GC operation

    Viacheslav Dubeyko

  37. cs.PL 2019-07-11 reviewed
    Smoosh semantics matches POSIX standard more closely than seven shells

    Executable formal semantics for the POSIX shell

    Michael Greenberg +1

  38. cs.OS 2019-07-07 reviewed
    DiOS guarantees identical traces for repeated POSIX program runs

    Reproducible Execution of POSIX Programs with DiOS

    Petr Ro\v{c}kai +4

  39. cs.OS 2019-06-29 reviewed
    Hardware scheduler delivers 12x speedup on accelerator systems

    HTS: A Hardware Task Scheduler for Heterogeneous Systems

    Kartik Hegde +2

  40. cs.DS 2019-06-26 reviewed
    Lawn timer handles any time range at constant speed

    Lawn: an Unbound Low Latency Timer Data Structure for Large Scale, High Throughput Systems

    Adam Lev-Libfeld

  41. cs.DC 2019-06-24 reviewed
    DMX keeps critical container performance stable as density rises

    Container Density Improvements with Dynamic Memory Extension using NAND Flash

    Jan S. Rellermeyer +3

  42. cs.OS 2019-06-24 reviewed
    TrustZone world switches carry measurable time and energy costs

    On The Performance of ARM TrustZone

    Julien Amacher +1