archive

Every paper Pith has read. Search by title, abstract, or pith.

89 papers in cs.OS · page 2

cs.DC 2026-04-08 reviewed

CPU-free LLM serving cuts P99 latency up to 8x
Blink: CPU-Free LLM Inference by Delegating the Serving Stack to GPU and SmartNIC

Mohammad Siavashi +4
cs.DC 2026-04-08 reviewed

Client scheduler hits 100% LLM deadlines at 4.2 requests per second
Scheduling the Unschedulable: Taming Black-Box LLM Inference at Scale

Renzhong Yuan +5
cs.DC 2026-04-08 reviewed

Nexus cuts serverless CPU use 44% by offloading I/O from VMs
Nexus: Transparent I/O Offloading for High-Density Serverless Computing

JooYoung Park +6
quant-ph 2026-04-07 reviewed

Scheduler cuts quantum queue times 30-75% at high load
Qurator: Scheduling Hybrid Quantum-Classical Workflows Across Heterogeneous Cloud Providers

Sinan Pehlivanoglu +3
cs.CL 2026-04-06 reviewed

Single GPU trains 120B-parameter models at full precision
MegaTrain: Full Precision Training of 100B+ Parameter Large Language Models on a Single GPU

Zhengqing Yuan +3
cs.OS 2026-04-02 reviewed

Migratable actors on CXL SSDs dodge thermal cliffs
WIO: Upload-Enabled Computational Storage on CXL SSDs

Yiwei Yang +6
cs.OS 2026-03-26 reviewed

Scheduler pointer faults crash FreeRTOS far more often than TCB changes
Experimental Analysis of FreeRTOS Dependability through Targeted Fault Injection Campaigns

Luca Mannella +2
cs.DC 2026-03-16 reviewed

CoGPU shares GPUs spatially with zero token drift
Performance Isolation and Semantic Determinism in Efficient GPU Spatial Sharing

Zhenyuan Yang +3
cs.NI 2026-03-14 reviewed

CATS transport cuts first paint time by 78% in worst-case web load
A Case for CATS: A Conductor-driven Asymmetric Transport Scheme for Semantic Prioritization

Syed Muhammad Aqdas Rizvi
cs.DC 2026-03-12 reviewed

NCCLbpf adds verified eBPF policies to NCCL plugins with 130 ns overhead
NCCLbpf: Verified, Composable Policy Execution for GPU Collective Communication

Yusheng Zheng
cs.CR 2026-03-10 reviewed

Flexible mode switching speeds secure mobile LLM inference 10x
FlexServe: A Fast and Secure LLM Serving System for Mobile Devices with Flexible Resource Isolation

Yinpeng Wu +5
cs.OS 2026-03-08 reviewed

LLM agents run as native POSIX processes
Quine: Realizing LLM Agents as Native POSIX Processes

Hao Ke
cs.DC 2026-03-04 reviewed

Unified objects automate IoT edge-cloud apps with 9 nines availability
EdgeWeaver: Accelerating IoT Application Development Across Edge-Cloud Continuum

Pawissanutt Lertpongrujikorn +3
cs.CR 2026-02-26 reviewed

TEE architecture secures continuous attestation against platform control
A TEE-Based Architecture for Confidential and Dependable Process Attestation in Authorship Verification

David Condrey
cs.LG 2026-02-20 reviewed

Slack-tokenized Transformer meets more real-time deadlines
TempoNet: Slack-Quantized Transformer-Guided Reinforcement Scheduler for Adaptive Deadline-Centric Real-Time Dispatchs

Rong Fu +9
cs.OS 2026-02-17 reviewed

Graph engine keeps semantic state stable at microsecond speeds
The Compute ICE-AGE: Invariant Compute Envelope under Addressable Graph Evolution

R. Jay Martin II
cs.OS 2026-02-12 reviewed

Local generators keep update cost constant as system grows
Bounded Local Generator Classes for Deterministic State Evolution

R. Jay Martin II
cs.DC 2026-02-11 reviewed

Integrated methodology combining hardware modeling targets certifiable multi-core platforms
Interferences within a certifiable design methodology for high-performance multi-core platforms

Mohamed Amine Khelassi (LECA) +11
cs.OS 2026-02-09 reviewed

Equilibria enforces CXL fairness and raises performance 52%
Equilibria: Fair Multi-Tenant CXL Memory Tiering At Scale

Kaiyang Zhao +9
cs.DC 2026-02-09 reviewed

Original papers outperform tutorials for system design mastery
The Computer System Trail

Sushant Kumar Gupta
cs.OS 2026-02-04 reviewed

Host RAM enables single-GPU training of 120B LLMs
Horizon-LM: A RAM-Centric Architecture for LLM Training

Zhengqing Yuan +2
cs.DC 2026-01-15 reviewed

Beta metric delivers 96.5% optimal edge AI performance
Mitigating GIL Bottlenecks in Edge AI Systems

Mridankan Mandal +1
cs.OS 2025-12-20 reviewed

LLM agents finish over 80% of Rust system proofs
VeruSAGE: A Study of Agent-Based Verification for Rust Systems

Chenyuan Yang +4
cs.DC 2025-12-17 reviewed

Data movement bottlenecks sit outside the network core
Reexamining Paradigms of End-to-End Data Movement

Chin Fang +3
cs.CR 2025-12-01 reviewed

CAEC lets secure VMs share memory without encryption
CAEC: Confidential, Attestable, and Efficient Inter-CVM Communication with Arm CCA

Sina Abdollahi +4
cs.OS 2025-11-04 reviewed

KV cache TTL cuts multi-turn agent job times by over 8x
Continuum: Efficient and Robust Multi-Turn LLM Agent Scheduling with KV Cache Time-to-Live

Hanchen Li +9
cs.CR 2025-10-31 reviewed

Sockeye formalizes hardware manuals into provable security models
Sockeye: a language for analyzing hardware documentation

Ben Fiedler +2
cs.OS 2025-09-25 reviewed

NetCAS boosts remote storage speed 174% via dynamic I/O splits
NetCAS: Dynamic Cache and Backend Device Management in Networked Environments

Joon Yong Hwang +2
cs.CR 2025-07-16 reviewed

Tyche turns isolation into a composable cloud primitive
Tyche: Composable Isolation as a Foundation to Manage Trust in the Cloud

Adrien Ghosn +5
cs.AI 2025-06-19 reviewed

Best agents need 2.7-4.3x more steps than humans
OSWorld-Human: Benchmarking the Efficiency of Computer-Use Agents

Reyna Abhyankar +2
cs.OS 2025-03-05 reviewed

90% of Linux radiation failures route through one eMMC path
Where Linux Breaks Under Radiation: A Cross-Architecture Kernel-Level Characterization of Proton-Induced Failures in COTS SoCs

Saad Memon +7
cs.CR 2025-01-08 reviewed

Type-1 hypervisor matches Docker speed with stronger isolation
Goldilocks Isolation: High Performance VMs with Edera

Marina Moore +1
cs.CR 2024-11-15 reviewed

Review yields security framework for software-defined vehicles
Contextualizing Security and Privacy of Software-Defined Vehicles: A Literature Review and Industry Perspectives

Marco De Vincenzi +8
cs.OS 2024-03-31 reviewed

FPGA scheduler lifts fairness 24-98% by adding time and energy rules
THEMIS: Time, Heterogeneity, and Energy Minded Scheduling for Fair Multi-Tenant Use in FPGAs

Emre Karabulut +3
cs.NI 2023-09-25 reviewed

CPU-time budgets isolate tail latency in shared datapaths
Tail Contagion: Sub-microsecond Time Protection in Shared Software Network Datapaths

Matheus Stolet +3
cs.OS 2019-07-27 reviewed

New file system design reduces SSD write amplification without GC
SSDFS: Towards LFS Flash-Friendly File System without GC operation

Viacheslav Dubeyko
cs.PL 2019-07-11 reviewed

Smoosh semantics matches POSIX standard more closely than seven shells
Executable formal semantics for the POSIX shell

Michael Greenberg +1
cs.OS 2019-07-07 reviewed

DiOS guarantees identical traces for repeated POSIX program runs
Reproducible Execution of POSIX Programs with DiOS

Petr Ročkai +4
cs.OS 2019-06-29 reviewed

Hardware scheduler delivers 12x speedup on accelerator systems
HTS: A Hardware Task Scheduler for Heterogeneous Systems

Kartik Hegde +2
cs.DS 2019-06-26 reviewed

Lawn timer handles any time range at constant speed
Lawn: an Unbound Low Latency Timer Data Structure for Large Scale, High Throughput Systems

Adam Lev-Libfeld
cs.DC 2019-06-24 reviewed

DMX keeps critical container performance stable as density rises
Container Density Improvements with Dynamic Memory Extension using NAND Flash

Jan S. Rellermeyer +3
cs.OS 2019-06-24 reviewed

TrustZone world switches carry measurable time and energy costs
On The Performance of ARM TrustZone

Julien Amacher +1