archive

Every paper Pith has read. Search by title, abstract, or pith.

1164 papers in cs.DC · page 15

  1. cs.LO 2026-04-03 reviewed
    HistMSO logic expresses 39 of 42 consistency models

    HistMSO: A Logic for Reasoning about Consistency Models with MONA

    Isabelle Coget +1

  2. cs.DC 2026-04-03 reviewed
    Pessimistic sync cuts redundant I/Os in disaggregated KV stores

    CIDER: Boosting Memory-Disaggregated Key-Value Stores with Pessimistic Synchronization

    Yuxuan Du +4

  3. cs.LG 2026-04-03 reviewed
    Fixed gating stabilizes federated averaging of pretrained models

    FedSQ: Optimized Weight Averaging via Fixed Gating

    Cristian Pérez-Corral +5

  4. cs.DC 2026-04-03 reviewed
    Sparsity scores cut multimodal LLM latency 30 percent

    MSAO: Adaptive Modality Sparsity-Aware Offloading with Edge-Cloud Collaboration for Efficient Multimodal LLM Inference

    Zheming Yang +6

  5. cs.DC 2026-04-03 reviewed
    Digital twin optimizes metaverse XR offloading and resources

    Digital Twin-Assisted In-Network and Edge Collaboration for Joint User Association, Task Offloading, and Resource Allocation in the Metaverse

    Ibrahim Aliyu +3

  6. cs.NI 2026-04-03 reviewed
    Temporal gating cuts edge-cloud video costs by up to 60%

    R2E-VID: Two-Stage Robust Routing via Temporal Gating for Elastic Edge-Cloud Video Inference

    Zheming Yang +6

  7. math.OC 2026-04-03 reviewed
    Sketch-based GPU solver handles 5000-asset portfolios in seconds

    Scalable Mean-Variance Portfolio Optimization via Subspace Embeddings and GPU-Friendly Nesterov-Accelerated Projected Gradient

    Yi-Shuai Niu +1

  8. cs.DC 2026-04-03 reviewed
    Heterogeneous memory lets GPUs run large nonlinear simulations

    Accelerating Nonlinear Time-History Analysis with Complex Constitutive Laws via Heterogeneous Memory Management: From 3D Seismic Simulation to Neural Network Training

    Tsuyoshi Ichimura +4

  9. cs.LG 2026-04-03 reviewed
    Communication-free sampling scales GNN training to 2048 GPUs

    Communication-free Sampling and 4D Hybrid Parallelism for Scalable Mini-batch GNN Training

    Cunyang Wei +9

  10. cs.DC 2026-04-02 reviewed
    Cold TLB misses slow small GPU collectives up to 1.4x

    Analyzing Reverse Address Translation Overheads in Multi-GPU Scale-Up Pods

    Amel Fatima +2

  11. cs.DC 2026-04-02 reviewed
    DWDP lifts LLM output speed 8.8% per GPU by skipping rank sync

    DWDP: Distributed Weight Data Parallelism for High-Performance LLM Inference on NVL72

    Wanqian Li +9

  12. cs.DC 2026-04-01 reviewed
    A probabilistic bin-packing method lets cloud schedulers overcommit VMs while bounding…

    Hotspot-Aware Scheduling of Virtual Machines with Overcommitment for Ultimate Utilization in Cloud Datacenters

    Jiaxi Wu +8

  13. cs.DC 2026-04-01 reviewed
    Multi-scale graphs improve microservice latency estimates

    Scene-Aware Latency Estimation for Microservices via Multi-Scale Graph Fusion

    Zhichao Sun +2

  14. cs.DC 2026-03-31 reviewed
    Shared replicas run fine-tuning and inference together on edge GPUs

    CoLLM: Continuous Adaptation for SLO-Aware LLM Serving on Shared GPU Clusters

    Shaoyuan Huang +7

  15. cs.DC 2026-03-31 reviewed
    Multi-agent LLM workflow maps service text to KVI intervals

    KPI2KVI: A Multi Agent Workflow for Calculating Key Value Indicators from Service Descriptions

    Masoud Shokrnezhad +3

  16. cs.CR 2026-03-31 reviewed
    Semantic triggers backdoor federated learning models

    Beyond Corner Patches: Semantics-Aware Backdoor Attack in Federated Learning

    Kavindu Herath +2

  17. cs.DC 2026-03-31 reviewed
    Edge AI cuts sensor energy use via dynamic activation

    An AI-Driven Framework for Energy-Efficient Environmental Monitoring in Smart Cities Using Edge Intelligence

    Yichen Liu +4

  18. cs.SE 2026-03-30 reviewed
    Lumos captures bug provenance automatically for root cause ID

    Wherefore Art Thou? Provenance-Guided Automatic Online Debugging with Lumos

    Jingyuan Chen +5

  19. cs.DC 2026-03-30 reviewed
    GPU-FPGA pairing speeds LLM memory processing 2.2x

    Understand and Accelerate Memory Processing Pipeline for Large Language Model Inference

    Zifan He +3

  20. cs.LG 2026-03-29 reviewed
    Block-wise FL improves multimodal results up to 37.7% under sparse modalities

    BLOSSOM: Block-wise Federated Learning Over Shared and Sparse Observed Modalities

    Pranav M R +3

  21. quant-ph 2026-03-28 reviewed
    Binary thresholds mark quantum advantage in sub-chips of two qubit technologies

    Benchmarking Quantum Computers via Protocols, Comparing Superconducting and Ion-Trap Quantum Technology

    Nitay Mayo +2

  22. cs.AR 2026-03-28 reviewed
    Lossless compressor speeds Ascend NPU inference up to 6.3 times

    ENEC: A Lossless AI Model Compression Method Enabling Fast Inference on Ascend NPUs

    Jinwu Yang +19

  23. cs.DC 2026-03-27 reviewed
    Tiny fingerprint selects best cache policy for shifting workloads

    SCION: Size-aware Policy Orchestration for Nonstationary Object Caches (Long Paper Version)

    Qizhi Wang

  24. cs.DC 2026-03-27 reviewed
    Scheduler cuts multimodal LLM first-token latency by 54%

    TCM-Serve: Modality-aware Scheduling for Multimodal Large Language Model Inference

    Konstantinos Papaioannou +1

  25. cs.AR 2026-03-27 reviewed
    NoC with direct core access speeds ML collectives 5.3x

    A Lightweight High-Throughput Collective-Capable NoC for Large-Scale ML Accelerators

    Luca Colagrande +5

  26. cs.NE 2026-03-27 reviewed
    Network evolves protocols from intents into bytecode at runtime

    DarwinNet: An Evolutionary Network Architecture for Agent-Driven Protocol Synthesis

    Jinliang Xu +1

  27. cs.DC 2026-03-26 reviewed
    Erasure coding reduces LLM checkpoint latency 2.7x

    GhostServe: A Lightweight Checkpointing System in the Shadow for Fault-Tolerant LLM Serving

    Shakya Jayakody +3

  28. cs.DC 2026-03-26 reviewed
    Data profiling cuts multimodal LLM training time up to 3.6x

    DFLOP: A Data-driven Framework for Multimodal LLM Training Pipeline Optimization

    Hyeonjun An +11

  29. physics.plasm-ph 2026-03-25 reviewed
    Hybrid MPI+OpenMP scales PIC Monte Carlo to 16,000 GPUs

    Multi-GPU Hybrid Particle-in-Cell Monte Carlo Simulations for Exascale Computing Systems

    Jeremy J. Williams +15

  30. cs.DC 2026-03-25 reviewed
    GPU framework speeds up graph edit distance by orders of magnitude

    Efficient Accelerated Graph Edit Distance Computation on GPU

    Adel Dabah +1

  31. eess.AS 2026-03-24 reviewed
    Lightning V2 achieves 4x lower TTS cost on Tenstorrent vs L40S

    Rewriting TTS Inference Economics: Lightning V2 on Tenstorrent Achieves 4x Lower Cost Than NVIDIA L40S

    Ranjith M. S. +2

  32. cs.AI 2026-03-23 reviewed
    Reasoning provenance cannot be recovered from state checkpoints alone

    Reasoning Provenance for Autonomous AI Agents: Structured Behavioral Analytics Beyond State Checkpoints and Execution Traces

    Neelmani Vispute +1

  33. cs.DC 2026-03-22 reviewed
    Product graph proves livelock freedom for all ring sizes

    Practical Livelock Analysis in Parameterized Unidirectional Rings

    Aly Farahat

  34. cs.LG 2026-03-22 reviewed
    WRP matrix maps LLM optimizations to 3x3 grid

    The Workload-Router-Pool Architecture for LLM Inference Optimization: A Vision Paper from the vLLM Semantic Router Project

    Huamin Chen +7

  35. cs.DC 2026-03-21 reviewed
    RoboECC splits VLA models for 3.28x edge-cloud speedup

    RoboECC: Multi-Factor-Aware Edge-Cloud Collaborative Deployment for VLA Models

    Zihao Zheng +8

  36. cs.DC 2026-03-21 reviewed
    Updated Amdahl sets specialization threshold at 1-1/R

    Modernizing Amdahl's Law: How AI Scaling Laws Shape Computer Architecture

    Chien-Ping Lu

  37. cs.DC 2026-03-20 reviewed
    Text-only supervision cannot enforce model honesty

    Epistemic Observability in Language Models

    Tony Mason +1

  38. cs.DC 2026-03-19 reviewed
    Edge YOLO models keep hardware metrics stable under input faults

    Hardware Utilization and Inference Performance of Edge Object Detection Under Fault Injection

    Faezeh Pasandideh +2

  39. cs.DC 2026-03-19 reviewed
    YOLO edge inference holds steady hardware metrics under faults

    Hardware Utilization and Inference Performance of Edge Object Detection Under Fault Injection

    Faezeh Pasandideh +2

  40. cs.AI 2026-03-18 reviewed
    Training memory bounded to twice inference for geometric AI

    Adaptive Domain Models: Bayesian Evolution, Warm Rotation, and Principled Training for Geometric and Neuromorphic AI

    Houston Haynes

  41. cs.DC 2026-03-18 reviewed
    Tokens per watt halves when context window doubles

    The 1/W Law: An Analytical Study of Context-Length Routing Topology and GPU Generation Gains for LLM Inference Energy Efficiency

    Huamin Chen +5

  42. cs.DC 2026-03-17 reviewed
    Structural monitoring signals catch quiet GPU detachments early

    When GPUs Fail Quietly: Observability-Aware Early Warning Beyond Numeric Telemetry

    Michael Bidollahkhani +2

  43. cs.DC 2026-03-16 reviewed
    Edge agents orchestrate smart homes with MQTT and Git

    HearthNet: Edge Multi-Agent Orchestration for Smart Homes

    Zhonghao Zhan +3

  44. cs.DC 2026-03-16 reviewed
    CoGPU shares GPUs spatially with zero token drift

    Performance Isolation and Semantic Determinism in Efficient GPU Spatial Sharing

    Zhenyuan Yang +3

  45. quant-ph 2026-03-16 reviewed
    Twin-field QKD secures blockchain with linear scaling

    Security-enhanced Blockchain with Twin-Field Quantum Key Distribution: A Physical Layer enabled Architecture

    Xuan Li +1

  46. cs.DC 2026-03-15 reviewed
    Tezos protocol embeds native liquid staking

    Canonical LST: A Protocol-Native Liquid Staking Solution for Tezos

    Mathias Bourgoin +7

  47. cs.DC 2026-03-15 reviewed
    DCGen builds datacenter models with IT, power, and cooling

    DCGen 1.1 Technical Report: Generating Datacenter Configurations (including IT, Power, Cooling)

    Wedan Emmanuel Gnibga +1

  48. cs.GL 2026-03-14 reviewed
    First CS research paper written entirely in Telugu

    On the First Computer Science Research Paper in an Indian Language and the Future of Science in Indian Languages

    Siddhartha Visveswara Jayanti

  49. cs.NI 2026-03-14 reviewed
    CATS transport cuts first paint time by 78% in worst-case web load

    A Case for CATS: A Conductor-driven Asymmetric Transport Scheme for Semantic Prioritization

    Syed Muhammad Aqdas Rizvi

  50. cs.DC 2026-03-14 reviewed
    Calibrated microgrid simulations match real node power to R^2 of 0.95

    Calibrating Microgrid Simulations for Energy-Aware Computing Systems

    Marvin Steinke