pith. sign in

archive

Every paper Pith has read. Search by title, abstract, or pith.

1164 papers in cs.DC · page 3

  1. cs.DC 2026-05-16 reviewed
    Coding plus sketching cuts distributed ML runtime

    Approximate Distributed Coded Computing: Polynomial Codes and Randomized Sketching

    Neophytos Charalambides +1

  2. cs.DC 2026-05-15 reviewed
    HexAGenT cuts required SLO scale by 20% for agentic LLM workflows

    HexAGenT: Efficient Agentic LLM Serving via Workflow- and Heterogeneity-Aware Scheduling

    You Peng +7

  3. cs.DC 2026-05-15 reviewed
    BF16 tensor cores outperform native FP32 SGEMM in speed and accuracy

    Exceeding the Numerical and Performance Characteristics of IEEE-754 SGEMM with BFloat16 Tensor Cores on GPUs for Scientific Computing

    Harun Bayraktar +11

  4. cs.DC 2026-05-15 reviewed
    Datacenters should plan for deployable AI power capacity over time

    Designing Datacenter Power Delivery Hierarchies for the AI Era

    Grant Wilkins +4

  5. cs.DC 2026-05-15 reviewed
    Runtime system makes second-order optimizers work for 7B LLMs

    Runtime-Orchestrated Second-Order Optimization for Scalable LLM Training

    Yishun Lu +3

  6. cs.DC 2026-05-15 reviewed
    GPU system samples causal walks on billion-edge streams in real time

    A GPU Accelerated Temporal Window-Based Random Walk Sampler

    Md Ashfaq Salehin +2

  7. cs.CR 2026-05-15 reviewed
    Manufacturing ransomware recovery goes beyond backups

    From Backup Restoration to Minimum Viable Factory Recovery: A Systematization of Ransomware Recovery in Manufacturing Systems

    Chun Yin Chiu

  8. cs.CR 2026-05-15 reviewed
    Diffusion model poisons FL data more stealthily than GANs

    PCDM: A Diffusion-Based Data Poisoning Attack Against Federated Learning Systems

    Wei Sun +6

  9. cs.DC 2026-05-15 reviewed
    One GPU runs DG ocean model at speed of 1500 CPU cores

    An efficient multi-GPU implementation for the Discontinuous Galerkin ocean model SLIM

    Miguel De Le Court +5

  10. cs.DC 2026-05-15 reviewed
    Parallel code speeds star-M SVD compression for big datasets

    High-Performance Star-M SVD for Big Data Compression

    Md Taufique Hussain +5

  11. cs.DC 2026-05-15 reviewed
    SNN latency increases 47 times at half a CPU core

    Evaluating Container Orchestration for Neuromorphic Workloads in Virtual Edge Environments

    Huyen Pham +1

  12. cs.DC 2026-05-15 reviewed
    Online delay tracker holds container SLA violations under 5%

    ADAPT: A Self-Calibrating Proactive Autoscaler for Container Orchestration

    Himanshu Singh Baghel

  13. cs.DC 2026-05-15 reviewed
    Deep RL scheduler nears optimal for edge serverless containers

    Scale: Deep Reinforcement Learning for Container Scheduling in Serverless Edge Computing

    Chen Chen +4

  14. cs.DC 2026-05-15 reviewed
    ParamSpMM adapts SpMM for GNNs to gain 1.92x average speedup

    ParamSpMM: Adaptive and Efficient Sparse Matrix-Matrix Multiplication on GPUs for GNNs

    Lixing Zhang +4

  15. cs.DC 2026-05-15 reviewed
    Emulate 8192-GPU training on a few GPUs

    A Few GPUs, A Whole Lotta Scale: Faithful LLM Training Emulation with PrismLLM

    Shaoke Xi +13

  16. cs.LG 2026-05-15 reviewed
    One client inflates its attribution score in distributed ML training

    On the Fragility of Data Attribution When Learning Is Distributed

    Xian Gao +3

  17. cs.DC 2026-05-14 reviewed
    OSDF joins U.S

    Open Science Data Federation -- operation and monitoring

    Fabio Andrijauskas +2

  18. cs.DC 2026-05-14 reviewed
    OSDF integration gives BBSO data reliable global access

    Using the Open Science Data Federation for data distribution: Big Bear Solar Observatory use case

    Sydney Montiel +2

  19. cs.DC 2026-05-14 reviewed
    3D satellite clusters scale nodes with cube of radius ratio

    Designing Dense Satellite Clusters for Distributed Space-based Datacenters

    Jules P\'enot +1

  20. cs.AI 2026-05-14 reviewed
    APWA scales agent workflows by parallelizing non-communicating subproblems

    APWA: A Distributed Architecture for Parallelizable Agentic Workflows

    Evan Rose +4

  21. quant-ph 2026-05-14 reviewed
    Cache reorganization lifts GPU speedups for 28-qubit simulations on laptops

    Accelerating State-Vector Quantum Simulation on Integrated GPUs via Cache Locality Optimization: A Cross-Architecture Evaluation

    Gabriel Fernandes Thomaz +4

  22. cs.PL 2026-05-14 reviewed
    Mat2Boundary turns boundary conditions into SpMV for PDE solvers

    Mat2Boundary: Treating User-Defined Boundary Condition as SpMV for Distributed PDE Solvers on Block-Structured Grids

    Yanzheng Cai +8

  23. cs.DC 2026-05-14 reviewed
    Wi-Fi logs build hierarchical mobility models with lower complexity

    Analysis of wireless network access logs for a hierarchical characterization of user mobility

    Francisco Talavera +2

  24. cs.GR 2026-05-14 reviewed
    Unified GPU solver gives exact gradients for stiff heterogeneous soft bodies

    DiffPhD: A Unified Differentiable Solver for Projective Heterogeneous Materials in Elastodynamics with Contact-Rich GPU-Acceleration

    Shih-Yu Lai +11

  25. cs.DC 2026-05-14 reviewed
    Exploration fails above ceil(k/(n-2))-1 deactivations per round

    Semi-Synchronous Exploration in Dynamic Graphs

    Ashish Saxena +3

  26. cs.DC 2026-05-13 reviewed
    Distributed Sumcheck gives statistical zero-knowledge for graph problems

    Distributed Statistical Zero-Knowledge Proofs via Sumcheck

    Benjamin Jauregui +1

  27. cs.LG 2026-05-13 reviewed
    EMA cuts model adaptation costs 15-42% in shifting environments

    EMA: Efficient Model Adaptation for Learning-based Systems

    Daiyang Yu +5

  28. cs.LG 2026-05-13 reviewed
    MinT manages million LoRA policies over shared 1T models

    MinT: Managed Infrastructure for Training and Serving Millions of LLMs

    Mind Lab: Song Cao +60

  29. cs.LG 2026-05-13 reviewed
    Federated fine-tuning matches centralized LLM training on private data

    Towards the Next Frontier of LLMs, Training on Private Data: A Cross-Domain Benchmark for Federated Fine-Tuning

    Daniel M. Jimenez-Gutierrez +5

  30. cs.DC 2026-05-13 reviewed
    Adaptive KV compression speeds disaggregated LLM serving up to 9x

    KVServe: Service-Aware KV Cache Compression for Communication-Efficient Disaggregated LLM Serving

    Zedong Liu +11

  31. cs.CR 2026-05-13 reviewed
    Client committee speeds secure aggregation 4.6x

    DisAgg: Distributed Aggregators for Efficient Secure Aggregation in Federated Learning

    Haaris Mehmood +6

  32. cs.DC 2026-05-13 reviewed
    Multi-agent RL cuts LLM carbon by 33% and water by 43%

    MARLIN: Multi-Agent Game-Theoretic Reinforcement Learning for Sustainable LLM Inference in Cloud Datacenters

    H. Moore +4

  33. cs.DC 2026-05-13 reviewed
    Hybrid method cuts graph scheduling violations 45 percent

    Sustainable Graph Analytics Workload Scheduling with Evolutionary Reinforcement Learning in Edge-Cloud Systems

    P. Ramicetty +7

  34. cs.LG 2026-05-13 reviewed
    Router sends 36% of VLM queries to edge

    INAR-VL: Input-Aware Routing for Edge-Cloud Vision-Language Inference

    Ahmed \v{S}abanovi\'c +2

  35. cs.LG 2026-05-13 reviewed
    Rescaled stepsizes remove bias in async SGD

    Rescaled Asynchronous SGD: Optimal Distributed Optimization under Data and System Heterogeneity

    Ammar Mahran +2

  36. cs.DC 2026-05-13 reviewed
    TurboGR trains 0.2B-param generative recommenders at 54.71% MFU

    TurboGR: An Accelerated Training System for Large-Scale Generative Recommendation

    Huichao Chai +10

  37. cs.AR 2026-05-13 reviewed
    FPGA lock agents boost OLTP throughput 51X over CPUs

    FPGA-Accelerated Lock Management and Transaction Processing: Architecture, Optimization, and Design Space Exploration

    Shien Zhu +1

  38. cs.MA 2026-05-13 reviewed
    One rule unifies voting, proposals and constitutional amendment in metric spaces

    Constitutional Governance in Metric Spaces

    Ehud Shapiro +1

  39. cs.MA 2026-05-13 reviewed
    Metric-space protocol lets communities self-amend constitutions in polynomial time

    Constitutional Governance in Metric Spaces

    Ehud Shapiro +1

  40. cs.GR 2026-05-13 reviewed
    Transformer preconditioner speeds stiff physics 28x

    Hierarchical Transformer Preconditioning for Interactive Physics Simulation

    Carl Osborne +3

  41. cs.GR 2026-05-13 reviewed
    Hierarchical transformer preconditioner reaches 21 fps on stiff Poisson systems

    Hierarchical Transformer Preconditioning for Interactive Physics Simulation

    Carl Osborne +3

  42. cs.NI 2026-05-13 reviewed
    Drone swarms adapt composition to deliver lower latency connectivity

    Swarm Network-as-a-Service (SNaaS)

    Balsam Alkouz +2

  43. cs.DC 2026-05-13 reviewed
    Pipeline overlap speeds cloud-edge LLM inference up to 2.16x

    PipeSD: An Efficient Cloud-Edge Collaborative Pipeline Inference Framework with Speculative Decoding

    Yunhe Han +6

  44. cs.DC 2026-05-13 reviewed
    Pipeline speeds cloud-edge LLM inference 1.16-2.16x

    PipeSD: An Efficient Cloud-Edge Collaborative Pipeline Inference Framework with Speculative Decoding

    Yunhe Han +6

  45. cs.DC 2026-05-13 reviewed
    Heterogeneous solvers up to 32% faster than GPU-only for big matrices

    Comparing the Performance of Heterogeneous Conjugate Gradient and Cholesky Solvers on Various Hardware Using SYCL

    Tim Th\"uring +2

  46. cs.GT 2026-05-12 reviewed
    Dynamic pricing stabilizes mempool volume at target capacity

    Dynamic Transaction Scheduling and Pricing in the Ethereum Mempool

    Fatemeh Fardno +1

  47. cs.DC 2026-05-12 reviewed
    LCL complexity on trees shifts without exact n knowledge

    The Distributed Complexity Landscape on Trees Depends on the Knowledge About the Network Size

    Alkida Balliu +5

    1 Piths
  48. cs.DC 2026-05-12 reviewed
    Overdecomposition supported efficiently on mixed GPGPU clusters

    Efficient and Portable Support for Overdecomposition on Distributed Memory GPGPU Platforms

    Aditya Bhosale +5

  49. cs.LG 2026-05-12 reviewed
    Parallel training lets RNNs learn from sequences over 10,000 steps

    Parallel-in-Time Training of Recurrent Neural Networks for Dynamical Systems Reconstruction

    Florian Hess +2

  50. cs.LG 2026-05-12 reviewed
    Adaptive eviction cuts LLM prefill time 1.4x to 2.7x

    Not All Tokens Are Worth Caching: Learning Semantic-Aware Eviction for LLM Prefix Caches

    Shaoke Fang +5