archive

Every paper Pith has read. Search by title, abstract, or pith.

1164 papers in cs.DC · page 3

cs.DC 2026-05-16 reviewed

Coding plus sketching cuts distributed ML runtime
Approximate Distributed Coded Computing: Polynomial Codes and Randomized Sketching

Neophytos Charalambides +1
cs.DC 2026-05-15 reviewed

HexAGenT cuts required SLO scale by 20% for agentic LLM workflows
HexAGenT: Efficient Agentic LLM Serving via Workflow- and Heterogeneity-Aware Scheduling

You Peng +7
cs.DC 2026-05-15 reviewed

BF16 tensor cores outperform native FP32 SGEMM in speed and accuracy
Exceeding the Numerical and Performance Characteristics of IEEE-754 SGEMM with BFloat16 Tensor Cores on GPUs for Scientific Computing

Harun Bayraktar +11
cs.DC 2026-05-15 reviewed

Datacenters should plan for deployable AI power capacity over time
Designing Datacenter Power Delivery Hierarchies for the AI Era

Grant Wilkins +4
cs.DC 2026-05-15 reviewed

Runtime system makes second-order optimizers work for 7B LLMs
Runtime-Orchestrated Second-Order Optimization for Scalable LLM Training

Yishun Lu +3
cs.DC 2026-05-15 reviewed

GPU system samples causal walks on billion-edge streams in real time
A GPU Accelerated Temporal Window-Based Random Walk Sampler

Md Ashfaq Salehin +2
cs.CR 2026-05-15 reviewed

Manufacturing ransomware recovery goes beyond backups
From Backup Restoration to Minimum Viable Factory Recovery: A Systematization of Ransomware Recovery in Manufacturing Systems

Chun Yin Chiu
cs.CR 2026-05-15 reviewed

Diffusion model poisons FL data more stealthily than GANs
PCDM: A Diffusion-Based Data Poisoning Attack Against Federated Learning Systems

Wei Sun +6
cs.DC 2026-05-15 reviewed

One GPU runs DG ocean model at speed of 1500 CPU cores
An efficient multi-GPU implementation for the Discontinuous Galerkin ocean model SLIM

Miguel De Le Court +5
cs.DC 2026-05-15 reviewed

Parallel code speeds star-M SVD compression for big datasets
High-Performance Star-M SVD for Big Data Compression

Md Taufique Hussain +5
cs.DC 2026-05-15 reviewed

SNN latency increases 47 times at half a CPU core
Evaluating Container Orchestration for Neuromorphic Workloads in Virtual Edge Environments

Huyen Pham +1
cs.DC 2026-05-15 reviewed

Online delay tracker holds container SLA violations under 5%
ADAPT: A Self-Calibrating Proactive Autoscaler for Container Orchestration

Himanshu Singh Baghel
cs.DC 2026-05-15 reviewed

Deep RL scheduler nears optimal for edge serverless containers
Scale: Deep Reinforcement Learning for Container Scheduling in Serverless Edge Computing

Chen Chen +4
cs.DC 2026-05-15 reviewed

ParamSpMM adapts SpMM for GNNs to gain 1.92x average speedup
ParamSpMM: Adaptive and Efficient Sparse Matrix-Matrix Multiplication on GPUs for GNNs

Lixing Zhang +4
cs.DC 2026-05-15 reviewed

Emulate 8192-GPU training on a few GPUs
A Few GPUs, A Whole Lotta Scale: Faithful LLM Training Emulation with PrismLLM

Shaoke Xi +13
cs.LG 2026-05-15 reviewed

One client inflates its attribution score in distributed ML training
On the Fragility of Data Attribution When Learning Is Distributed

Xian Gao +3
cs.DC 2026-05-14 reviewed

OSDF joins U.S
Open Science Data Federation -- operation and monitoring

Fabio Andrijauskas +2
cs.DC 2026-05-14 reviewed

OSDF integration gives BBSO data reliable global access
Using the Open Science Data Federation for data distribution: Big Bear Solar Observatory use case

Sydney Montiel +2
cs.DC 2026-05-14 reviewed

3D satellite clusters scale nodes with cube of radius ratio
Designing Dense Satellite Clusters for Distributed Space-based Datacenters

Jules P\'enot +1
cs.AI 2026-05-14 reviewed

APWA scales agent workflows by parallelizing non-communicating subproblems
APWA: A Distributed Architecture for Parallelizable Agentic Workflows

Evan Rose +4
quant-ph 2026-05-14 reviewed

Cache reorganization lifts GPU speedups for 28-qubit simulations on laptops
Accelerating State-Vector Quantum Simulation on Integrated GPUs via Cache Locality Optimization: A Cross-Architecture Evaluation

Gabriel Fernandes Thomaz +4
cs.PL 2026-05-14 reviewed

Mat2Boundary turns boundary conditions into SpMV for PDE solvers
Mat2Boundary: Treating User-Defined Boundary Condition as SpMV for Distributed PDE Solvers on Block-Structured Grids

Yanzheng Cai +8
cs.DC 2026-05-14 reviewed

Wi-Fi logs build hierarchical mobility models with lower complexity
Analysis of wireless network access logs for a hierarchical characterization of user mobility

Francisco Talavera +2
cs.GR 2026-05-14 reviewed

Unified GPU solver gives exact gradients for stiff heterogeneous soft bodies
DiffPhD: A Unified Differentiable Solver for Projective Heterogeneous Materials in Elastodynamics with Contact-Rich GPU-Acceleration

Shih-Yu Lai +11
cs.DC 2026-05-14 reviewed

Exploration fails above ceil(k/(n-2))-1 deactivations per round
Semi-Synchronous Exploration in Dynamic Graphs

Ashish Saxena +3
cs.DC 2026-05-13 reviewed

Distributed Sumcheck gives statistical zero-knowledge for graph problems
Distributed Statistical Zero-Knowledge Proofs via Sumcheck

Benjamin Jauregui +1
cs.LG 2026-05-13 reviewed

EMA cuts model adaptation costs 15-42% in shifting environments
EMA: Efficient Model Adaptation for Learning-based Systems

Daiyang Yu +5
cs.LG 2026-05-13 reviewed

MinT manages million LoRA policies over shared 1T models
MinT: Managed Infrastructure for Training and Serving Millions of LLMs

Mind Lab: Song Cao +60
cs.LG 2026-05-13 reviewed

Federated fine-tuning matches centralized LLM training on private data
Towards the Next Frontier of LLMs, Training on Private Data: A Cross-Domain Benchmark for Federated Fine-Tuning

Daniel M. Jimenez-Gutierrez +5
cs.DC 2026-05-13 reviewed

Adaptive KV compression speeds disaggregated LLM serving up to 9x
KVServe: Service-Aware KV Cache Compression for Communication-Efficient Disaggregated LLM Serving

Zedong Liu +11
cs.CR 2026-05-13 reviewed

Client committee speeds secure aggregation 4.6x
DisAgg: Distributed Aggregators for Efficient Secure Aggregation in Federated Learning

Haaris Mehmood +6
cs.DC 2026-05-13 reviewed

Multi-agent RL cuts LLM carbon by 33% and water by 43%
MARLIN: Multi-Agent Game-Theoretic Reinforcement Learning for Sustainable LLM Inference in Cloud Datacenters

H. Moore +4
cs.DC 2026-05-13 reviewed

Hybrid method cuts graph scheduling violations 45 percent
Sustainable Graph Analytics Workload Scheduling with Evolutionary Reinforcement Learning in Edge-Cloud Systems

P. Ramicetty +7
cs.LG 2026-05-13 reviewed

Router sends 36% of VLM queries to edge
INAR-VL: Input-Aware Routing for Edge-Cloud Vision-Language Inference

Ahmed \v{S}abanovi\'c +2
cs.LG 2026-05-13 reviewed

Rescaled stepsizes remove bias in async SGD
Rescaled Asynchronous SGD: Optimal Distributed Optimization under Data and System Heterogeneity

Ammar Mahran +2
cs.DC 2026-05-13 reviewed

TurboGR trains 0.2B-param generative recommenders at 54.71% MFU
TurboGR: An Accelerated Training System for Large-Scale Generative Recommendation

Huichao Chai +10
cs.AR 2026-05-13 reviewed

FPGA lock agents boost OLTP throughput 51X over CPUs
FPGA-Accelerated Lock Management and Transaction Processing: Architecture, Optimization, and Design Space Exploration

Shien Zhu +1
cs.MA 2026-05-13 reviewed

One rule unifies voting, proposals and constitutional amendment in metric spaces
Constitutional Governance in Metric Spaces

Ehud Shapiro +1
cs.MA 2026-05-13 reviewed

Metric-space protocol lets communities self-amend constitutions in polynomial time
Constitutional Governance in Metric Spaces

Ehud Shapiro +1
cs.GR 2026-05-13 reviewed

Transformer preconditioner speeds stiff physics 28x
Hierarchical Transformer Preconditioning for Interactive Physics Simulation

Carl Osborne +3
cs.GR 2026-05-13 reviewed

Hierarchical transformer preconditioner reaches 21 fps on stiff Poisson systems
Hierarchical Transformer Preconditioning for Interactive Physics Simulation

Carl Osborne +3
cs.NI 2026-05-13 reviewed

Drone swarms adapt composition to deliver lower latency connectivity
Swarm Network-as-a-Service (SNaaS)

Balsam Alkouz +2
cs.DC 2026-05-13 reviewed

Pipeline overlap speeds cloud-edge LLM inference up to 2.16x
PipeSD: An Efficient Cloud-Edge Collaborative Pipeline Inference Framework with Speculative Decoding

Yunhe Han +6
cs.DC 2026-05-13 reviewed

Pipeline speeds cloud-edge LLM inference 1.16-2.16x
PipeSD: An Efficient Cloud-Edge Collaborative Pipeline Inference Framework with Speculative Decoding

Yunhe Han +6
cs.DC 2026-05-13 reviewed

Heterogeneous solvers up to 32% faster than GPU-only for big matrices
Comparing the Performance of Heterogeneous Conjugate Gradient and Cholesky Solvers on Various Hardware Using SYCL

Tim Th\"uring +2
cs.GT 2026-05-12 reviewed

Dynamic pricing stabilizes mempool volume at target capacity
Dynamic Transaction Scheduling and Pricing in the Ethereum Mempool

Fatemeh Fardno +1
cs.DC 2026-05-12 reviewed

LCL complexity on trees shifts without exact n knowledge
The Distributed Complexity Landscape on Trees Depends on the Knowledge About the Network Size

Alkida Balliu +5

1 Piths
cs.DC 2026-05-12 reviewed

Overdecomposition supported efficiently on mixed GPGPU clusters
Efficient and Portable Support for Overdecomposition on Distributed Memory GPGPU Platforms

Aditya Bhosale +5
cs.LG 2026-05-12 reviewed

Parallel training lets RNNs learn from sequences over 10,000 steps
Parallel-in-Time Training of Recurrent Neural Networks for Dynamical Systems Reconstruction

Florian Hess +2
cs.LG 2026-05-12 reviewed

Adaptive eviction cuts LLM prefill time 1.4x to 2.7x
Not All Tokens Are Worth Caching: Learning Semantic-Aware Eviction for LLM Prefix Caches

Shaoke Fang +5