archive
Every paper Pith has read. Search by title, abstract, or pith.
225 papers in cs.PF · page 5
-
Batch mix delay surges as batch size nears sender count
Anonymity Mixes as (Partial) Assembly Queues: Modeling and Analysis
-
Hermite-like basis shrinks DG stencil to one value plus one derivative
A Hermite-like basis for faster matrix-free evaluation of interior penalty discontinuous Galerkin operators
-
Beam abstraction slows stream queries up to 58x
Quantitative Impact Evaluation of an Abstraction Layer for Data Stream Processing Systems
-
Approximate model evaluates QoS for large Beowulf clusters
Approximate Solution Approach and Performability Evaluation of Large Scale Beowulf Clusters
-
XFLAT runs neutrino oscillation models on Xeon Phi processors
Simulating Nonlinear Neutrino Oscillations on Next-Generation Many-Core Architectures
-
Profiling picks swaps or recomputes to fit 50GB nets on 16GB GPU
Profiling based Out-of-core Hybrid Method for Large Neural Networks
-
SysMART cuts in-store shopping time using IoT devices
A Unified Analysis Approach for Hardware and Software Implementations
-
Simple SPT matches complex schedulers with inexact job sizes
Scheduling With Inexact Job Sizes: The Merits of Shortest Processing Time First
-
Exact algorithm returns every Pareto-optimal workload split for speed and energy
Bi-objective Optimisation of Data-parallel Applications on Heterogeneous Platforms for Performance and Energy via Workload Distribution
-
Link-level and system-level simulators implemented for C-V2X
Methodologies of Link-Level Simulator and System-Level Simulator for C-V2X Communication
-
Convex hull models skip 80% of runtime bounds checks
CHOP: Bypassing Runtime Bounds Checking Through Convex Hull OPtimization
-
E-IOTA cuts random walks in tip selection while keeping security
Metamorphic IOTA
-
Guidelines target bias in benchmarks for model optimization fitting
Guidelines for benchmarking of optimization approaches for fitting mathematical models
-
Spilling registers to shared memory boosts GPU speed 9%
RegDem: Increasing GPU Performance via Shared Memory Register Spilling
-
Loop transformations keep adjoint stencil code parallelizable
Automatic Differentiation for Adjoint Stencil Loops
-
Fast-rate WLAN collection raises measurement success per second
A Fast-rate WLAN Measurement Tool for Improved Miss-rate in Indoor Navigation
-
SDR receiver adds channel estimates to boost single-AP indoor positioning
Fast prototyping of an SDR WLAN 802.11b receiver for an indoor positioning system
-
LabVIEW with DLLs runs real-time GPS receiver on portable hardware
Exploiting Acceleration Features of LabVIEW platform for Real-Time GNSS Software Receiver Optimization
-
MOSIX migration plus DiCOM cuts Open-MPI run times
Open-MPI over MOSIX: paralleled computing in a clustered world
-
Hardware monitors locate Java memory waste at 7% overhead
Pinpointing Performance Inefficiencies in Java
-
GPUs accelerate database processing but leave open challenges
State-of-the-Art on Query & Transaction Processing Acceleration
-
ML cloud services improve when users pick accuracy-speed tiers
One Size Does Not Fit All: Quantifying and Exposing the Accuracy-Latency Trade-off in Machine Learning Cloud Service APIs via Tolerance Tiers
-
Stress-SGX adapts Stress-NG to test SGX enclave loads
Stress-SGX: Load and Stress your Enclaves for Fun and Profit
-
MCM prototypes integrate two Zynq chips and pass 10 Gbps tests
FPGA-based Multi-Chip Module for High-Performance Computing
-
Normalized method compares security across wireless systems
Security Rating Metrics for Distributed Wireless Systems
-
EasyCrash turns 54% of HPC crashes into correct recomputations
EasyCrash: Exploring Non-Volatility of Non-Volatile Memory for High Performance Computing Under Failures
-
Metrics from standard profiling predict near-memory speedups
Platform Independent Software Analysis for Near Memory Computing
-
TrustZone world switches carry measurable time and energy costs
On The Performance of ARM TrustZone
-
Survey compiles retrial queue theory and applications
Retrial Queueing Models: A Survey on Theory and Applications
-
Tight secrecy-rate bounds derived for SM-based indoor VLC
On the Secrecy Rate of Spatial Modulation Based Indoor Visible Light Communications
-
ILP finds optimal bundling numbers for sensor nodes
Optimal Message Bundling with Delay and Synchronization Constraints in Wireless Sensor Networks
-
WSN sync scheme saves 95% energy at microsecond accuracy
A Beaconless Asymmetric Energy-Efficient Time Synchronization Scheme for Resource-Constrained Multi-Hop Wireless Sensor Networks
-
OpenMP could gain user-defined loop schedulers
Toward a Standard Interface for User-Defined Scheduling in OpenMP
-
Neighbor spectrum reuse adds D2D pairs in LTE without cutting primary throughput
Enhancing Spectral Utilization by Maximizing the Reuse in LTE Network