DAE4HLS enables explicit decoupling of access and execute in HLS to unlock memory-level parallelism, delivering 10-79x speedups for complex workloads on commercial and dynamic HLS tools.
Araya-Polo, J
7 Pith papers cite this work. Polarity classification is still indexing.
citation-role summary
citation-polarity summary
representative citing papers
An adaptive anisotropic composite quadrature strategy combined with refresh-based training narrows the gap between training and reference losses in neural residual minimization for PDEs while using quadrature points more efficiently.
COMPASS formalizes HPC configuration questions as ML tasks on traces, quantifies recommendation trustworthiness, and delivers 65.93% lower average job turnaround time plus 80.93% lower node usage versus prior methods in simulator tests.
A method for adjoint differentiation of stencil loops that preserves their structure and parallelizability via combined AD and loop transformations, released as the PerforAD tool with seismic and CFD test cases.
The thesis introduces a topology-aware tensor-network heuristic called SpinGlassPEPS.jl and thermodynamic metrics to benchmark quantum annealers on Ising problems while accounting for dissipation and effective temperature.
LTQ_n admits floor(n/2) completely edge-independent spanning trees via a constructive algorithm.
A systematic mapping study of Karma mechanisms that compares applications, structures design parameters, and maps future research directions in non-monetary resource allocation.
citing papers explorer
-
DAE4HLS: Exposing Memory-Level Parallelism for High-Level Synthesis using Explicit Decoupling
DAE4HLS enables explicit decoupling of access and execute in HLS to unlock memory-level parallelism, delivering 10-79x speedups for complex workloads on commercial and dynamic HLS tools.
-
Adaptive anisotropic composite quadratures for residual minimisation in neural PDE approximations
An adaptive anisotropic composite quadrature strategy combined with refresh-based training narrows the gap between training and reference losses in neural residual minimization for PDEs while using quadrature points more efficiently.
-
COMPASS: A Unified Decision-Intelligence System for Navigating Performance Trade-off in HPC
COMPASS formalizes HPC configuration questions as ML tasks on traces, quantifies recommendation trustworthiness, and delivers 65.93% lower average job turnaround time plus 80.93% lower node usage versus prior methods in simulator tests.
-
Automatic Differentiation for Adjoint Stencil Loops
A method for adjoint differentiation of stencil loops that preserves their structure and parallelizability via combined AD and loop transformations, released as the PerforAD tool with seismic and CFD test cases.
-
Neural and Tensor Networks in the Study of Quantum Annealing Processors
The thesis introduces a topology-aware tensor-network heuristic called SpinGlassPEPS.jl and thermodynamic metrics to benchmark quantum annealers on Ising problems while accounting for dissipation and effective temperature.
-
On Completely Edge-Independent Spanning Trees in Locally Twisted Cubes
LTQ_n admits floor(n/2) completely edge-independent spanning trees via a constructive algorithm.
-
Resource Allocation with Karma Mechanisms
A systematic mapping study of Karma mechanisms that compares applications, structures design parameters, and maps future research directions in non-monetary resource allocation.