pith. sign in

arxiv: 1707.02639 · v1 · pith:JFITWMZKnew · submitted 2017-07-09 · 💻 cs.DC

Towards a Comprehensive Framework for Telemetry Data in HPC Environments

classification 💻 cs.DC
keywords dataapplicationsframeworktelemetryapplicationdevelopmentadaptiveadoption
0
0 comments X
read the original abstract

Current HPC platforms do not provide the infrastructure, interfaces and conceptual models to collect, store, analyze, and access such data. Today, applications depend on application and platform specific techniques for collecting telemetry data; introducing significant development overheads that inhibit portability and mobility. The development and adoption of adaptive, context-aware strategies is thereby impaired. To facilitate 2nd generation applications, more efficient application development, and swift adoption of adaptive applications in production, a comprehensive framework for telemetry data management must be provided by future HPC systems and services. We introduce a conceptual model and a software framework to collect, store, analyze, and exploit streams of telemetry data generated by HPC systems and their applications. We show how this framework can be integrated with HPC platform architectures and how it enables common application execution strategies.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.