[hpc-announce] [CFP] MODA23 at ISC 2023
thomas.jakobsche at unibas.ch
Thu Jan 19 02:33:06 CST 2023
[We apologize if you receive multiple copies of this CFP.]
4th ISC HPC International Workshop on Monitoring & Operational Data Analytics (MODA23)
May 25, 2023, Hamburg, Germany
Twitter: https://twitter.com/moda_hpc (@moda_hpc)
Following the successful previous editions initiated at ISC HPC, we are inviting contributions to the 4th ISC HPC International Workshop on Monitoring and Operational Data Analytics (MODA23). The goal of the MODA workshop series is to provide a venue for sharing insight into current trends in MODA, identify potential gaps, and offer an outlook into the future of the involved fields: high performance computing, databases, machine learning, and solutions that can contribute to the design and procurement of upcoming Exascale systems.
=== Important Dates ===
Abstract submission: March 24, 2023 (AoE)
Paper submission: March 31, 2023 (AoE)
Author notification: April 21, 2023
Camera-ready: one month after May 25, 2023
MODA23 workshop: May 25, 2023, Hamburg, Germany
=== Goals ===
While MODA is already a common practice at various HPC and data centers, each site adopts a different, insular approach, rarely adopted in production environments, and mostly limited to the visualization of the system and building infrastructure metrics for health check purposes. In this regard, we observe a gap between the collection of operational data and its meaningful and effective analysis and exploitation, which prevents closing the feedback loop between the monitored HPC and data processing system, its operation, and its end-users.
Under the above premises, the goals of the MODA 2023 workshop are:
(1) Gather and share knowledge and establish a common ground within the international community with respect to best practices in monitoring and operational data analytics.
(2) Discuss future strategies and alternatives for MODA, potentially improving existing solutions and envisioning a common baseline approach in computing and data centers.
(3) Establish a debate on the usefulness and applicability of AI/ML techniques on collected operational data for optimizing the operation of production systems (for practices such as predictive maintenance, runtime optimization, optimal and adaptive resource allocation and scheduling).
=== Scope ===
We seek novel research ideas that align with the above goals and match (see note below) the scope of the MODA workshop series:
(a) Challenges, solutions, and best practices for monitoring systems at HPC and data centers. A significant focus is on operational data collection mechanisms
- covering different system levels, from building infrastructure sensor data to processing-core performance metrics, and
- targeting different end-users, from system administrators and operators to computer scientists, application developers, and computational scientists.
(b) Effective strategies for analyzing and interpreting the collected operational data. Such strategies should particularly include (but are not limited to):
- different visualization approaches and
- machine learning-based strategies, potentially inferring knowledge of the system behavior and allowing for the realization of a proactive control loop.
Note: New solutions proposed in the context of application performance modeling and/or application performance analysis tools fall outside the scope of the MODA workshop series. Novel contributions in the area of compiler analysis, debugging, programming models, and/or sustainability of scientific software are also considered out of the scope of the workshop.
We cordially invite you to submit your contributions to the MODA23 workshop, in the form of short (6 pages) and full (12 pages) submissions, that address but are not limited to:
* Monitoring and operational data analysis challenges and approaches (data collection, storage, visualization, integration into system software, adoption)
* State-of-the-practice method, tools, techniques in monitoring at various HPC and data centers
* Solutions for monitoring and analysis of operational data deployed productively on large- to extreme-scale systems, with a large number of users
* Solutions that have proven limitations in terms of quality of the collected data or efficiency of real-time collection of operational data
* Opportunities and challenges of using machine learning methods for efficient monitoring and analysis of operational data
* Examples of successful integration of monitoring and data analysis practices into production system software (energy and resource management) and runtime systems (scheduling and resource allocation)
* Explicit gaps between operational data collection, processing, effective analysis, and impactful exploitation; new approaches for closing these gaps for the benefit of improving HPC and data center planning, operations, and research
* Means to identify (intentional or unintentional) misuse of resources, and methods to mitigate its effects: taking automatic steps to contain the effects of one application/job/user allocation on others, supporting users to identify causes for the misbehavior of their application, linking to intrusion detection. and safe and trusted multitenancy
* Concepts to integrate MODA into the system codesign at all levels, including dedicated hardware components, middleware features, and tool support that make ‘monitoring and analysis by design and by default’ a viable option without sacrificing performance.
* Examples and challenges of applying FAIR data practices, including sharing of monitoring workflows and tools across sites while ensuring compliance with GDPR regulations, user access agreements, or special operational security requirements.
* Concrete use cases of improvements achieved by applying ML models in HPC operations
* Suggestions for appropriate data to collect towards an open data set (ODS) (ideally containing anomalies) that captures the execution of a set of representative applications on a representative production HPC system
* Methodologies to prepare the ODS in view of deploying appropriate ML models
* Use of MODA to tackle the rising challenges of sustainable HPC, mainly at the level of energy efficiency, but also in hardware stability and replacement policies, from application-level to sitewide approaches.
=== Submission and Publication
All papers submitted to MODA23 must be original and not simultaneously submitted to another venue. All submissions will be peer-reviewed by the program committee members and accepted papers are expected to be presented during the workshop.
Papers should be submitted through the EasyChair online system at https://easychair.org/conferences/?conf=moda23. Papers are required to be formatted according to the ISC research papers guidelines using LNCS style (see Springer’s website):
* Single-column format.
* Maximum 6 (six) for short papers or 12 (twelve) pages for full papers (including figures and references).
* Use Springer’s LaTeX document class or Word template (see Springer’s Proceedings Guidelines).
The workshop chairs reserve the right to reject incorrectly formatted papers.
The MODA23 workshop papers will be published together with the ISC HPC 2023 post-conference proceedings (published by Springer LNCS), including an abstract of the keynote and invited talks, and a description of the panel session.
=== Workshop Organizers ===
* Florina Ciorba – University of Basel, Switzerland
* Utz-Uwe Haus – HPE EMEA Research Lab, Switzerland
* Nicolas Lachiche – University of Strasbourg, France
* Martin Schulz – Technische Universität München, Germany
=== Publicity Chair ===
* Thomas Jakobsche – University of Basel, Switzerland
We are looking forward to your submissions and to seeing you on May 25, 2023 in Hamburg, Germany.
More information about the hpc-announce