[hpc-announce] [CFP] MODA24 at ISC 2024
Thomas Jakobsche
thomas.jakobsche at unibas.ch
Mon Jan 22 06:19:04 CST 2024
[We apologize if you receive multiple copies of this CFP.]
5th ISC HPC International Workshop on Monitoring & Operational Data Analytics (MODA24)
May 16, 2024, Hamburg, Germany
Website: https://moda.dmi.unibas.ch
Submission: https://easychair.org/my/conference?conf=moda24
Twitter: https://twitter.com/moda_hpc (@moda_hpc)
Following the successful previous editions initiated at ISC HPC, we are inviting contributions to the 5th ISC HPC International Workshop on Monitoring and Operational Data Analytics (MODA24). The goal of the MODA workshop series is to provide a venue for sharing insights into current trends in MODA for HPC systems and data centers, identify potential gaps, and offer an outlook into the future of the involved fields: high performance computing, databases, machine learning, and possible solutions that can contribute to the codesign and procurement of future computing and data processing systems.
=== Important Dates ===
Abstract submission: February 23, 2024 (AoE)
Paper submission: March 01, 2024 (AoE)
Author notification: April 05, 2024
Camera-ready: June 16, 2024
MODA24 workshop: May 16, 2024, Hamburg, Germany
=== Goals ===
While MODA is already a common practice at various HPC and data centers, each site adopts a different, insular approach, rarely adopted in production environments, and mostly limited to the visualization of the system and building infrastructure metrics for health check purposes. In this regard, we observe a gap between the collection of operational data and its meaningful and effective analysis and exploitation, which prevents closing the feedback loop between the monitored HPC and data processing system, its operation, and its end-users.
Under the above premises, the goals of the MODA 2024 workshop are:
1) Gather and share knowledge and establish a common ground within the international community with respect to best practices in monitoring and operational data analytics.
2) Discuss future strategies and alternatives for MODA, potentially improving existing solutions and envisioning a common baseline approach in computing and data centers.
3) Establish a debate on the usefulness and applicability of AI/ML techniques on collected operational data for optimizing the operation and energy-consumption of production systems (for practices such as predictive maintenance, runtime optimization, optimal and adaptive resource allocation and scheduling).
=== Scope ===
We seek novel research ideas that align with the above goals and match (see note below) the scope of the MODA workshop series:
a) Challenges, solutions, and best practices for monitoring systems at HPC and data centers. A significant focus is on operational data collection mechanisms
- covering different system levels, from building infrastructure sensor data to processing-core performance metrics, and
- targeting different end-users, from system administrators and operators to computer scientists, application developers, and computational scientists.
b) Effective strategies for analyzing and interpreting the collected operational data. Such strategies should particularly include (but are not limited to):
- different visualization approaches and
- machine learning-based strategies, potentially inferring knowledge of the system behavior and allowing for the realization of a proactive control loop.
c) Methods, tools, and techniques for automated control of HPC systems. Such contributions may include (but are not limited to):
- use-cases that need autonomy loops and automated control and
- instances of partial autonomy loops that need more automation and control.
Note: New solutions proposed in the context of application performance modeling and/or application performance analysis tools fall outside the scope of the MODA workshop series. Novel contributions in the area of compiler analysis, debugging, programming models, and/or sustainability of scientific software are also considered out of the scope of the workshop.
=== Topics ===
We cordially invite you to submit your contributions to the MODA24 workshop, in the form of full (12 pages), short/work-in-progress (6 pages) papers, and abstracts for lightning talks (1 page), that address but are not limited to:
* Monitoring and operational data analysis challenges and approaches (data collection, storage, visualization, integration into system software, adoption).
* State-of-the-practice method, tools, techniques in monitoring at various HPC and data centers.
* Solutions for monitoring and analysis of operational data deployed productively on large- to extreme-scale systems, with a large number of users.
* Solutions that have proven limitations in terms of quality of the collected data or efficiency of real-time collection of operational data.
* Opportunities and challenges of using machine learning methods for efficient monitoring and analysis of operational data.
* Examples of successful integration of monitoring and data analysis practices into production system software (energy and resource management) and runtime systems (scheduling and resource allocation).
* Explicit gaps between operational data collection, processing, effective analysis, and impactful exploitation; new approaches for closing these gaps for the benefit of improving HPC and data center planning, operations, and research.
* Means to identify (intentional or unintentional) misuse of resources, and methods to mitigate its effects: taking automatic steps to contain the effects of one application/job/user allocation on others, supporting users to identify causes for the misbehavior of their application, linking to intrusion detection. and safe and trusted multitenancy.
* Concepts to integrate MODA into the system codesign at all levels, including dedicated hardware components, middleware features, and tool support that make ‘monitoring and analysis by design and by default’ a viable option without sacrificing performance.
* Examples and challenges of applying FAIR data practices, including sharing of monitoring workflows and tools across sites while ensuring compliance with GDPR regulations, user access agreements, or special operational security requirements.
* Concrete use cases of improvements achieved by applying ML models in HPC operations.
* Suggestions for appropriate data to collect towards an open data set (ODS) (ideally containing anomalies) that captures the execution of a set of representative applications on a representative production HPC system.
* Methodologies to prepare the ODS in view of deploying appropriate ML models
* Use of MODA to tackle the rising challenges of sustainable HPC, mainly at the level of energy efficiency, but also in hardware stability and replacement policies, from application-level to sitewide approaches.
=== Submission and Publication ===
All papers submitted to MODA24 must be original and not simultaneously submitted to another venue. All submissions will be peer-reviewed by the program committee members and accepted papers are expected to be presented during the workshop.
Papers should be submitted through the EasyChair online system at https://easychair.org/my/conference?conf=moda24. Papers are required to be formatted according to the ISC research papers<https://www.isc-hpc.com/research-papers-2021.html> guidelines using LNCS style (see Springer’s website<http://www.springer.com/de/it-informatik/lncs/conference-proceedings-guidelines>):
* Single-column format
* Maximum 6 for short/work-in-progress papers or 12 pages for full papers (including figures and references)
* Maximum 1 page for lightning talk abstracts (including figures and references)
* Use Springer’s LaTeX document class or Word template (see Springer’s Proceedings Guidelines<https://www.springer.com/gp/computer-science/lncs/conference-proceedings-guidelines>).
The workshop chairs reserve the right to desk reject incorrectly formatted papers. Submissions cannot have previously been peer-reviewed and published or simultaneously under another peer review.
The MODA 2024 workshop papers and lightning talk abstracts will be published together with the ISC 2024 post-conference proceedings (published by Springer LNCS), including an abstract of the keynote and invited talks, and a short white paper of the panel session.
=== Workshop Organizers ===
* Florina Ciorba – University of Basel, Switzerland
* Utz-Uwe Haus – HPE EMEA Research Lab, Switzerland
* Nicolas Lachiche – University of Strasbourg, France
* Martin Schulz – Technische Universität München, Germany
=== Publicity Chair ===
* Thomas Jakobsche – University of Basel, Switzerland
We are looking forward to your submissions and to seeing you on May 16, 2024 in Hamburg, Germany.
More information about the hpc-announce
mailing list