[hpc-announce] [Call for Participation] MODA24 at ISC 2024
Thomas Jakobsche
thomas.jakobsche at unibas.ch
Sun May 12 14:59:55 CDT 2024
[We apologize if you receive multiple copies of this email.]
5th ISC HPC International Workshop on Monitoring & Operational Data Analytics (MODA24)
May 16, 2024, Hamburg, Germany
Website: https://urldefense.us/v3/__https://moda.dmi.unibas.ch__;!!G_uCfscf7eWS!ft_oH0KzKmvh2QdZ1KNGg9H6c076KiAVyWHj1HY06B_bwgynAWpTEFHWSlZ5oMH9tplOxuVsh18pK9tIG28NdaaO-e8r4UNidpLs$
Twitter: https://urldefense.us/v3/__https://twitter.com/moda_hpc__;!!G_uCfscf7eWS!ft_oH0KzKmvh2QdZ1KNGg9H6c076KiAVyWHj1HY06B_bwgynAWpTEFHWSlZ5oMH9tplOxuVsh18pK9tIG28NdaaO-e8r4WzDGjno$ (@moda_hpc)
Following the successful previous editions initiated at ISC HPC, we are inviting contributions to the 5th ISC HPC International Workshop on Monitoring and Operational Data Analytics (MODA24). The goal of the MODA workshop series is to provide a venue for sharing insights into current trends in MODA for HPC systems and data centers, identify potential gaps, and offer an outlook into the future of the involved fields: high performance computing, databases, machine learning, and possible solutions that can contribute to the codesign and procurement of future computing and data processing systems.
=== Goals ===
While MODA is already a common practice at various HPC and data centers, each site adopts a different, insular approach, rarely adopted in production environments, and mostly limited to the visualization of the system and building infrastructure metrics for health check purposes. In this regard, we observe a gap between the collection of operational data and its meaningful and effective analysis and exploitation, which prevents closing the feedback loop between the monitored HPC and data processing system, its operation, and its end-users.
Under the above premises, the goals of the MODA 2024 workshop are:
1) Gather and share knowledge and establish a common ground within the international community with respect to best practices in monitoring and operational data analytics.
2) Discuss future strategies and alternatives for MODA, potentially improving existing solutions and envisioning a common baseline approach in computing and data centers.
3) Establish a debate on the usefulness and applicability of AI/ML techniques on collected operational data for optimizing the operation and energy-consumption of production systems (for practices such as predictive maintenance, runtime optimization, optimal and adaptive resource allocation and scheduling).
=== Program (All times are CEST) ===
14:00 – 14:05 Opening
14:05 – 15:00 Keynote Presentation:
Operating High Performance Computers in remote and restrictive environments
Brad Evans (Pawsey Supercomputing Research Centre, Australia)
15:00 – 15:35 Full Paper Presentation:
An Exascale Slurm Testing and Evaluation Environment Utilizing Generated DAG Workloads
Laslo Hunhold and Stefan Wesner (University of Cologne, Germany)
15:35 – 16:00 Short Paper Presentation:
Challenges for monitoring and data analytics in a leadership public data repository
Patrick Widener, Alex May, Tatiyanna Singleton and Olga Kuchar (Oak Ridge National Laboratory, USA)
16:00 – 16:30 Coffee Break
16:30 – 16:45 Lightning Talk:
EMOI: CSCS Extensible Monitoring and Observability Infrastructure
Jean-Guillaume Piccinali and Massimo Benini (Swiss National Supercomputing Centre, Switzerland)
16:45 – 17:00 Lightning Talk:
How well can we predict two most important metrics for HPC jobs: runtime and queue time?
Kevin Menear, Dmitry Duplyakin (National Renewable Energy Laboratory, USA) and Kadidia Konate (Lawrence Berkeley National Laboratory, USA)
17:00 – 17:15 Lightning Talk:
Monitoring of Energy and Emissions of HPC Batch Job Using CEEMS
Mahendra Paipuri (Centre National de la Recherche Scientifique, France)
17:15 – 17:30 Lightning Talk:
Next-Generation Data Explanation: Bridging the Gap from Data Collection to Operational Data Analytics
Cary Whitney (National Energy Research Scientific Computing Center, USA), Melissa Romanus, Thomas Davis and Elizabeth Bautista (Lawrence Berkeley National Laboratory, USA)
17:30 – 17:55 Panel Discussion:
MODA in multitenant and federated environments
Utz-Uwe Haus (Hewlett Packard Enterprise, Switzerland)
17:55 – 18:00 Closing
=== Workshop Organizers ===
* Florina Ciorba – University of Basel, Switzerland
* Utz-Uwe Haus – HPE EMEA Research Lab, Switzerland
* Nicolas Lachiche – University of Strasbourg, France
* Martin Schulz – Technische Universität München, Germany
=== Publicity Chair ===
* Thomas Jakobsche – University of Basel, Switzerland
We are looking forward to your participation and to seeing you on May 16, 2024 in Hamburg, Germany.
More information about the hpc-announce
mailing list